The embodiments disclosed herein are generally directed towards systems and methods for identifying obesity risk for individuals. More specifically, there is a need for systems and methods for analyzing an individual's metabolome to make more precise assessments of its risk for health effects associated with obesity.
Obesity is one of the most widespread problems facing our society's health today. Excessive weight significantly increases an individual's risk for conditions like diabetes mellitus and cardiovascular disease. Worldwide, the prevalence of obesity has nearly tripled since 1975, with 39% of the world's adults being overweight and 13% being obese. The high prevalence can be attributed to increasing consumption of hypercaloric foods and sedentary lifestyles. While BMI (body mass index, kg/(m2)) is generally used to characterize obesity, it is a crude measure that does not capture the complexity of a person's state of health. Because of the importance of having a healthy body, better methods of measuring health are needed, and the underlying biology of obesity needs to be better understood. Previous studies have identified metabolic signatures associated with obesity, including increased levels of branched-chain and aromatic amino acids as well as glycerol and glycerophosphocholines. However, conventional approaches to identifying metabolic signatures of obesity have been limited by a focus on a relatively small number of metabolites, individuals, or phenotypes.
With the advent of artificial intelligence and machine learning techniques, it is now possible to process large stores of health data documenting the natural weight gain and loss of large cohorts of individuals over time to identify metabolome changes that can be predictive indicators of obesity as well as identify metabolic biomarkers for different types of obesity (e.g., biomarkers of so-called healthy obesity, diabetes-prone obesity, and cardiovascular disease-prone obesity, etc.).
The ability to measure phenotypic indicators of people with obesity allows for a better understanding of factors that make people susceptible to (or protected from) obesity, accompanied by better elucidation of the factors that account for variability in success of different obesity treatments. As such, there is a need for techniques and/or assays that can provide more accurate predictions of an individual's obesity state and/or health effects associated with it.
The details of one or more embodiments of the disclosure are set forth in the accompanying drawings/tables and the description below. Other features, objects, and advantages of the disclosure will be apparent from the drawings/tables and detailed description, and from the claims.
Obesity is currently identified using the body mass index (BMI) of an individual. This metric (which is derived from the mass and height of an individual) is imprecise, but it is commonly used for health (and medical) recommendations and clinical decisions. A more precise assessment of a person's obesity may also involve the use of anthropomorphic measurements (e.g., waist circumference, waist-height ratio, waist-hip ratio, etc.), biological (hyper-triglyceridemic waist, metabolites, genomic markers, etc.), and imaging (e.g., CT, MRI, DXA, etc.). There are a number of individual metabolites that are known to be associated with BMI and obesity. These include branched chain amino acids (leucine, isoleucine, valine), aromatic amino acids (tyrosine, tryptophan), uric acid, phospholipids, glucose, mannose, asparagine, glycerol, and glycerophosphocholines. However, these metabolites are not currently considered singly or in aggregate to calculate a person's metabolic BMI (mBMI).
Various aspects and embodiments are disclosed herein for analyzing an individual's metabolome to make more precise assessments of his/her risk for obesity and/or health effects associated with obesity. Specifically, the systems and methods disclosed herein relate to detecting, measuring and analyzing an individual's (blood, plasma, serum or some combination thereof) metabolite signature (metabolite profile) to accurately predict an individual's mBMI. Importantly, this signature can identify individuals whose predicted mBMI can be very different from their conventional BMI (determined using conventional weight and height measurements). That is, individuals with different mBMIs can have very different health outcomes even though they are in the same conventional BMI class.
Using linear regression, the levels of an initial set of 901 distinct metabolites were compared to the conventional BMIs of overlapping sets of unrelated individuals in a first population cohort at three time points spanning a total range of 8-18 years. Of that initial set of metabolites, a first subset of 284 metabolites were significantly associated (p<5.5×10−5) with conventional BMI at one or more time points. From that first subset, 110 metabolites were identified as being significantly associated with BMI at all 3 time points.
These 110 metabolites were further studied in an additional set of unrelated individuals in a second population cohort in order to replicate the initial associations. Of the 84 metabolites that had been measured in both the first and the second cohorts, 83 showed directions of effect that were consistent between the two cohorts, and 49 were shown to be statistically significant replications.
In addition to these 49 strongly associated metabolites, there were an additional 23 metabolites that were statistically significantly associated with BMI in the second population cohort. Overall, 307 metabolites were identified as showing statistically significant associations in at least cohort at one time point, and 49 metabolites with overwhelmingly significant signals, which were used to build a metabolic signature of obesity.
The 307 metabolites that exhibit at least some statistically significant association with obesity are shown in Table 1 below:
While all 307 of these metabolites are associated with obesity, not all were used in the final metabolic signature to determine mBMI due to either missing data or insufficient evidence. Many of the 307 metabolites have strong correlations with one or more of the subset of 49 strongly associated metabolites and would be expected to show significant associations in a larger study and make similar contributions to the final model as their respective proxies in the subset of 49.
As discussed above, 49 metabolites were identified to have consistent and strong signals associated with conventional BMI. In various embodiments, the levels of the 49 metabolites were measured to calculate each person's metabolic BMI (mBMI) using ridge regression in R's glmnet package. The formula for the calculation was identified using machine learning and artificial intelligence techniques and is as follows:
mBMI=sum((coefficient)×(metabolite value))+Intercept Eq. 1:
The 49 metabolites that are the most strongly associated with obesity are shown in Table 2 below in rank of correlation:
Before the coefficients are applied, the metabolite data is rank-ordered and forced to a normal distribution with a mean of 0 and standard deviation of 1. After applying the coefficients, the sum of the 49 metabolite level values for each person is taken, and the intercept is added. This final value is the metabolic BMI (mBMI) or the metabolic signature of obesity.
The various embodiments of the disclosure are provided below:
In some embodiments, the disclosure relates to a method of diagnosing obesity or a disease related thereto in a subject, comprising, obtaining a biological sample from the subject; detecting, in the biological sample, levels or activities of at least 3, 4, 5, 6, 8, 10, 12, 13, 15, 20, 25, 30, 35, 40, 45, 49, 50, 60, 75, 100, 125, 150, 200, 250, 300, or 307 metabolites or derivatives thereof, wherein the metabolites are selected from the metabolites of Table 11, Table 12A, Table 12B, or Tables 1-7, preferably Tables 2-7, especially Table 2 or Tables 4-7; calculating a metabolomic body mass index (mBMI) value for the subject based on the detection, wherein the metabolites of the Tables are listed in the order of relative correlation to the subject's calculated mBMI value; and diagnosing the subject as having obesity if the mBMI value of the subject is modulated compared to a reference standard.
In some embodiments, the disclosure relates to a method of diagnosing obesity or a disease related thereto in a subject, comprising, obtaining a biological sample from the subject; detecting, in the biological sample, levels or activities of at least 3, 4, 5, 6, 8, 10, 12, 13, 15, 20, 25, 30, 35, 40, 45, 49, 50, 60, 75, 100, 125, 150, 200, 250, 300, or 307 metabolites or derivatives thereof, wherein the metabolites are selected from the metabolites of Table 1 or Table 2; calculating a metabolomic body mass index (mBMI) value for the subject based on the detection, wherein the metabolites of the Tables are listed in order of effect on the mBMI value or the order of relative correlation to the subject's calculated mBMI value; and diagnosing the subject as having obesity if the mBMI value of the subject is modulated compared to a reference standard.
In some embodiments, the disclosure relates to a method of diagnosing obesity or a disease related thereto in a subject in accordance with the foregoing, wherein the biological sample comprises a blood sample (e.g., whole blood, plasma, serum, or a combination thereof).
In some embodiments, the disclosure relates to a method of diagnosing obesity or a disease related thereto in a subject in accordance with the foregoing, wherein the levels and/or activities of the metabolites or derivatives thereof is determined using a chemical analytical method selected from the group consisting of HPLC, thin layer chromatography (TLC), electrochemical analysis, Mass Spectroscopy (MS), refractive index spectroscopy (RI), Ultra-Violet spectroscopy (UV), fluorescent analysis, radiochemical analysis, Near-Infra Red spectroscopy (Near-IR), Nuclear Magnetic Resonance spectroscopy (NMR), fluorescence spectroscopy, dual polarization interferometry, computational methods, Light Scattering analysis (LS), gas chromatography (GC), GC coupled with MS, and direct injection (DI) coupled with LC-MS/MS or a combination thereof.
In some embodiments, the disclosure relates to a method of diagnosing obesity or a disease related thereto in a subject in accordance with the foregoing, wherein the disease related to obesity is selected from coronary artery disease, hypertension, stroke, peripheral vascular disease, insulin resistance, glucose intolerance, diabetes mellitus, hyperglycemia, hyperlipidemia, hypercholesteremia, hypertriglyceridemia, hyperinsulinemia, atherosclerosis, cellular proliferation and endothelial dysfunction, diabetic dyslipidemia, lipodystrophy and metabolic syndrome, type II diabetes, diabetic complications including diabetic neuropathy, nephropathy, retinopathy or cataracts, heart failure, inflammation, thrombosis, congestive heart failure, asthmatic or pulmonary disease related to obesity, and cardiovascular disease related to obesity or a combination thereof.
In some embodiments, the disclosure relates to a method of diagnosing obesity or a disease related thereto in a subject in accordance with the foregoing, wherein the derivative of the metabolite is selected from salts, amides, esters, enol ethers, enol esters, acetals, ketals, acids, bases, solvates, hydrates, and polymorphs or a combination thereof.
In some embodiments, the disclosure relates to a method of diagnosing obesity or a disease related thereto in a subject in accordance with the foregoing, wherein the modulation of mBMI comprises an increase or a decrease in mBMI compared to a reference standard. Preferably, if the subject's mBMI is increased compared to a reference standard, then the subject is diagnosed as having obesity with metabolic repercussions (e.g., predictive of metabolic syndrome and cardiovascular risk). Particularly, if mBMI>threshold obesity BMI of 30, then the subject is diagnosed as having obesity with severe metabolic repercussions.
In some embodiments, the disclosure relates to a method of diagnosing obesity or a disease related thereto in a subject, comprising, obtaining a biological sample from the subject; detecting, in the biological sample, levels or activities of at least 3, 4, 5, 6, 8, 10, 12, 13, 15, 20, 25, 30, 35, 40, 45, 49, 50, 60, 75, 100, 125, 150, 200, 250, 300, or 307 metabolites or derivatives thereof, wherein the metabolites are selected from the metabolites of Table 11, Table 12A, Table 12B, or Tables 1-7, preferably Tables 2-7, especially Table 2 or Tables 4-7; calculating a metabolomic body mass index (mBMI) value for the subject based on the detection, wherein the metabolites of the Tables are listed in the order of relative correlation to (or effect on) the subject's calculated mBMI value; and diagnosing the subject as having obesity if the mBMI value of the subject is modulated compared to a reference standard comprising the subject's BMI. Preferably, if the subject's mBMI is increased compared to the subject's BMI, then the subject is diagnosed as having obesity with metabolic repercussions (e.g., predictive of metabolic syndrome and cardiovascular risk). Particularly, if mBMI>threshold obesity BMI of 30, then the subject is diagnosed as having obesity with severe metabolic repercussions.
In some embodiments, the disclosure relates to a method of diagnosing obesity or a disease related thereto in a subject, comprising, obtaining a biological sample from the subject; detecting, in the biological sample, levels or activities of at least 3, 4, 5, 6, 8, 10, 12, 13, 15, 20, 25, 30, 35, 40, 45, 49, 50, 60, 75, 100, 125, 150, 200, 250, 300, or 307 metabolites or derivatives thereof, wherein the metabolites are selected from the metabolites of Table 11, Table 12A, Table 12B, or Tables 1-7, preferably Tables 2-7, especially Table 2 or Tables 4-7; calculating a metabolomic body mass index (mBMI) value for the subject based on the detection, wherein the metabolites of the Tables are listed in the order of relative correlation to (or effect on) the subject's calculated mBMI value; further determining a secondary parameter selected from blood pressure, waist/hip ratio, android/gynoid ratio, % body fat, % visceral fat, % subcutaneous fat and insulin resistance or a combination thereof; and diagnosing the subject as having obesity if the mBMI value and the level of at least 1, 2, 3, 4, 5, 6 or 7 secondary parameter is increased compared to a reference standard. Particularly, the reference standard comprises a subject whose BMI>30. Under this embodiment, preferably, the method comprises generating a composite score of the mBMI and the secondary parameter and comparing the composite score to a reference standard. Particularly, the reference standard comprises a positive reference standard comprising a composite score of the mBMI and the secondary parameter for an obese subject and/or a negative reference standard comprising a composite score of the mBMI and the secondary parameter for a non-obese or healthy subject.
In some embodiments, the disclosure relates to a method for diagnosis of healthy obesity or unhealthy obesity or a disease related thereto by carrying out the foregoing methods. Preferably healthy obesity comprises a subject whose BMI>threshold obesity BMI of 30 but whose mBMI≤30; and the unhealthy obesity comprises a subject whose BMI≤threshold obesity BMI of 30 but whose mBMI>30.
In some embodiments, the disclosure relates to a method of diagnosing and treating obesity or a disease related thereto in a subject, comprising, (a) detecting, in a biological sample obtained from the subject, levels or activities of at least 3, 4, 5, 6, 8, 10, 12, 13, 15, 20, 25, 30, 35, 40, 45, 49, 50, 60, 75, 100, 125, 150, 200, 250, 300, or 307 metabolites or derivatives thereof, wherein the metabolites are selected from the metabolites of Table 11, Table 12A, Table 12B, or Tables 1-7, preferably Tables 2-7, especially Table 2 or Tables 4-7 and calculating a metabolomic body mass index (mBMI) value for the subject based on the detection, wherein the metabolites of the Tables are listed in the order of relative correlation to (or effect on) the subject's calculated mBMI value; (b) diagnosing subject with obesity if the mBMI value of the subject is modulated compared to a reference standard; and (c) administering an effective amount of a therapy selected from the group consisting of anti-obesity pharmacotherapy, surgery, and lifestyle therapy to the subject diagnosed with obesity. Preferably under this embodiment, if the subject's mBMI is greater than a reference standard, e.g., a threshold obesity BMI of 30, then the subject is diagnosed as having obesity or a disease related thereto with metabolic repercussions (e.g., predictive of metabolic syndrome and cardiovascular risk). Particularly, if mBMI>>threshold obesity BMI of 30, then the subject is diagnosed as having obesity with severe metabolic repercussions.
In some embodiments, the disclosure relates to a method of diagnosing and treating obesity or a disease related thereto in a subject, comprising, (a) detecting levels and/or activities of at least three markers of Table 1 or derivatives thereof in a biological sample obtained from the subject and computing a metabolomic body mass index (mBMI) value for the subject based on the detection, wherein the at least 3 metabolites comprises: urate, 5-methylthioadenosine, and glutamate; (b) diagnosing subject with obesity if the mBMI value of the subject is modulated compared to a reference standard; and (c) administering an effective amount of a therapy selected from the group consisting of anti-obesity pharmacotherapy, surgery, and lifestyle therapy. Preferably under this embodiment, if the subject's mBMI is greater than a reference standard, e.g., a threshold obesity BMI of 30, then the subject is diagnosed as having obesity or a disease related thereto with metabolic repercussions (e.g., predictive of metabolic syndrome and cardiovascular risk). Particularly, if mBMI>>threshold obesity BMI of 30, then the subject is diagnosed as having obesity with severe metabolic repercussions.
In some embodiments, the disclosure relates to a method of diagnosing and treating obesity in a subject, comprising, (a) detecting levels and/or activities of at least three markers of Table 2 or derivatives thereof in a biological sample obtained from the subject and computing a metabolomic body mass index (mBMI) value for the subject based on the detection, wherein the at least 3 metabolites comprises, urate, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1); (b) diagnosing subject with obesity if the mBMI value of the subject is modulated compared to a reference standard; and (c) administering an effective amount of a therapy selected from the group consisting of anti-obesity pharmacotherapy, surgery, and lifestyle therapy. Preferably under this embodiment, if the subject's mBMI is greater than a reference standard, e.g., a threshold obesity BMI of 30, then the subject is diagnosed as having obesity or a disease related thereto with metabolic repercussions (e.g., predictive of metabolic syndrome and cardiovascular risk). Particularly, if mBMI>>threshold obesity BMI of 30, then the subject is diagnosed as having obesity with severe metabolic repercussions.
In some embodiments, the disclosure relates to diagnosing and optionally treating obesity in a subject in accordance with the foregoing methods, comprising further detecting at least one secondary parameter and further optionally detecting at least one genetic parameter. Preferably, the secondary parameter is selected from the group consisting of android/gynoid ratio; total triglycerides; waist/hip ratio; subcutaneous fat; visceral fat; insulin resistance; HDL; percent fat; diastolic blood pressure; systolic blood pressure; total cholesterol; and LDL, or a combination thereof, particularly preferably, android/gynoid ratio; total triglycerides; waist/hip ratio; subcutaneous fat; visceral fat; insulin resistance; and HDL. Preferably, the genetic parameter is selected from genetic variants of melanocortin 4 receptor gene (MC4R) or a lipdystrophy gene selected from zinc metallopeptidase STE24 (ZMPSTE24) gene or the 1-acylglycerol-3-phosphate O-acyltransferase 2 (AGPAT2) gene or lipase E, hormone sensitive type (LIPE) gene or Bernardinelli-Seip congenital lipodystrophy type 2 (BSCL2) gene, or any combination thereof; especially an MC4R variant selected from M292fs, R236C, S180P, A175T, and T11A, but not I170V; and/or a genetic variant of a lipodystrophy gene selected from ZMPSTE24, AGPAT2, LIPE gene, BSCL2, or any combination thereof. In some embodiments, the disclosure relates to a method for screening a test agent for treating obesity, comprising, (a) detecting, in a biological sample obtained from the subject, levels and/or activities of at least 3, 4, 5, 6, 8, 10, 12, 13, 15, 20, 25, 30, 35, 40, 45, 49, 50, 60, 75, 100, 125, 150, 200, 250, 300, or 307 metabolites or derivatives thereof, wherein the metabolites are selected from the metabolites of Table 11, Table 12A, Table 12B, or Tables 1-7, preferably Tables 2-7, especially Table 2 or Tables 4-7 and computing a first metabolomic body mass index (mBMI) value; (b) administering a composition comprising the test agent to the subject; (c) detecting levels and/or activities of the metabolites of step (a) in the biological sample obtained from the subject to compute a second mBMI value; and (d) selecting a test agent if the second mBMI value is modulated compared to the first mBMI value for the subject. Preferably under this embodiment, if the subject's second mBMI is reduced compared to the first mBMI, e.g., to a value below a threshold obesity BMI of 30, then the test agent is selected for treating obesity.
In some embodiments, the disclosure relates to a method for screening a test agent for treating obesity, comprising, (a) detecting levels and/or activities of at least three metabolites of Table 1 or derivatives thereof in a biological sample obtained from the subject to compute a first metabolomic body mass index (mBMI) value, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, 5-methylthioadenosine, and glutamate; (b) administering a composition comprising the test agent to the subject; (c) detecting levels and/or activities of the metabolites of step (a) in the biological sample obtained from the subject to compute a second mBMI value; and (d) selecting a test agent if the second mBMI value is modulated compared to the first mBMI value for the subject.
In some embodiments, the disclosure relates to a method for screening a test agent for treating obesity, comprising, (a) detecting levels and/or activities of at least three metabolites of Table 2 or derivatives thereof in a biological sample obtained from the subject, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1); (b) administering a composition comprising the test agent to the subject; (c) detecting levels and/or activities of the metabolites of step (a) in the biological sample obtained from the subject to compute a second mBMI value; and (d) selecting a test agent if the second mBMI value is modulated compared to the first mBMI value for the subject.
In some embodiments, the disclosure relates to a method for screening a test agent for treating unhealthy or healthy obesity, preferably unhealthy obesity, comprising, (a) detecting levels and/or activities of at least three metabolites of Table 2 or derivatives thereof in a biological sample obtained from the subject, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1); (b) administering a composition comprising the test agent to the subject; (c) detecting levels and/or activities of the metabolites of step (a) in the biological sample obtained from the subject to compute a second mBMI value; and (d) selecting a test agent if the second mBMI value is modulated compared to the first mBMI value for the subject. In some embodiments, the healthy obesity comprises a subject whose BMI>threshold obesity BMI of 30 but whose mBMI≤30; and the unhealthy obesity comprises a subject whose BMI≤threshold obesity BMI of 30 but whose mBMI>30.
In some embodiments, the disclosure relates to a method for screening a test agent for treating unhealthy or healthy obesity, preferably unhealthy obesity, comprising, (a) detecting levels and/or activities of at least three metabolites of Table 2 or derivatives thereof in a biological sample obtained from the subject, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1); (b) administering a composition comprising the test agent to the subject; (c) detecting levels and/or activities of the metabolites of step (a) in the biological sample obtained from the subject to compute a second mBMI value; and (d) selecting a test agent if the second mBMI value is modulated compared to the first mBMI value for the subject, wherein the method further comprises (e) detecting a secondary parameter selected from the group consisting of android/gynoid ratio; total triglycerides; waist/hip ratio; subcutaneous fat; visceral fat; insulin resistance; HDL; percent fat; diastolic blood pressure; systolic blood pressure; total cholesterol; and LDL, or a combination thereof, preferably, android/gynoid ratio; total triglycerides; waist/hip ratio; subcutaneous fat; visceral fat; insulin resistance; and HDL.
In some embodiments, the disclosure relates to a method for screening a test agent for treating unhealthy or healthy obesity, preferably unhealthy obesity, comprising, (a) detecting levels and/or activities of at least three metabolites of Table 2 or derivatives thereof in a biological sample obtained from the subject, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1); (b) administering a composition comprising the test agent to the subject; (c) detecting levels and/or activities of the metabolites of step (a) in the biological sample obtained from the subject to compute a second mBMI value; and (d) selecting a test agent if the second mBMI value is modulated compared to the first mBMI value for the subject, wherein the method further comprises (e) detecting a genetic parameter selected from a rare (MAF<0.01%) coding variant in the melanocortin 4 receptor gene (MC4R), preferably an MC4R variant selected from M292fs, R236C, S180P, A175T, and T11A, but not I170V; genetic variants of lipodystrophy genes selected from ZMPSTE24 gene or AGPAT2 gene or LIPE gene or BSCL2 gene, or any combination thereof; or a combination of a rare coding variant of MC4R gene and a variant of a gene selected from ZMPSTE24, AGPAT2, LIPE and BSCL2.
In some embodiments, the disclosure relates to a computer readable medium comprising computer-executable instructions, which, when executed by a processor, cause the processor to carry out a method or a set of steps for diagnosing obesity in a subject, comprising detecting a metabolite profile in a metabolome dataset received from a subject's sample, wherein the metabolite profile comprises levels or activities of at least 3, 4, 5, 6, 8, 10, 12, 13, 15, 20, 25, 30, 35, 40, 45, 49, 50, 60, 75, 100, 125, 150, 200, 250, 300, or 307 metabolites or derivatives thereof, wherein the metabolites are selected from the metabolites of Table 11, Table 12A, Table 12B, or Tables 1-7, preferably Tables 2-7, especially Table 2 or Tables 4-7; and the computer readable medium comprises machine learning techniques to determine obesity of subject based on the metabolite profile.
In some embodiments, the disclosure relates to a computer readable medium comprising computer-executable instructions, which, when executed by a processor, cause the processor to carry out a method or a set of steps for diagnosing obesity in a subject, comprising detecting a metabolite profile in a metabolome dataset received from a subject's sample, wherein the metabolite profile comprises levels or activities of at least 3 metabolites of Table 1 or derivatives thereof and the computer readable medium comprises machine learning techniques to determine obesity of subject based on the metabolite profile, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, 5-methylthioadenosine, and glutamate.
In some embodiments, the disclosure relates to a computer readable medium comprising computer-executable instructions, which, when executed by a processor, cause the processor to carry out a method or a set of steps for diagnosing obesity in a subject, comprising detecting a metabolite profile in a metabolome dataset received from a subject's sample, wherein the metabolite profile comprises levels or activities of at least three metabolites of Table 2 or derivatives thereof and the computer readable medium comprises machine learning techniques to determine obesity of subject based on the metabolite profile, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1).
Preferably, in the foregoing embodiments, the computer readable medium comprising computer-executable instructions, comprises an algorithm that is trained with a compendium of metabolite profiles each of which are associated with obesity and the algorithm computes the predictive power of each metabolite using a rigorous mathematical algorithm.
In some embodiments, the disclosure relates to an obesity profiling system, comprising: (a) a metabolome detector/analyzer configured to detect/analyze levels or activities of a plurality of metabolites (or derivatives thereof) (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 39, 40, 50, 80, 100, 150, 200, 250, 300, 307, or more, e.g., 500 metabolites or derivatives thereof) from Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) in a subject's biological sample; (b) an obesity determining engine configured to determine obesity based on levels and/or activities of metabolites or derivatives thereof; (c) an optional data source (e.g., human metabolome database); and (d) a display communicatively connected to a computing device and configured to display a report containing the subject's obesity profile, wherein each of components (a), (b), (c) and (d) is communicatively connected to each other either directly or via indirectly (e.g., via the internet).
In some embodiments, the disclosure relates to an obesity profiling system, comprising: (a) a metabolome detector/analyzer configured to detect/analyze levels or activities of at least 3 metabolites of Table 1 or derivatives thereof in a subject's biological sample, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate; 5-methylthioadenosine; and glutamate; (b) an obesity determining engine configured to determine obesity based on levels and/or activities of metabolites of (a) or derivatives thereof; (c) a data source (e.g., human metabolome database); and (d) a display communicatively connected to a computing device and configured to display a report containing the subject's obesity profile, wherein each of components (a), (b), (c) and (d) is communicatively connected to each other either directly or via indirectly (e.g., via the internet).
In some embodiments, the disclosure relates to an obesity profiling system, comprising: (a) a metabolome detector/analyzer configured to detect/analyze levels or activities of at least 3 metabolites of Table 2 or derivatives thereof in a subject's biological sample, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1); (b) an obesity determining engine configured to determine obesity based on levels and/or activities of metabolites of (a) or derivatives thereof; (c) a data source (e.g., human metabolome database); and (d) a display communicatively connected to a computing device and configured to display a report containing the subject's obesity profile, wherein each of components (a), (b), (c) and (d) is communicatively connected to each other either directly or via indirectly (e.g., via the internet).
In some embodiments, the disclosure relates to an obesity profiling system of the foregoing, comprising: (a) a detector/analyzer configured to detect levels or activities of at least 3 metabolites of Table 1 or derivatives thereof in a subject's biological sample, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, 5-methylthioadenosine, and glutamate.
In some embodiments, the disclosure relates to an obesity profiling system, comprising: (a) a detector/analyzer configured to detect metabolic profile comprising at least 3 metabolites of Table 2 or derivatives thereof in a subject's biological sample, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1).
In some embodiments, the disclosure relates to a kit for determining a lipid or fat content of a biological sample, comprising, a plurality of probes for detecting a metabolite profile of the biological sample; vessels for holding the biological sample; optionally together with instructions for performing the detection, wherein the metabolite profile comprises at least three of the metabolites of Table 1 or derivatives thereof, wherein the at least 3 metabolites comprises: urate, 5-methylthioadenosine, and glutamate or derivatives thereof.
In some embodiments, the disclosure relates to a kit for determining a lipid or fat content of a biological sample, comprising, a plurality of probes for detecting a metabolite profile of the biological sample; vessels for holding the biological sample; optionally together with instructions for performing the detection, wherein the metabolite profile comprises at least three of the metabolites of Table 2 or derivatives thereof, wherein the at least 3 metabolites comprises: urate, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1) or derivatives thereof.
The disclosure relates to various exemplary embodiments of systems and methods to make precise predictions for individuals by measuring certain biomarkers in his/her metabolome. The disclosure, however, is not limited to these exemplary embodiments and applications or to the manner in which the exemplary embodiments and applications operate or are described herein. Moreover, the figures may show simplified or partial views, and the dimensions of elements in the figures may be exaggerated or otherwise not in proportion. In addition, as the terms “on,” “attached to,” “connected to,” “coupled to,” or similar words are used herein, one element (e.g., a material, a layer, a substrate, etc.) can be “on,” “attached to,” “connected to,” or “coupled to” another element regardless of whether the one element is directly on, attached to, connected to, or coupled to the other element or there are one or more intervening elements between the one element and the other element. In addition, where reference is made to a list of elements (e.g., elements a, b, c), such reference is intended to include any one of the listed elements by itself, any combination of less than all of the listed elements, and/or a combination of all of the listed elements. Section divisions in the specification are for ease of review only and do not limit any combination of elements discussed.
Unless otherwise defined, scientific and technical terms used in connection with the present teachings described herein shall have the meanings that are commonly understood by those of ordinary skill in the art. Further, unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. Generally, nomenclatures utilized in connection with, and techniques of, cell and tissue culture, molecular biology, and protein and oligo- or polynucleotide chemistry and hybridization described herein are those well-known and commonly used in the art. Standard techniques are used, for example, for nucleic acid purification and preparation, chemical analysis, recombinant nucleic acid, and oligonucleotide synthesis. Enzymatic reactions and purification techniques are performed according to manufacturer's specifications or as commonly accomplished in the art or as described herein. The techniques and procedures described herein are generally performed according to conventional methods well known in the art and as described in various general and more specific references that are cited and discussed throughout the instant specification. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual (Third ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 2000). The nomenclatures utilized in connection with, and the laboratory procedures and techniques described herein are those well-known and commonly used in the art.
As used herein, the term “diagnosis” refers to methods by which a determination can be made as to whether a subject is likely to be suffering from a given disease or condition, including but not limited diseases or conditions characterized by genetic variations. The skilled artisan often makes a diagnosis on the basis of one or more diagnostic indicators, e.g., a marker such as a metabolome, the presence, absence, amount, or change in amount, level or activity of which is indicative of the presence, severity, or absence of the disease or condition. Other diagnostic indicators can include patient history; physical symptoms (e.g., breathlessness, increased sweating, snoring, inability to cope with sudden physical activity, tiredness, lethargy, back and joint pains, etc.); psychological symptoms (e.g., low self-confidence and/or self-esteem, feeling isolated, depression, etc.); phenotype changes (large waistline, unhealthy fat distribution); metabolic syndrome (e.g., high regular body mass index, high triglyceride levels, low HDL cholesterol levels, high fasting blood sugar, type 2 diabetes, diseases of heart and/or blood vessels such as, e.g., deregulated blood pressure, atherosclerosis, heart attacks, or strokes; etc.); diseases of organs such as liver (e.g., non-alcoholic fatty liver disease; NAFLD), gall bladder, urinary bladder (e.g., urinary incontinence) and bone (e.g., osteoarthritis); genotype; or environmental or heredity factors. A skilled artisan will understand that the term “diagnosis” refers to an increased probability that certain course or outcome will occur; that is, that a course or outcome is more likely to occur in a patient exhibiting a given characteristic, e.g., the presence or level of a diagnostic indicator, when compared to individuals not exhibiting the characteristic. Diagnostic methods of the disclosure can be used independently, or in combination with other diagnosing methods, to determine whether a course or outcome is more likely to occur in a patient exhibiting a given characteristic.
As used herein, “metabolome” refers to the collection of all metabolites in a biological cell, tissue, organ or organism, which are the end products of cellular processes. Metabolome includes lipidome, sugars, nucleotides, amino acids, xenobiotics, carbohydrates, peptides, cofactors, vitamins, and cell process intermediates. As used herein, “lipidome” is the complete lipid profile in a biological cell, tissue, organ or organism.
As used herein, “metabolomic profiling” refers to the characterization and/or measurement of the small molecule metabolites in biological specimen or sample, including cells, tissue, organs, organisms, or any derivative fraction thereof and fluids such as blood, blood plasma, blood serum, saliva, synovial fluid, spinal fluids, urine, bronchoalveolar lavage, tissue extracts and so forth.
The “metabolite profile” or “metabolite signature” may include information such as the quantity and/or type of small molecules present in the sample. The ordinarily skilled artisan would know that the information, which is necessary and/or sufficient, will vary depending on the intended use of the metabolite profile. For example, the metabolite profile, can be determined using a single technique for an intended use but may require the use of several different techniques for another intended use depending on such factors as the disease state involved, the types of small molecules present in a particular targeted cellular compartment, the cellular compartment being assayed per se, and so forth.
The relevant information in a metabolite profile may also vary depending on the intended use of the compiled information, e.g., spectrum. For example for some intended uses, the amounts of a particular metabolite or a particular class of metabolite may be relevant, but for other uses the distribution of types of metabolites may be relevant.
Metabolite profiles may be generated by several methods, e.g., HPLC, thin layer chromatography (TLC), electrochemical analysis, Mass Spectroscopy (MS), refractive index spectroscopy (RI), Ultra-Violet spectroscopy (UV), fluorescent analysis, radiochemical analysis, Near-Infrared spectroscopy (Near-IR), Nuclear Magnetic Resonance spectroscopy (NMR), fluorescence spectroscopy, dual polarization interferometry, computational methods, Light Scattering analysis (LS), gas chromatography (GC), or GC coupled with MS, direct injection (DI) coupled with LC-MS/MS and/or other methods or combination of methods known in the art.
The term “small molecule metabolites” includes organic and inorganic molecules which are present in the cell, cellular compartment, or organelle, usually having a molecular weight under 2,000, or 1,500. The term does not include large macromolecules, such as large proteins (e.g., proteins with molecular weights over 2,000, 3,000, 4,000, 5,000, 6,000, 7,000, 8,000, 9,000, or 10,000), large nucleic acids (e.g., nucleic acids with molecular weights of over 2,000, 3,000, 4,000, 5,000, 6,000, 7,000, 8,000, 9,000, or 10,000), or large polysaccharides (e.g., polysaccharides with a molecular weights of over 2,000, 3,000, 4,000, 5,000, 6,000, 7,000, 8,000, 9,000, or 10,000). The small molecule metabolites of the cell are generally found free in solution in the cytoplasm or in other organelles, such as the mitochondria, where they form a pool of intermediates which can be metabolized further or used to generate large molecules, called macromolecules.
The term “small molecule metabolites” includes signaling molecules and intermediates in the chemical reactions that transform energy derived from food into usable forms. Examples of small molecule metabolites include phospholipids, glycerophospholipids, lipids, plasmalogens, sugars, fatty acids, amino acids, nucleotides, intermediates formed during cellular processes, isomers and other small molecules found within the cell. In one embodiment, the small molecules of the invention are isolated.
As used herein, the term “a significant number” denotes at least 5%, at least 10%, least 15%, least 20%, least 25%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or 100% (e.g., all) of a set (e.g., the metabolites of the Tables).
As used herein, the term “cell” denotes a basic structural, functional, and biological unit. The term includes biological cells of living organisms and also artificial or synthetic cells. Non-limiting examples of biological cells include eukaryotic cells, plant cells, animal cells, such as mammalian cells, reptilian cells, avian cells, fish cells, or the like, prokaryotic cells, bacterial cells, fungal cells, protozoan cells, or the like, cells dissociated from a tissue, such as muscle, cartilage, fat, skin, liver, lung, neural tissue, and the like, immunological cells, such as T cells, B cells, natural killer cells, macrophages, and the like, embryos (e.g., zygotes), oocytes, ova, sperm cells, hybridomas, cultured cells, cells from a cell line, cancer cells, infected cells, transfected and/or transformed cells, reporter cells, and the like. A mammalian cell can be, for example, from a human, a mouse, a rat, a horse, a goat, a sheep, a cow, a primate, or the like.
As used herein, the term “sample” refers to a composition that is obtained or derived from a subject of interest that contains a cellular and/or other molecular entity that is to be characterized and/or identified, for example based on physical, biochemical, chemical and/or physiological characteristics. The source of the tissue sample may be blood or any blood constituents; bodily fluids; solid tissue as from a fresh, frozen and/or preserved organ or tissue sample or biopsy or aspirate; and cells from any time in gestation or development of the subject or plasma. Samples include, but not limited to, primary or cultured cells or cell lines, cell supernatants, cell lysates, platelets, serum, plasma, vitreous fluid, ocular fluid, lymph fluid, synovial fluid, follicular fluid, seminal fluid, amniotic fluid, milk, whole blood, urine, cerebrospinal fluid (CSF), saliva, sputum, tears, perspiration, mucus, tumor lysates, and tissue culture medium, as well as tissue extracts such as homogenized tissue, tumor tissue, and cellular extracts. Samples further include biological samples that have been manipulated in any way after their procurement, such as by treatment with reagents, solubilized, or enriched for certain components, such as proteins or nucleic acids, or embedded in a semi-solid or solid matrix for sectioning purposes, e.g., a thin slice of tissue or cells in a histological sample. Preferably, the sample is obtained from blood or blood components, including, e.g., whole blood, plasma, serum, lymph, and the like.
As used herein, “substantially” means sufficient to work for the intended purpose. The term “substantially” thus allows for minor, insignificant variations from an absolute or perfect state, dimension, measurement, result, or the like such as would be expected by a person of ordinary skill in the field but that do not appreciably affect overall performance. When used with respect to numerical values or parameters or characteristics that can be expressed as numerical values, “substantially” means within 10%, or within 5% or less, e.g., with 2%.
As used herein, the term “detecting” refers to the process of determining a value or set of values associated with a sample by measurement of one or more parameters in a sample, and may further comprise comparing a test sample against a reference sample. In accordance with the present disclosure, the detection step includes identification, assaying, measuring and/or quantifying one or more markers or activities thereof.
As used herein, the term “level” is defined herein as including any information related to, for example, the amount, relative concentration and absolute concentration. The term also includes changes in the amount, relative and absolute concentrations, whether in a percentage or absolute context. These “level” changes may be used over a selected duration of time such as, for example, a time change in amount or concentration. The “level” may refer to a time change in amount or concentration, and compared to a later time change. The amount and rate of change of the metabolites are powerful tools in assessing the physiological state of the individual.
As used herein, the term “activity” relates to a functional property of a molecule (e.g., a metabolite). For the small molecule compounds, the term “activity” may relate to an adhesive property, e.g., binding to its binding partner such as a protein (e.g., enzyme, receptor, or antibody). Binding activity may be studied using Fourier transform spectroscopy (FTS), Raman spectroscopy, fluorescence spectroscopy (FS), circular dichroism (CD), nuclear magnetic resonance (NMR), mass spectrometry (MS), atomic force microscope (AFM), paramagnetic probes, dual polarization interferometry, surface plasmon resonance (SPR), fluorescence intensity, bimolecular fluorescence complementation, fluorescent resonance energy transfer (FRET), bio-layer interferometry, co-immunopreciptation, ELISA, equilibrium dialysis, gel electrophoresis, far western blot, fluorescence polarization anisotropy, electron paramagnetic resonance, or microscale thermophoresis. “Activity” of a molecule may also relate to a “functional activity” e.g., pharmacological activity (e.g., agonist, partial agonist or antagonist activity on a receptor or ligand), catalytic activity (e.g., allosteric regulation of an enzyme), toxicity (e.g., apoptotic or necrotic activity), or chemical activity (e.g., pigmentation). Functional activities may be determined using routine functional assays, e.g., pharmacological assays, toxicity assays, enzyme kinetics, colorimetric or fluorescence assays, etc. The term “activity” is used broadly to include a binary definition, e.g., a definition of a compound, as a whole, being either active or inactive. Additionally, the present systems and methods can provide finer binning, ranges of percentile IC50 or raw IC50 values, including grouping (e.g., quantile or standard deviations) based on statistical weights corresponding to functional profile or other molecular parameter. Probabilities for a compound to be active may also be reflected in the activity profile. For example, the present systems and methods can correlate molecular parameters with experimental data, so that a user can be provided with an estimation about the activity. Some implementations can use a linear regression model.
As used herein, the term “marker” refers to a characteristic that can be objectively measured as an indicator of normal biological processes, pathogenic processes or a pharmacological response to a therapeutic intervention, e.g., treatment with an anti-obesity agent. Representative types of marker characteristics include, for example, molecular changes in the structure (e.g., changes in the chemical composition of a metabolite) or level (e.g., changes in concentration of a metabolite) or activity (e.g., changes in pharmacological activity, enzymatic activity, metabolic activity, or any other biological activity). Marker characteristics may further include, e.g., a plurality of differences, such as changes in the levels of molecular markers and activities thereof.
As used herein, the term “metabolite” refers to the end product that remains after metabolism. In some embodiments, these metabolites leach out into the biological fluid, e.g., blood, sweat, urine, saliva, pleural fluid, tears, over time. Preferably, metabolites are compound derived from the metabolism of a biological macromolecule, e.g., fats, lipids, carbohydrates, polysaccharides, polynucleotides, etc.
As used herein, the term “derivative” includes salts, amides, esters, enol ethers, enol esters, acetals, ketals, acids, bases, solvates, hydrates, or polymorphs of the individual metabolites. Derivatives may include precursors or products (e.g., glutamate is a derivative of glutamine and vice versa). Derivatives may be readily prepared by those of skill in this art using known methods for such derivatization. The derivatives suitable for use in the methods described herein may be detected using methods that are used for detecting parent metabolites. Derivatives include solvent addition forms, e.g., a solvates or alcoholates, which may be synthesized to facilitate detection. Derivatives further include amides or esters of the amino acids and/or isomers (e.g., stereoisomers).
As used herein, the term “salt” includes salts derived from any suitable of organic and inorganic counter ions well known in the art and include, by way of example, hydrochloric acid salt or a hydrobromic acid salt or an alkaline or an acidic salt of the metabolites.
As used herein, the term “solvate” refers to compounds containing either stoichiometric or non-stoichiometric amounts of a solvent such as water, ethanol, and the like. “Hydrates” are formed when the solvent is water; alcoholates are formed when the solvent is alcohol.
As used herein, the term “metabolite profile” or “metabolomics profile” includes an inventory of metabolites (in tangible form or computer readable form) within a sample from a subject, or any derivative fraction thereof, that is necessary and/or sufficient to provide information to a user for its intended use within the methods described herein. The inventory may include the quantity, levels, activities and/or types of small molecules present. The information, which is necessary and/or sufficient, will vary depending on the intended use of the “metabolite profile.” For example, the “metabolite profile,” can be determined using a single technique for an intended use but may require the use of several different techniques for another intended use depending on such factors as genotypic or phenotypic traits of the subject, the disease state involved, the types of small molecules present in a particular sample, etc. In a further embodiment, the small molecule profile comprises information regarding at least 3, at least 5, at least 10, at least 20, at least 25, at least 35, at least 50, at least 75, at least 100, at least 150, at least 200, at least 250, at least 300, at least 350, or more, e.g., at least 400, metabolites. In some instances the term “profile” may be used to refer to said inventory of small molecules.
As used herein, “reference standard” refers to a sample of tissue or cells that may or may not have the disorder (e.g., obesity) or a trait thereof that are used for comparisons. Thus a “reference” standard thereby provides a basis to which another sample, for example plasma sample containing metabolite markers, e.g., metabolites of Table 1, that can be compared. In contrast, a “test sample” refers to a sample compared to a reference standard or control sample.
As used herein, the term “reference metabolic profile” or “reference metabolomic profile” refers to the resulting profile generated using the “reference sample.” The term includes information regarding the small molecules of the profile that is necessary and/or sufficient to provide information to a user for its intended use within the methods described herein. The reference profile would include the quantity and/or type of small molecules present.
As used herein, “test sample” refers to a sample obtained from the individual subject to be analyzed. The term “control,” as used herein, refers to a reference for a test sample, such as control cells obtained from healthy or normal subjects, wherein the subjects are not suffering from or are otherwise predisposed to obesity. In some aspects, controls include samples obtained from the same subject at different points in time, during which, the subject may be going through a clinically-approved therapy or experimental therapy, e.g., with drugs or surgical intervention or both.
The term “modulate” as used herein refers to an increase or decrease. The change (e.g., increase or decrease) may be qualitative or quantitative in nature. For example, the term modulate may refer to a post-therapy reduction in BMI values (e.g., quantitative modulation) or drops in mood swings (e.g., qualitative modulation) or reduction in a composite qualitative-quantitative score such as Patient Health Questionnaire-9 (PHQ9) in obese patients.
The term “enhance” or “increase” refers to an increase in the specified parameter of, e.g., at least about 1.25-fold, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 8-fold, 10-fold, twelve-fold, or fifteen-fold, or greater, e.g., 25-fold.
The term “inhibit” or “reduce” or grammatical variations thereof as used herein refers to a decrease or diminishment in the specified level or activity of at least about 15%, 25%, 35%, 40%, 50%, 60%, 75%, 80%, 90%, 95% or more, e.g., 99%. In particular embodiments, the inhibition or reduction results in little or essentially no detectible levels or activity of the parameter being measured. Herein, non-detectable level/activity typically represents an insignificant level/activity, e.g., <about 10% or even <about 5% of the initial level/activity.
As used herein, the term “treating” refers to curative action, palliative action (e.g., control or mitigate a disease or disease symptoms) or prophylactic action (e.g., reduce the frequency of, or delay the onset of a pathologic condition or symptoms of the condition in a subject receiving the therapy relative to a subject not receiving therapy. This can include reversing, reducing, or arresting the symptoms, clinical signs, and underlying pathology of a condition in a manner to improve or stabilize a subject's condition (e.g., regress rapid weight gain in obese subjects).
As used herein, the term “lifestyle therapy” includes, dietary management (e.g., reduce intake of high calorie diet), exercise management (e.g., increase frequency and/or rigor of exercise), stress management (e.g., reduce emotional or mental stress) and/or behavior management (e.g., quit smoking).
As used herein, the term “administering” is used in the broadest sense as giving or providing to a subject in need of the treatment, a composition such as a pharmaceutical agent (e.g., drug) or a pharmaceutical composition containing the pharmaceutical agent. For instance, in the pharmaceutical sense, “administering” means applying as a remedy, such as by the placement of a drug in a manner in which such drug would be received, e.g., intravenous, oral, topical, buccal (e.g., sub-lingual), vaginal, parenteral (e.g., subcutaneous; intramuscular including skeletal muscle, cardiac muscle, diaphragm muscle and smooth muscle; intradermal; intravenous; or intraperitoneal), topical (e.g., skin or mucosal surfaces), intranasal, transdermal, intraarticular, intrathecal, inhalation, intraportal delivery, organ injection (e.g., eye or blood, etc.), or ex vivo (e.g., via immunoapheresis).
As used herein, “contacting” means that the composition comprising the pharmaceutical agent or a pharmaceutical composition comprising the agent is introduced into a sample containing a target, e.g., cell target, in a test tube, flask, tissue culture, chip, array, plate, microplate, capillary, or the like, and incubated at a temperature and time sufficient to permit binding of the agent to the target (e.g., cells) or vice versa (e.g., blood cells coming into contact with the agent). In the in vivo context, “contacting” means that the therapeutic or diagnostic molecule is introduced into a patient or a subject for the treatment of a disease, and the molecule is allowed to come in contact with the patient's target tissue, e.g., blood tissue, in vivo or ex vivo.
As used herein, the term “therapeutically effective amount” refers to an amount that provides some improvement or benefit to the subject. Alternatively stated, a “therapeutically effective” amount is an amount that will provide some alleviation, mitigation, or decrease in at least one clinical symptom in the subject. Methods for determining therapeutically effective amount of the therapeutic molecules, e.g., anti-obesity drugs, are described below.
As used herein, the term “subject” means an individual. In one aspect, a subject is a mammal such as a human. In one aspect a subject can be a non-human primate. Non-human primates include marmosets, monkeys, chimpanzees, gorillas, orangutans, and gibbons, to name a few. The term “subject” also includes domesticated animals, such as cats, dogs, etc., livestock (e.g., cows, pigs, goats), laboratory animals (e.g., mouse, rabbit, rat, gerbil, guinea pig, etc.) and avian species (e.g., chickens, turkeys, ducks, etc.). Subjects can also include, but are not limited to fish (for example, zebrafish, goldfish, tilapia, salmon, and trout), amphibians and reptiles. Preferably, the subject is a human subject. Especially, the subject is a human patient.
As used herein, the term “obesity” generally refers to a condition, temporary or chronic, which is defined by an excess amount body fat. The normal amount of body fat (expressed as percentage of body weight) is between about 25-30% in women and about 18-23% in men. Women with over 30% body fat and men with over 25% body fat are characterized as being obese.
As used herein, the term “healthy obesity” denotes a condition which would normally be classified as overweight or obese under a clinically acceptable metric, e.g., a body mass index (BMI) score of at least about 25 (overweight) or 30 (obesity), but which is extricated from the health complications that are normally linked with obesity.
In contrast, the term “metabolic obesity” denotes a condition which can be classified as non-obese under a clinically acceptable metric, e.g., a body mass index (BMI) score of less than about 25 (overweight) or 30 (obese), but which is nonetheless implicated with the health complications that are normally linked with obesity. “Unhealthy obesity” includes, but is not limited to, metabolic syndrome (a cluster of metabolic disorders that is characterized by obesity, high blood lipid levels, high blood pressure, and/or insulin resistance/high blood sugar) and cardiovascular disease consequences. The level of unhealthiness may be qualitative or quantitative, preferably quantitative. Cutoffs between healthy and unhealthy may be made based on statistical measurements, e.g., using a parametric or a non-parametric mBMI distribution and confidence estimates. Alternately, a regression residual for the difference between two parameters (e.g., BMI and mBMI, optionally adjusted for age and sex) may be used. Individuals in the top 5%, top 10%, top 20%, top 25%, or top 40%, preferably top 10% of the residual distribution may be classified as being obese.
“Body Mass Index, (or BMI)” refers to a calculation that uses the height and weight of an individual to estimate the amount of the individual's body fat. Too much body fat (e.g. obesity) can lead to illnesses and other health problems. BMI is the measurement of choice for many physicians and researchers studying obesity. BMI is calculated using a mathematical formula that takes into account both height and weight of the individual. BMI equals a person's weight in kilograms divided by height in meters squared. (BMI=kg/m2). Subjects having a BMI less than 18.5 are considered to be underweight, while those with a BMI of between 18.5 and 25 are considered to be of normal weight, while a BMI of between 25 to 30 are generally considered overweight, while individuals with a BMI of 30 or more are typically considered obese. Morbid obesity refers to a subject having a BMI of 40 or greater.
As used herein, an “obesity-related disease or condition” includes, but is not limited to, coronary artery disease, hypertension, stroke, peripheral vascular disease, insulin resistance, glucose intolerance, diabetes mellitus, hyperglycemia, hyperlipidemia, hypercholesteremia, hypertriglyceridemia, hyperinsulinemia, atherosclerosis, cellular proliferation and endothelial dysfunction, diabetic dyslipidemia, lipodystrophy and metabolic syndrome, type II diabetes, diabetic complications including diabetic neuropathy, nephropathy, retinopathy or cataracts, heart failure, inflammation, thrombosis, congestive heart failure, asthmatic or pulmonary disease related to obesity, and cardiovascular disease related to obesity.
As used herein, the term “screen” refers to a specific biological or biochemical assay which is directed to measurement of a specific condition or phenotype that a molecule induces in a target, e.g., target cell-free system, target cells, tissues, organs, organ systems, or organisms.
As used herein, the term “selecting” in the context of screening compounds or libraries includes both (a) choosing compounds from a group previously unknown to be modulators of a condition or phenotype (e.g., obesity); and (b) testing compounds that are known to be inhibitors or activators of the condition or phenotype (e.g., obesity). Both types of compounds are generally referred to herein as “test compounds.” The test compounds may include, by way of example, polypeptides (e.g., small peptides, artificial or natural proteins, antibodies), polynucleotides (e.g., DNA or RNA), carbohydrates (small sugars, oligosaccharides, and complex sugars), lipids (e.g., fatty acids, glycerolipids, sphingolipids, etc.), mimetics and analogs thereof, and small organic molecules having a molecular weight of less than about 10 KDa, preferably less than about 5 KDa, especially less than about 1 KDa (e.g., about 300 daltons to about 800 daltons). Preferably, the test compounds are provided in library formats known in the art, e.g., in chemically synthesized libraries, recombinantly-expressed libraries (e.g., phage display libraries), and in vitro translation-based libraries (e.g., ribosome display libraries).
In some embodiments, the disclosure relates to a method of diagnosis of obesity in subjects.
In step 110 of method 100 of
In step 120 of method 100 of
The metabolomic data, which are optionally analyzed with a toolkit, may be processed to generate standardized data, which ensures non-redundancy and/or integrity of data. The processing step may comprise normalization and/or standardization. Thus, the process of encoding categorical data and normalizing numeric data (sometimes called data standardization) can be carried out in accordance with the methods of the present disclosure. For example, values from multiple experimental batches may be normalized into Z-scores based on a reference cohort of n self-reported healthy individuals run with each batch, which normalized batches are converted to the same scale using linear transformation based on the values obtained from the runs that include the controls. Samples with metabolite measurements that are below the detection threshold are imputed as the minimum value for that metabolite and any batch that does not meet this threshold requirement may be purged or rerun. This process may be carried out for each metabolite of interest.
In embodiments wherein the metabolomic data is received in situ, any biological sample may be used to obtain the metabolomics profile. Preferably, the sample is a biological fluid sample, containing, w/w, at least about 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 99% of an aqueous agent compared to particulate matter in solution, dispersion, colloid, or sol-form. Representative examples include, e.g., blood (including whole blood), blood plasma, blood serum, hemolysate, lymph, synovial fluid, spinal fluid, urine, cerebrospinal fluid, stool, sputum, mucus, amniotic fluid, lacrimal fluid, cyst fluid, sweat gland secretion, bile, milk, tears, saliva, or earwax. Blood-based samples, e.g., plasma, serum, hemolysate, lymph, are preferred.
In step 130 of method 100 of
As can be seen from Equation 1, the metabolite value (e.g., levels or concentration) exerts an effect on mBMI. Accordingly, in some embodiments, the metabolites of Tables 1-7, Table 11, Table 12A, Table 12B; preferably the metabolites of Tables 2-7; and especially the metabolites of Table 2 or Tables 4-7, exert an effect on mBMI. Particularly, in the case of the metabolites of Tables 1, 2, and 4-7, the relative effect of each metabolite on mBMI is associated with the order in which they are listed (i.e., metabolites that are listed at the top exert an effect on mBMI that is greater than metabolites that are listed in the bottom). Moreover, in the case of the metabolites of Table 3, the relative effect of each metabolite on mBMI is associated with its rank (in parenthesis).
In some embodiments, the ML is trained with an in silico metabolomic dataset. For example, the in silico dataset may include tissue samples (e.g., from subjects, both male and female, who are between 12 and 95 years of age. The association between specific metabolites and obesity is identified using a robust mathematical regression. The markers that are highly specific and also tightly associated with specific conditions, e.g., cardiovascular diseases (e.g., heart disease such as heart attack, angina, heart failure, arrhythmia), cerebrovascular diseases (e.g., stroke), vascular disease (e.g., high blood pressure), and/or diabetes, may be further identified using the robust mathematical regression, are then studied for the features, including, association with any obesity-related genes or signatures. A representative method is described in the Examples.
The architecture of the machine learning approach will be discussed in detail below.
Not being bound to a single embodiment and purely for the purpose of illustration, a machine-learning algorithm was integrated into the existing methodology at an individual, or combination of individual steps, in accordance with various embodiments herein. ML can be incorporated to optimize the results coming out of the algorithm (e.g., neural network, ML algorithm, etc.), by utilization of inputted training data sets, cross reference of output to known answers, backpropagation, and adjustment of weighting factors and parameters associated with the given ML algorithm in a repeating loop to arrive at a threshold quality of data output. For instance, in the process described here, machine earning (ML) is used to identify the best weights to assign to metabolites associated with BMI when building the mBMI model. The specific algorithm used is the glmnet package in R, specifically the cvglmnet function, which performs 10-fold cross validation. For training, we took a random half of our sample. The cvglmnet function performed 10-fold cross validation in this half of the dataset to assign the weights to each metabolite. We then tested the resulting model in the other half of the dataset by applying those weights. Other methods that could be used to achieve similar results would include random forest regression and linear regression. In subsequent steps, the prediction power of the model on the test dataset may be validated, e.g., using a probability model such as logistic regression. Optionally, a resampling may be performed to obtain an unbiased appraisal of the model's likely future performance. Features of ROC curve, such as, area-under-the curve (also called c-index) or concordance probability from a statistical test such as the Wilcoxon-Mann-Whitney test, may provide a good summary measure of pure predictive discrimination.
Generally in method 100 of
In step 140 of method 100 of
It should be noted that use of actual BMI for comparative assessment is optional because the subject's mBMI value may be directly compared with a reference standard (e.g., control). In some embodiments, control mBMI values may be determined using an identical sample obtained from a non-obese individual, which values are computed using steps 110, 120 and 130 of the aforementioned method 100). In some embodiments, the control mBMI values may be based on statistically determined value (e.g., mean or median) in a population of non-obese subjects. Control mBMI may be adjusted for age, gender, race, and any other variable that may influence the physiology of the subject.
In step 150 of method 100 of
In step 160 of method 100 of
In step 170 of method 100 of
In some embodiments, the reference standard comprises a BMI score for the subject and the subject is deemed at risk of obesity or a disease associated therewith if the mBMI>>a BMI of about 18.5 to about 24.9 kg/m2 (normal BMI); particularly if mBMI>a BMI of about 25 to about 30 kg/m2 (overweight BMI); and especially if the mBMI>a BMI of about 30 kg/m2 (obese BMI).
Preferably, determinations of normal weight, overweight, obesity or morbid obesity are made via statistical analysis. In a representative embodiment, a residual score may be used. For instance, if the residual of mBMI regressed on BMI, age and sex is greater than about 0.4, 0.5, 0.6, 0.7, or more, e.g., 0.8 (preferably, >0.5) then they are put into the high risk category.
In some embodiments, the methods of the disclosure may be further carried out by detecting one or more signatures. Such signatures may comprise, for example, a plurality of metabolites (e.g., about 2, 3, 4, 5, 6, 8, 10, 12, 13, 15, 20, 25, 30, 35, 40, 45, 49, 50, 60, 75, 100, 125, 150, 200, 250, 300, 307 or more metabolites). Representative signatures including a significant number, e.g., at least 50%, at least 65%, at least 80%, at least 90%, or more, e.g., 100% of the metabolites of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7), or derivatives thereof.
In some embodiments, the methods of the disclosure are carried out by detecting one or more signatures comprising the broad classes of metabolites recited in Table 3, or a derivative thereof.
†Blood labs for TG (triglycerides), Chol (cholesterol), HDL (high-density lipoprotein) or LDL (low-density lipoprotein) that had an r2 > 0.1 with the metabolite are indicated in parentheses.
Metabolites may be included/excluded in a signature based on a variety of criteria, including, inclusion or exclusion of metabolites (or derivatives) from the same class, e.g., amino acids, carbohydrates, lipids, co-factors, nucleotides, peptides, xenobiotics, etc.; inclusion or exclusion of metabolites (or derivatives) based on whether they belong to the same or different sub-pathway, e.g., amino acid metabolism, sugar metabolism, purine metabolism, phospholipid metabolism, steroid metabolism, fatty acid or TCA metabolism, etc.; inclusion or exclusion of metabolites (or derivatives) based on directionality of correlation with BMI, e.g., signatures comprising metabolites that are only positively or negatively correlated with BMI.
Owing partly due to enhanced prognostic significance of signatures compared to unitary markers, it may be preferable to group markers into distinct subgroups based on one or more statistical parameters. For instance, metabolites that are uncorrelated with each other (individually) may be grouped together so that changes in the levels/activities of individual markers are guided by factors other than other components of the composite. On this basis, a linear regression model was used to generate a three metabolite base signature comprising (a) 1-(1-enyl-stearoyl)-2-docosahexaenoyl-GPC (P-18:0/22:6); (b) sphingomyelin (d18:1/18:1, d18:2/18:0); and (c) urate; or derivatives thereof. Using a slightly more expansive linear regression model, a similar methodology was used to generate a six marker signature comprising: (a) N-acetylglycine, (b) 1-(1-enyl-stearoyl)-2-docosahexaenoyl-GPC (P-18:0/22:6), (c) sphingomyelin (d18:1/18:1, d18:2/18:0), (d) cortisone, (e) mannose, and (f) urate; or a derivative thereof.
In some embodiments, prognostic metabolomic signatures may be identified using coefficients from an mBMI model. Such signatures may comprise, in reverse order of strength, metabolite markers selected from: urate, 1-stearoyl-2-dihomo-linolenoyl-GPC (18:0/20:3n3 or 6)*, alanine, N-acetyltyrosine glutamate, 1-palmitoleoyl-3-oleoyl-glycerol (16:1/18:1)*, 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1)*, 1-(1-enyl-stearoyl)-2-oleoyl-GPC (P-18:0/18:1), 1-(1-enyl-stearoyl)-2-docosahexaenoyl-GPC (P-18:0/22:6)*, 1-arachidoyl-GPC (20:0), N-acetylglycine, sphingomyelin (d18:1/18:1, d18:2/18:0), mannose, and cortisone; or a derivative thereof.
In some embodiments, prognostic metabolomic signatures may be identified using lasso regression. Signatures identified by such methods may comprise, in reverse order of strength, metabolite markers selected from: urate, 1-stearoyl-2-dihomo-linolenoyl-GPC (18:0/20:3n3 or 6)*, alanine, N-acetyltyrosine glutamate, 1-palmitoleoyl-3-oleoyl-glycerol (16:1/18:1)*, 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1)*, 1-(1-enyl-stearoyl)-2-oleoyl-GPC (P-18:0/18:1), 1-(1-enyl-stearoyl)-2-docosahexaenoyl-GPC (P-18:0/22:6)*, 1-arachidoyl-GPC (20:0), N-acetylglycine, sphingomyelin (d18:1/18:1, d18:2/18:0), mannose, and cortisone; or a derivative thereof.
Epidemiological Analysis
Events of interest, e.g., disease onset, disease progression, morbidity and mortality (termed disease variable), which occur due to an explanatory variable, such as high mBMI, are generally measured by analyzing the effect of the explanatory variable on the disease variable. Generally speaking, this is done by comparing rates of disease in an explanatory group versus a control group. There are a number of ways of comparing the explanatory and control groups, using different measures of association. A measure of association is any mathematical or statistical measure that used to quantify the association between two or more variables. In the context of epidemiology, a measure of association is any such mathematical or statistical relationship used to measure disease frequency relative to other factors, and is an indication of how more or less likely one is to develop disease as compared to another. Measures of association focus on risk factors, which are found to be associated with a health condition, and may be thought of as an attribute or exposure that increases the probability of occurrence of disease (e.g., behavior, genetic, environmental or social factors, time, person or place).
Epidemiological measures of association can broadly be divided into absolute and relative comparisons. Thus, a study of the rate of a disease phenotype (e.g., heart attack) may yield a rate of 2 per 100 in obese subjects and 1 per 100 in non-obese (normal weight) subjects. An absolute comparison such as (2 per 100)−(1 per 100)=(1 per 100), meaning there is one additional case per 100 obese subjects. A relative comparison such as (2 per 100)/(1 per 100)=2, means that obese subjects are at twice the risk of control subjects (subjects with normal weight or normal BMI). Both metrics, including, statistical classifications thereof, e.g., in percentile or quantile, may be used.
A variety of different measures of association is routinely used in epidemiology. The most common are relative risk (RR; risk ratio) and odds ratio (OR). Risk ratio is often used in cohort studies and may be defined as the relative risk associated with a risk factor, e.g., RR=R1/R0, where R1 is the rate in an exposed group versus RO, the rate in a non-exposed group. RR is thus a risk multiplier on top of a baseline risk RO, where the segment of the RR above 1 represents elevation in risk. Thus, a RR of 1.0 or greater indicates an increased risk, a RR of less than 1.0 indicates decreased risk, and a RR of 2 represents a 100% increase in risk. OR is an epidemiological measure of association expressing disease frequency in terms of odds, and is defined as the odds of disease in the exposed population divided by the odds of disease in the unexposed population. OR is more often used in case-controlled studies, and may involve a comparison of disease cases with the prevalence among non-cases for controls. Both RR and OR characterize the association between the exposure and the disease in relative terms, and both reflect the frequency of disease occurrence among exposed subjects as a multiple of the rate among unexposed subjects.
Absolute or difference measures of association are also used in epidemiology, and are generally referred to as attributable risk and population attributable risk percent. Attributable risk is defined as the incidents of disease in an exposed population minus the incidents of disease in the unexposed population, and generally is thought of as the number of cases among the exposed that could be eliminated if the exposure were removed. Population attributable risk percent is defined as the incidents of the disease in the total population minus the incidents in the unexposed population, divided by the incidents of disease in the total population. It measures the excess risk of disease in the total population attributable to exposure and the reduction in risk, which would be achieved if the population were entirely unexposed. Epidemiological measures of association are further defined and explained in Epidemiology: Beyond the Basics, by Xavier Nieto et al., Jones & Bartlett Learning; 4th Ed. (2018); and Nguyen et al., Gastroenterol Clin North Am. 39(1): 1-7 (2010).
In the present disclosure, patients whose mBMI values are significantly elevated compared to BMI values are, at least, 20%, 30%, 40%, 50%, 60%, 80%, 100%, 125%, 150%, 175%, 180%, 200%, 250%, 300%, or a greater %, e.g., 400%, more likely to suffer from an adverse events compared to controls. For instance, the number of events per 100 patients with a healthy metabolome (normal BMI) was increased by more than 80% in outliner patients with obese metabolic profile (normal BMI) (i.e., about 2.0 events in 100 patients in healthy subjects versus about 3.7 events in 100 patients in outlier subjects). In obese individuals, the number of events was elevated even more (about 4.2 events in 100 patients or increase of about 110% compared to healthy subjects). Separated analysis of the various endpoints reveals that the association was much more pronounced and accentuated for subjects with cardiovascular diseases than patients who had or who were at risk for developing stroke.
Additional Steps for Improving Robustness of Analysis
In some embodiments, the disclosure includes improving prognostic significance of the methods of the disclosure by analyzing a variety of environmental and/or genetic factors that may play in the predisposition, initiation, development, and pathophysiology of the obesity phenotype or diseases related thereto. Representative examples of such factors include, e.g., android/gynoid ratio, total triglycerides, waist/hip ratio, subcutaneous fat, visceral fat, insulin resistance, high density lipoprotein (HDL) levels, percent fat, diastolic blood pressure, systolic blood pressure, total cholesterol, low density lipoprotein (LDL), insulin resistance, dual-energy X-ray absorptiometry (DEXA) scores, and other anthropomorphic traits (larger-framed individuals).
In some embodiments, the disclosure includes improving the prognostic significance of the methods of the disclosure by analyzing the sample for the presence or absence of one or more genetic factors. In one specific embodiment, the prognostic methods include analysis of whether the subject is a carrier of a rare (MAF<0.01%) coding variants in the melanocortin 4 receptor gene (MC4R), including, variants thereof, e.g., M292fs, R236C, S180P, A175T, and T11A, but not I170V. Table 9 provides a general overview on the association of these MC4R variants with obesity in participants of European ancestry.
In some embodiments, the methods of the disclosure include supplemental genetic analysis data comprising annotations of risk genes and linkages thereof to obesity phenotypes in the human genome mutation database (HGMD; accessible via the world-wide-web at hgmd(dot)cf(dot)ac(dot)uk) or clinically relevant variant archive (CLINVAR; accessible via the world-wide-web at www(dot)ncbi(dot)nlm(dot)nih(dot)gov/clinvar). As outlined in detail in the Examples section, MC4R carriers had significantly higher BMI (p=0.02) and a positive statistical correlation than non-carriers. MC4R carriers also generally had higher diastolic blood pressure, insulin resistance, and percent body fat. The BMI data in the participants supported a pathogenic role for five of the variants (Met292fs, Arg236Cys, Ser180Pro, Ala175T, and Thr11Ala), but did not Ile170V (identified in HGMD and ClinVar as being pathogenically associated with obesity). Overall, MC4R variant carriers are observed with greatest frequency (about 6.1%) in obese patients with polygenic risk scores in the lowest quartile compared to subjects with normal weight (only about 0.3%).
In some embodiments, the methods of the disclosure include supplemental genetic analysis data comprising annotations of risk genes and linkages thereof, e.g., lipodystrophy genes, to obesity phenotypes. Particularly, the disclosure includes analysis of one or more of the Table 10 genes or linkages thereof:
In particular, the disclosure relates to analysis of at least 1, 2, 3, 4 or more, e.g., all genetic variants of the zinc metallopeptidase STE24 (ZMPSTE24) gene or the 1-acylglycerol-3-phosphate O-acyltransferase 2 (AGPAT2) gene or lipase E, hormone sensitive type (LIPE) gene or Bernardinelli-Seip congenital lipodystrophy type 2 (BSCL2) gene, or any combination thereof. The resulting variation may result in a change (e.g., mutation) in an amino acid sequence encoded by the gene. In some embodiments, the genetic variants include, one or more variations in the ZMPSTE24 gene comprising variation at chr1:40290870 G/GT (e.g., resulting in p.Leu362fs). In some embodiments, the genetic variants include, one or more variations in the AGPAT2 gene comprising variation chr9:136673876 G/C (e.g., resulting in p.Ala238Gly). In some embodiments, the genetic variants include one or more variations in the LIPE gene comprising variation chr19:42401821 CCCCCCGCAGCCCCCGTCTA/C (e.g., resulting in p.Val1068fs). In some embodiments, the genetic variants include one or more variations in the BSCL2 gene comprising variation chr11:62692371 C/T (e.g., resulting in c.863+5G>A). Preferably, the genetic variants include at least 2, 3, 4 or all of the aforementioned variations. Information on the genetic variants can be obtained from known databases, e.g., Varsome (varsome(dot)com) or Clinvar database (ncbi(dot)nlm(dot)nih(dot)gov/clinvar/).
Methods of Diagnosis
Methods for diagnosing, or aiding in diagnosing, whether a subject has obesity or a disease or condition related thereto, such as diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy, may performed using one or more of the biomarkers identified in the respective tables provided herein. A method of diagnosing (or aiding in diagnosing) includes (a) analyzing a biological sample from a subject to determine the levels or activities of one or more biomarkers in the sample and (b) comparing the levels or activities of one or more biomarkers in the sample to disease-positive or condition-positive reference levels (e.g., positive control) and/or disease-negative or condition-negative reference levels (e.g., negative control) of the one or more biomarkers to diagnose (or aid in the diagnosis of) whether the subject has the disease or condition. For example, a method of diagnosing whether a subject is obese may include the steps of (a) analyzing a biological sample (e.g., serum or blood) from a subject to determine the levels or activities of one or more metabolites of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) in the sample to compute an mBMI score; (b) optionally comparing the mBMI score to the actual BMI score; and (c) diagnosing obesity or a disease related thereto by comparing the mBMI score to a reference standard.
The diagnostic methods of the disclosure may be used along with other methods that are useful in the clinical determination of whether a subject has obesity or a disease related thereto. Methods useful in the clinical determination of whether a subject has a disease or condition related to obesity, such as, diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy are known in the art. For example, methods useful in the clinical determination of whether a subject has diabetes include, e.g., glucose disposal rates (Rd), body weight measurements, waist circumference measurements, BMI determinations, peptide YY measurements, hemoglobin A1C measurements, adiponectin measurements, fasting plasma glucose measurements, free fatty acid measurements, fasting plasma insulin measurements, and the like. Methods useful for the clinical determination of atherosclerosis and/or cardiomyopathy in a subject include angiography, stress-testing, blood tests (e.g., to measure homocysteine, fibrinogen, lipoprotein A, small LDL particles, and C-reactive protein levels), electrocardiography, echocardiography, computed tomography (CT) scans, ankle/brachial index, and intravascular ultrasounds.
In the context of diagnosing or treating obesity-related disease such as diabetes, the methods of the disclosure may be combined with methods for diagnosing diabetes, e.g., measurement of glucose disposal rate (Rd) as measured by the HI clamp. Similarly, insulin sensitivity of the individual can be determined using appropriate in vitro or in vivo assays. In some embodiments, such methods include use of oral glucose tolerance tests (OGTT) for use in categorizing subjects as having normal glucose tolerance (NGT), impaired fasting glucose levels (IFG), or impaired glucose tolerance (IGT). Methods for determining level of insulin resistance using a calibrated insulin resistance score (IR score) are known in the art. See, Shalaurova et al., Metab Syndr Relat Disord., 12(8): 422-429, 2014. The IR Score can be used to monitor disease progression or remission, response to therapeutic intervention and also for evaluating drug efficacy.
After the levels or activities of the one or more metabolites (or derivatives) is determined, the level(s) may be compared to disease or condition reference levels or activities of the one or more metabolites (or derivatives) to determine a rating for each of the one or more metabolites (or derivatives) in the sample. Preferably, the ratings are aggregated using any algorithm to create a score, for example, an mBMI score, for the subject. The algorithm may take into account any factors relating to a particular disease or condition related to obesity, such as cardiomyopathy or diabetes, including the number of biomarkers, the correlation of the biomarkers to the particular disease or condition, etc.
The identification of biomarkers herein allows for monitoring progression/regression of obesity or a disease related thereto (e.g. diabetes, metabolic syndrome, atherosclerosis, cardiomyopathy, insulin resistance, etc.) in a subject. A method of monitoring the progression/regression of obesity or a disease related thereto, such as diabetes, metabolic syndrome, atherosclerosis, and cardiomyopathy, in a subject comprises (a) analyzing a first biological sample from a subject to determine the levels or activities of one or more metabolites selected from Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) or a derivative thereof, or a combination thereof in the first sample obtained from the subject at a first time point, (b) analyzing a second biological sample from a subject to determine the levels or activities of the one or more metabolites of (a) or a derivative thereof, the second sample obtained from the subject at a second time point, and (c) comparing the levels or activities of one or more metabolites in the first sample to the levels or activities of one or more metabolites in the second sample in order to monitor the progression/regression of the disease or condition in the subject. The results of the method are indicative of the course of the disease or condition (i.e., progression or regression, if any change) in the subject.
In some particular embodiments, progression or regression of obesity or a disease related thereto may be based on metabolomics BMI (mBMI) score which is indicative of the obesity (particularly unhealthy obesity) in the subject and which can be monitored over time. By comparing the mBMI score from a first time point sample to the mBMI score from at least a second time point sample the progression or regression of obesity or a disease related thereto can be determined. Such a method of monitoring the progression/regression of obesity or a disease related thereto in a subject comprises (a) analyzing a first biological sample from a subject for metabolites of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) to determine an mBMI score for the first sample obtained from the subject at a first time point, (b) analyzing a second biological sample from a subject for the metabolites of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) to determine a second mBMI score, the second sample obtained from the subject at a second time point, and (c) comparing the mBMI score in the first sample to the mBMI score in the second sample in order to monitor the progression/regression of obesity or a disease related thereto in the subject.
The markers and algorithms of the instant disclosure, which are useful for progression monitoring, may be further used to guide or assist physicians to make decisions about preventative or therapeutic measures such as dietary restrictions, exercise, or early-stage drug treatment.
The biomarkers identified herein may also be used in the determination of whether a subject who is not exhibiting any symptoms of a disease or condition, such as obesity or a disease related thereto, may nonetheless be at risk. Such methods are particularly useful, e.g., in determining whether a subject is predisposed to developing obesity or a disease related thereto, e.g., diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy. Such methods include (a) analyzing a first biological sample from a subject to determine the levels or activities of one or more metabolites selected from Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) or a derivative thereof in the first sample obtained from the subject at a first time point, (b) analyzing a second biological sample from a subject to determine the levels or activities of the one or more metabolites of (a) or a derivative thereof, the second sample obtained from the subject at a second time point, and (c) comparing the levels or activities of one or more metabolites in the first sample to the levels or activities of one or more metabolites in the second sample in order to determine the subject's predisposition to or risk of developing obesity or a disease related thereto. The results of the method may be used along with other methods (e.g., biochemical assays, physiological measurements, and/or lifestyle evaluations) to clinically determine whether a subject is predisposed to or at risk of developing obesity or a disease related thereto.
After the levels or activities of the one or more metabolites or derivatives thereof in the sample are determined, the levels or activities may be compared to disease-positive or condition-positive and/or disease-negative or condition-negative reference levels in order to predict whether the subject is predisposed to or at risk of developing obesity or a disease related thereto, such as, diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy. Levels of the one or more metabolites (or derivatives thereof) in a sample corresponding to the disease-positive or condition-positive reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of the subject being predisposed to or at risk of developing obesity or a disease related thereto. Levels of the one or more metabolites (or derivatives thereof) in a sample corresponding to disease-negative or condition-negative reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of the subject not being predisposed to or at risk of developing obesity or a disease related thereto. In addition, levels of the one or more metabolites (or derivatives thereof) that are differentially present (especially at a level that is statistically significant) in the sample as compared to disease- or condition-negative reference levels may be indicative of the subject being predisposed to developing obesity or a disease related thereto. Levels of the one or more metabolites (or derivatives thereof) that are differentially present (especially at a level that is statistically significant) in the sample as compared to disease- or condition-positive reference levels may be indicative of the subject not being predisposed to developing the disease or condition.
Preferably, in carrying out the prognostic methods of the disclosure, the levels or activities of the one or more metabolites (or derivatives thereof) of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) may be outputted as a metabolomics BMI (mBMI) score which is indicative of the obesity (particularly unhealthy obesity) in the subject and which can be used to prognosticate obesity or a disease related thereto. By comparing the mBMI score of a subject's sample to the mBMI score of a reference standard (e.g., obtained by analyzing the levels or activities of the same metabolites in one or more healthy subjects), a determination can be made regarding whether the subject is predisposed to or at risk of developing obesity or a disease related thereto. Such a method of determining predisposition to or risk of developing obesity or a disease related thereto can be made by (a) analyzing a first biological sample from a subject to determine levels or activities of the one or more metabolites (or derivatives thereof) of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) and computing a first mBMI score for the first sample obtained from the subject, (b) analyzing an identical biological sample from a reference (e.g., healthy subjects) to determine levels or activities of the one or more metabolites of step (a) and computing a second mBMI score, and (c) comparing the mBMI score in the first sample to the mBMI score in the second sample in order to determine whether the subject is predisposed to or at risk of developing obesity or a disease related thereto. Herein, if the mBMI score of the test sample exceeds the mBMI score in the second sample, then the subject is evaluated as being predisposed to or at risk of developing obesity or a disease related thereto.
Purely by way of example, after the levels or activities of the one or more metabolites (or derivatives thereof) in the sample are determined, the levels or activities are used to compute an mBMI score for the subject, and the subject's mBMI score compared to mBMI scores of obesity-positive and/or obesity-negative reference samples in order to predict whether the subject is predisposed to or at risk of developing obesity or a disease related thereto. If the subject's mBMI scores correspond to the mBMI scores of obesity-positive reference standards (e.g., scores that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels), then the result indicates that the subject is predisposed to or at risk of developing obesity or a disease related thereto. If the subject's mBMI scores correspond to the mBMI scores of obesity-negative reference standards (e.g., scores that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels), then the result indicates that the subject is not predisposed to or not at risk of developing obesity or a disease related thereto. If the subject's mBMI score is elevated compared to the mBMI score of a negative sample (especially at a level that is statistically significant), then the results are indicative of the subject being predisposed to developing obesity or a disease related thereto. If the subject's mBMI score is attenuated compared to the mBMI score of a positive sample (especially at a level that is statistically significant), then the results are indicative of the subject not being predisposed to developing obesity or a disease related thereto. Although obesity is discussed in this example, predisposition to or risk of developing related diseases, e.g., diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy, may also be determined in accordance with the instant methods.
Predisposition to or risk of developing obesity or a disease related thereto may be computed using methods outlined above. For instance, for parametric continuous variables such as mBMI, means along with standard deviations (SD) may be used. For categorical data such as % body fat, % visceral fat, % subcutaneous fat or insulin resistance, counts or percentages may be used. Non-parametric Spearman's rank correlation may be used to assess the associations between anthropometric measurements (e.g., waist-to-height ratio (WHtR), waist to hip ratio (WHR), waist circumference, and BMI) of obesity with risk factors (e.g., mortality, morbidity, survival, etc.). Anthropometric measurements may also be converted to z-scores (original value subtracted by the mean and result divided by the SD) to represent the number of SDs above and below the mean for each subject. Logistic regression may be used to assess the effects of each standardized anthropometric measurement of being above the recommended treatment thresholds for various risk score models (computed for each SD increment above the mean for each anthropometric measure of obesity). Odds ratio (OR) and associated 95% confidence intervals (CI) may be further used to compute the chance of being above the recommended thresholds for the specific risk score model (e.g., Framingham model). Sensitivity, specificity and area under the receiver operating characteristic (ROC) curve may be computed for each metric using software packages such as SPSS.
The biomarkers provided also allow for the assessment of the efficacy of a composition for treating obesity or a disease related thereto, e.g., diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy. For example, depending on the modulation of the levels or activities of the metabolites of the disclosure, which is brought about by a pharmaceutical composition (or drug) for treating obesity, determinations can be made regarding whether the composition (or drug) is effective in treating obesity or a disease related thereto. Similar methodology can be used in determining the relative efficacy of two or more compositions (or drugs) for treating obesity. Such assessments may be used, for example, in efficacy studies as well as in lead selection of compositions for treating obesity.
Representative examples of anti-obesity drugs include, but are not limited to, e.g., orlistat, locaserin, sibutramine, rimonabant, metformin, exenatide, pramlintide, phentermine, topiramate; insulin, acetylsalicylic acid, acarbose, miglitol, alogliptin, linagliptin, pioglitazone, saxagliptin, sitagliptin, simivastin, albiglutide, dulaglutide, liraglutide, nateglinide, repaglinide, dapagliflozin, canagliflozin, empagliflozin, glimepiride, rosiglitazone, gliclazide, glipizide, glyburide, chlorpropamide, tolazamide, tolbutamide or a combination thereof.
Accordingly, the instant disclosure provides methods of assessing the efficacy of a composition for treating obesity or a disease related thereto, e.g., diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy, comprising determining levels or activities of at least one metabolite of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) in a sample obtained from a subject having obesity or a disease related thereto, wherein the determination is made before and after administration of the composition to the subject, wherein a modulation in the activities or levels of the metabolites in the subject post-administration of the composition compared to the activities or levels of the metabolites in the subject pre-administration of the composition indicates that the composition is effective in treating obesity or a disease related thereto.
In some embodiments, there is provided a method for assessing the efficacy of a composition for treating obesity or a disease related thereto by monitoring the directionality of changes in the levels or activities of the metabolites of the disclosure compared to a reference standard. As noted in Table 3, the levels or activities of a subset of metabolites is increased in obese subjects compared to control (e.g., healthy subjects). When an effective anti-obesity composition is administered to such subjects in need, the levels or activities of such metabolites may attenuate and reach threshold levels (e.g., control) or even sub-threshold levels. Similarly, if the levels or activities of a subset of metabolites is decreased in obese subjects compared to controls (e.g., healthy subjects), administration of an anti-obesity composition to such subjects may increase the levels or activities of such metabolites in the subject such that a threshold level (e.g., control) or even supra-threshold level is attained. In short, an effective anti-obesity composition may reverse the directionality of changes in the levels or activities of the metabolites of the disclosure in the subject's sample compared to the levels or activities of the metabolites in healthy subject(s).
In some embodiments, there is provided a method for assessing the efficacy of a composition for treating obesity or a disease related thereto by monitoring changes in mBMI levels post-administration of the composition. When an effective anti-obesity composition is administered to such subjects in need, the subject's mBMI scores may attenuate and reach threshold levels (e.g., control) or even sub-threshold levels. Similarly, if the obese subject's baseline mBMI scores is lower prior to administration of an anti-obesity composition, such that intake of the anti-obesity composition increases mBMI score for the subject, and then the composition is deemed not be effective for treating obesity.
As with the other methods described herein, the comparisons made in the methods of monitoring progression/regression of obesity or a disease related thereto, e.g., diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy, may be carried out using various techniques, including simple comparisons, statistical analyses (e.g., regression), and combinations thereof.
The results of the determinations may be used along with other methods for clinical monitoring of progression/regression of the disease or condition in a subject.
The metabolites (or derivatives thereof) provided in Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) also allow for the identification of subjects in whom the composition for treating obesity or a disease related thereto such as diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy, is efficacious (i.e., patient responds to the therapeutic agent). For example, the identification of metabolites (or derivatives thereof) for obesity also allows for assessment of the subject response to a composition for treating obesity as well as the assessment of the relative patient response to two or more compositions for treating obesity. Such assessments may be used, for example, in targeted therapy of obesity or diseases related thereto. For instance, based on the results of the aforementioned tests, certain types of anti-obesity drugs may be favored over other types of anti-obesity drugs in certain subjects based on whether the subject is known to respond to the particular anti-obesity drug.
Thus, also provided are methods of predicting the response of a patient to a composition for treating obesity or a disease related thereto, e.g., diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy. The predictive method comprises (a) analyzing in a biological sample obtained from a subject having obesity or a disease related thereto, e.g., diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy, which subject is currently or previously being treated with a composition, the levels or activities of one or more metabolites (or derivatives thereof) of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7); and (b) comparing the levels or activities of one or more metabolites of (a) in the sample to the levels or activities of one or more metabolites of (a) in a previously-taken biological sample from the subject, wherein the previously-taken biological sample was obtained from the subject before being treated with the composition. The results of the comparison are indicative of the response of the patient to the composition for treating the respective disease or condition. Preferably, the methods of predicting the response (i.e., measuring responsiveness) is carried out by measuring mBMI scores of the subject prior to and after administration of the composition for treating obesity or a disease related thereto.
The aforementioned methods can be used to monitor whether or not a patient is responding to an agent for treating obesity or a disease related thereto. If the comparisons indicate that the levels or activities of one or more metabolites (or derivatives thereof) of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) are increasing or decreasing over time to become more similar to the disease- or condition-negative reference levels (or less similar to the disease- or condition-positive reference levels), then the results are indicative of the patient responding to the anti-obesity agent.
It should be noted that responsiveness to the test agent or clinically-approved therapeutic agent can be made at any time after the first sample is obtained. In one aspect, the second sample (for measuring the responsiveness to a test agent or clinically-approved agent) is obtained 1, 2, 3, 4, 5, 6, or more days after the first sample. In another aspect, the second sample is obtained 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more weeks after the first sample or after the initiation of treatment with the composition. In another aspect, the second sample may be obtained 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, or more months after the first sample or after the initiation of treatment with the composition.
As with the other methods described herein, in the aforementioned methods for determining the subject's responsiveness to a test agent or clinically-approved therapeutic agent for treating obesity or a disease related thereto, e.g., diabetes, metabolic syndrome, atherosclerosis, or cardiomyopathy, may be carried out using various techniques, including simple comparisons, one or more statistical analyses, including combinations thereof.
The aforementioned methods are useful in identifying responders and/or non-responders to novel therapeutic agents that may at various stages of clinical testing. In particular, the aforementioned methods allow clinicians to stratify high-risk obese individuals and to assess the efficacy of therapeutic candidates more effectively and safely. A new diagnostic test that discriminates non-responding from responding patients to a therapeutic would enable pharmaceutical companies to identify and stratify patients that are likely to respond to the therapeutic agent and target specific therapeutics for certain cohorts that are likely to respond to the therapeutic. Accordingly, the methods of the disclosure not only provide cost-saving measures to pharmaceutical companies but also enable hospitals and dispensaries to deliver individualized and targeted therapy to patients by improving drug efficacy and concomitantly reducing the side effects.
The biomarkers provided herein also allow for the screening of compositions for activity in modulating metabolites (or derivatives thereof) that are associated with obesity or a disease related thereto, such as diabetes, metabolic syndrome, atherosclerosis, and cardiomyopathy, which may be useful in treating the disease or condition. Such methods comprise assaying test compounds for activity in modulating the levels or activities of one or more metabolites (or derivatives thereof) of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7). Such screening assays may be conducted in vitro and/or in vivo, and may be in any form known in the art useful for assaying modulation of such metabolites (or derivatives thereof) in the presence of a test composition such as, for example, cell culture assays, organ culture assays, and in vivo assays (e.g., assays involving animal models). For example, the identification of metabolites (or derivatives thereof) associated with obesity also allows for the screening of compositions for activity in modulating metabolites (or derivatives thereof) associated with obesity, which may be useful in treating obesity. Methods of screening compositions useful for treatment of obesity (or a disease related thereto) comprise assaying test compositions for activity in modulating the levels of one or more metabolites (or derivatives thereof) of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7). Although obesity is discussed in this example, the other diseases and conditions, such as diabetes, metabolic syndrome, atherosclerosis, and cardiomyopathy, may also be diagnosed in accordance with this method.
The disclosure also provides methods of identifying potential drug targets for diseases or conditions such as obesity or a disease related thereto, such as, diabetes, metabolic syndrome, atherosclerosis, and cardiomyopathy, using the biomarkers listed in Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7). A method for identifying a potential drug target for obesity or a disease related thereto, such as, diabetes, metabolic syndrome, atherosclerosis, and cardiomyopathy, comprises (a) identifying one or more biochemical pathways associated with one or more metabolites (or derivatives thereof) of Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7); and (b) identifying a protein (e.g., an enzyme) affecting at least one of the one or more identified biochemical pathways, the protein being a potential drug target for the disease or condition. For example, the identification of biomarkers for obesity also allows for the identification of potential drug targets for obesity. Representative pathways implicated in obesity are provided in Table 3 and include, e.g., (a) alanine and aspartate metabolism; (b) glutamate metabolism; (c) leucine, isoleucine and valine metabolism; (d) phenylalanine and tyrosine metabolism; (e) polyamine metabolism; (f) tryptophan metabolism; (g) glycine, serine and threonine metabolism; (h) fructose, mannose and galactose metabolism; (i) glycolysis, gluconeogenesis, and pyruvate metabolism; (j) ascorbate and aldarate metabolism; (k) nicotinate and nicotinamide metabolism; (l) TCA cycle; (m) carnitine metabolism (k) diacylglycerol fatty acid metabolism (also BCAA Metabolism); (l) phospholipid metabolism; (m) sphingolipid metabolism; (n) lysolipid metabolism; (o) plasmalogen; (p) steroid; (q) purine metabolism or (Hypo)xanthine/inosine containing; (r) purine metabolism adenine containing; (s) purine metabolism, guanine containing; (t) dipeptide derivative; (u) gamma-glutamyl amino acid (v) food component/plant-based xenobiotics metabolism.
Accordingly, the disclosure relates to one or more biochemical pathways (e.g., biosynthetic and/or metabolic (catabolic) pathway) that are associated with one or more metabolites (or derivatives thereof) which in turn are associated with obesity or a disease related thereto.
As is known in the art, pathway analysis is useful in drug discovery. For instance, a build-up of one metabolite (e.g., a pathway intermediate) may indicate the presence of a ‘block’ downstream of the metabolite and the block may result in a low/absent level of a downstream metabolite (e.g. product of a biosynthetic pathway). In a similar manner, the absence of a metabolite could indicate the presence of a ‘block’ in the pathway upstream of the metabolite resulting from inactive or non-functional enzyme(s) or from unavailability of biochemical intermediates that are required substrates to produce the product. Alternatively, an increase in the level of a metabolite could indicate a genetic mutation that produces an aberrant protein which results in the over-production and/or accumulation of a metabolite which then leads to an alteration of other related biochemical pathways and result in dysregulation of the normal flux through the pathway; further, the build-up of the biochemical intermediate metabolite may be toxic or may compromise the production of a necessary intermediate for a related pathway. It is possible that the relationship between pathways is currently unknown and this data could reveal such a relationship.
The proteins identified as potential drug targets may then be used to identify compositions that may be potential candidates for treating a particular disease or condition, such as obesity, including compositions for gene therapy.
In another aspect, the disclosure relates to methods for treating obesity or a disease related thereto such as diabetes, metabolic syndrome, atherosclerosis, and cardiomyopathy. The methods generally involve treating a subject obesity or a disease related thereto, e.g., with an effective amount of a pharmaceutical composition (e.g., an anti-obesity drug), or with surgery or lifestyle therapy, until the levels or activities of metabolites of Table 1-7 are modulated. More specifically, the disclosure provides methods for treating obesity or a disease related thereto comprising (a) detecting levels and/or activities of a plurality of metabolites (or derivatives thereof) (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 39, 40, 50, 80, 100, 150, 200, 250, 300, 307, or more, e.g., 500 metabolites or derivatives thereof) from Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) in a biological sample obtained from the subject; (b) diagnosing subject with obesity or a disease related thereto if the levels or activities of the metabolites of (a) are modulated compared to a reference standard; and (c) administering an effective amount of a therapy selected from the group consisting of anti-obesity pharmacotherapy, surgery, and lifestyle therapy to the subjects of (b) who are diagnosed with obesity or a disease related thereto.
Particularly, the disclosure provides methods for treating obesity or a disease related thereto comprising (a) detecting levels and/or activities of a plurality of metabolites (or derivatives thereof) (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 39, 40, 50, 80, 100, 150, 200, 250, 300, 307, or more, e.g., 500 metabolites or derivatives thereof) from Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) in a biological sample obtained from the subject and computing a metabolomic body mass index (mBMI) value for the subject based on the detection; (b) diagnosing subject with obesity or a disease related thereto if the mBMI value of the subject is modulated compared to a reference standard; and (c) administering an effective amount of a therapy selected from the group consisting of anti-obesity pharmacotherapy, surgery, and lifestyle therapy to the subjects of (b) who are diagnosed with obesity or a disease related thereto. In some embodiments, the reference standard includes a subject's BMI, wherein if mBMI>>BMI, then the subject is administered an anti-obesity drug. Optionally, the method may comprise determining an additional feature (e.g., blood pressure, waist/hip ratio, android/gynoid ratio, % body fat, % visceral fat, % subcutaneous fat or insulin resistance) and using that determination, together with mBMI values, regarding whether the subject should take the anti-obesity drug.
In the therapeutic embodiments described above, subjects whose mBMI exceeds BMI by at least 20%, 30%, 40%, 50%, 60%, 80%, 100% (i.e., 1-fold increase), 150%, 200%, 250%, 300%, or more, e.g., 500%, are treated with the anti-obesity drug.
In some embodiments, the metabolites (or derivatives thereof), as disclosed in Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7), may also serve as markers for genetic deficiency (e.g., leptin deficiency) or diseases such as hypothyroidism, insulin resistance, polycystic ovary syndrome, Cushing's syndrome and Prader-Willi syndrome, which may lead to obesity. For example, it is believed that at least some of the metabolites that are biomarkers of obesity may also serve as markers for one or more of these underlying causes of obesity. That is, the methods described herein with respect to obesity (or a disease related thereto) may also be used for diagnosing underlying conditions of obesity. Similarly, methods of assessing efficacy of compositions for treating obesity (or a disease related thereto), methods of screening a composition for activity in modulating metabolites associated with obesity (or a disease related thereto), methods of identifying potential drug targets for treating obesity (or a disease related thereto), and methods of treating obesity (or a disease related thereto) may be conducted in the context of diagnosis, evaluation, therapy, maintenance of underlying conditions and also for screening agents for the same purpose.
Other methods of using the biomarkers discussed herein are also contemplated. For example, the methods described in U.S. Pat. Nos. 7,005,255; 7,635,556; 7,329,489, 7,682,783; 7,682,784 and 7,550,258 may be conducted using a small molecule profile comprising one or more of the biomarkers disclosed herein.
Kits
The disclosure also relates to kits for detecting the presence of metabolite(s) or their derivatives, e.g., the metabolites (or derivatives thereof) disclosed in Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7), in a biological sample (a test sample). Such kits can be used to determine if a subject is suffering from or is at increased risk of developing a disorder associated with the metabolite (e.g., obesity or a disease related thereto). For example, the kit can comprise a labeled compound or agent capable of detecting the metabolite (or derivative thereof) in a biological sample and reagent/equipment for determining the amount of the metabolite (or derivative thereof) in the sample (e.g., an antibody against the metabolite or its derivative). Preferably, kits comprise one or more reagents to preserve the analyte (i.e., metabolites or derivatives thereof) and prevent contamination thereof. Typically, sodium azide (10%) may be used to prevent bacterial contamination. Kits may also include reagents for extraction of metabolites, e.g., acetonitrile, methanol, or chloroform, etc. Perchloric acid (PCA) may be included in metabolites are to be extracted from adherent cell culture and mammalian tissues.
Kits may also include instruction for observing that the tested subject is suffering from or is at risk of developing obesity or a disease related thereto if the amount of the metabolite is above or below a normal level. The kit may also comprise, e.g., a buffering agent, a preservative, or a stabilizing agent. The kit may also comprise components necessary for detecting the detectable agent (e.g., a substrate). The kit may also contain a control sample or a series of control samples which can be assayed and compared to the test sample contained. Each component of the kit is usually enclosed within an individual container and all of the various containers are within a single package along with instructions for observing whether the tested subject is suffering from or is at risk of developing obesity or a disease related thereto.
Especially, provided herein is a kit for determining a lipid or fat content of a biological sample, comprising, a plurality of probes for detecting a metabolite profile of the biological sample; vessels for holding the biological sample; optionally together with instructions for performing the detection, wherein the metabolite profile comprises at least three of the metabolites of Table 1 or derivatives thereof, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the lipid or fat content, urate, 5-methylthioadenosine, and glutamate or derivatives thereof.
Further, provided herein is a kit for determining a lipid or fat content of a biological sample, comprising, a plurality of probes for detecting a metabolite profile of the biological sample; vessels for holding the biological sample; optionally together with instructions for performing the detection, wherein the metabolite profile comprises at least three of the metabolites of Table 2 or derivatives thereof, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the lipid or fat content, urate, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1) or derivatives thereof.
Computer-Implemented Systems
In various embodiments, computer system 400 can be coupled via bus 402 to a display 412, such as a cathode ray tube (CRT) or liquid crystal display (LCD), for displaying information to a computer user. An input device 414, including alphanumeric and other keys, can be coupled to bus 402 for communicating information and command selections to processor 404. Another type of user input device is a cursor control 416, such as a mouse, a trackball or cursor direction keys for communicating direction information and command selections to processor 404 and for controlling cursor movement on display 412. This input device 414 typically has two degrees of freedom in two axes, a first axis (i.e., x) and a second axis (i.e., y), that allows the device to specify positions in a plane. However, it should be understood that input devices 414 allowing for 3 dimensional (x, y and z) cursor movement are also contemplated herein.
Consistent with certain implementations of the present teachings, results can be provided by computer system 400 in response to processor 404 executing one or more sequences of one or more instructions contained in memory 406. Such instructions can be read into memory 406 from another computer-readable medium or computer-readable storage medium, such as storage device 410. Execution of the sequences of instructions contained in memory 406 can cause processor 404 to perform the processes described herein. Alternatively hard-wired circuitry can be used in place of or in combination with software instructions to implement the present teachings. Thus, implementations of the present teachings are not limited to any specific combination of hardware circuitry and software. The term “computer-readable medium” (e.g., data store, data storage, etc.) or “computer-readable storage medium” as used herein refers to any media that participates in providing instructions to processor 404 for execution. Such a medium can take many forms, including but not limited to, non-volatile media, volatile media, and transmission media. Examples of non-volatile media can include, but are not limited to, optical, solid state, magnetic disks, such as storage device 410. Examples of volatile media can include, but are not limited to, dynamic memory, such as memory 406. Examples of transmission media can include, but are not limited to, coaxial cables, copper wire, and fiber optics, including the wires that comprise bus 402.
Common forms of computer-readable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD-ROM, any other optical medium, punch cards, paper tape, any other physical medium with patterns of holes, a RAM, PROM, and EPROM, a FLASH-EPROM, any other memory chip or cartridge, or any other tangible medium from which a computer can read.
In addition to computer readable medium, instructions or data can be provided as signals on transmission media included in a communications apparatus or system to provide sequences of one or more instructions to processor 404 of computer system 400 for execution. For example, a communication apparatus may include a transceiver having signals indicative of instructions and data. The instructions and data are configured to cause one or more processors to implement the functions outlined in the disclosure herein. Representative examples of data communications transmission connections can include, but are not limited to, telephone modem connections, wide area networks (WAN), local area networks (LAN), infrared data connections, NFC connections, etc.
It should be appreciated that the methodologies described herein flow charts, diagrams and accompanying disclosure can be implemented using computer system 400 as a standalone device or on a distributed network of shared computer processing resources such as a cloud computing network.
The methodologies described herein may be implemented by various means depending upon the application. For example, these methodologies may be implemented in hardware, firmware, software, or any combination thereof. For a hardware implementation, the processing unit may be implemented within one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), processors, controllers, micro-controllers, microprocessors, electronic devices, other electronic units designed to perform the functions described herein, or a combination thereof.
In various embodiments, the methods of the present teachings may be implemented as firmware and/or a software program and applications written in conventional programming languages such as C, C++, Python, etc. If implemented as firmware and/or software, the embodiments described herein can be implemented on a non-transitory computer-readable medium in which a program is stored for causing a computer to perform the methods described above. It should be understood that the various engines described herein can be provided on a computer system, such as computer system 400, whereby processor 404 would execute the analyses and determinations provided by these engines, subject to instructions provided by any one of, or a combination of, memory components 406/4008/410 and user input provided via input device 414.
In various embodiments, provided herein is an obesity profiling system, comprising: (a) a metabolome detector/analyzer configured to detect/analyze levels or activities of a plurality of metabolites (or derivatives thereof) (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 39, 40, 50, 80, 100, 150, 200, 250, 300, 307, or more, e.g., 500 metabolites or derivatives thereof) from Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) in a subject's biological sample; (b) an obesity determining engine configured to determine obesity based on levels and/or activities of metabolites or derivatives thereof, wherein the engine is optionally communicatively connected to a data source (e.g., human metabolome database); and (c) a display communicatively connected to a computing device and configured to display a report containing the subject's obesity profile.
In some embodiments, provided herein is an obesity profiling system, comprising: (a) a metabolome detector/analyzer configured to detect/analyze levels or activities of a plurality of metabolites (or derivatives thereof) (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 39, 40, 50, 80, 100, 150, 200, 250, 300, 307, or more, e.g., 500 metabolites or derivatives thereof) from Table 11, Table 12A, Table 12B,9 or Tables 1-7 (preferably Tables 2-7) in a subject's biological sample; (b) an obesity determining engine configured to determine obesity based on levels and/or activities of metabolites, wherein the engine is optionally communicatively connected to a data source (e.g., human metabolome database); and (c) a display communicatively connected to a computing device and configured to display a report containing the subject's obesity profile, wherein components (a), (b) and (c) are communicatively connected to each other, e.g., via the internet.
In some embodiments, provided herein is an obesity profiling system, comprising: (a) a metabolome detector/analyzer configured to detect/analyze levels or activities of a plurality of metabolites (or derivatives thereof) (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 39, 40, 50, 80, 100, 150, 200, 250, 300, 307, or more, e.g., 500 metabolites or derivatives thereof) from Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) in a subject's biological sample, wherein the analyzer is communicatively connected to an obesity determining engine configured to determine obesity based on levels and/or activities of metabolites; (b) a data source (e.g., human metabolome database); and (c) a display communicatively connected to a computing device and configured to display a report containing the subject's obesity profile, wherein components (a), (b) and (c) are communicatively connected to each other, e.g., via the internet.
In some embodiments, provided herein is an obesity profiling system, comprising: (a) a metabolome detector/analyzer configured to detect/analyze levels or activities of a plurality of metabolites (or derivatives thereof) (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 39, 40, 50, 80, 100, 150, 200, 250, 300, 307, or more, e.g., 500 metabolites or derivatives thereof) from Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) in a subject's biological sample; (b) an obesity determining engine configured to determine obesity based on levels and/or activities of metabolites; (c) a data source (e.g., human metabolome database); and (d) a display communicatively connected to a computing device and configured to display a report containing the subject's obesity profile, wherein components (a), (b), (c) and (d) are communicatively connected to each other, e.g., via the internet.
In some embodiments, provided herein are obesity profiling systems of the foregoing, comprising a metabolome detector/analyzer configured to detect/analyze levels or activities of at least 3 metabolites of Table 1 or derivatives thereof in a subject's biological sample, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, 5-methylthioadenosine, and glutamate.
In some embodiments, herein are obesity profiling systems of the foregoing, comprising a metabolome detector/analyzer configured to detect/analyze levels or activities of at least 3 metabolites of Table 2 or derivatives thereof in a subject's biological sample, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1).
In some embodiments, provided herein is an obesity profiling system of the foregoing, further comprising an analyzer for detecting a secondary parameter in the subject's sample optionally together with a genetic parameter. Preferably, the secondary parameter is selected from the group consisting of android/gynoid ratio; total triglycerides; waist/hip ratio; subcutaneous fat; visceral fat; insulin resistance; HDL; percent fat; diastolic blood pressure; systolic blood pressure; total cholesterol; and LDL, or a combination thereof, particularly preferably, android/gynoid ratio; total triglycerides; waist/hip ratio; subcutaneous fat; visceral fat; insulin resistance; and HDL. Preferably, the genetic parameter is selected from genetic variants of melanocortin 4 receptor gene (MC4R) or a lipdystrophy gene selected from zinc metallopeptidase STE24 (ZMPSTE24), 1-acylglycerol-3-phosphate O-acyltransferase 2 (AGPAT2), lipase E, hormone sensitive type (LIPE), Bernardinelli-Seip congenital lipodystrophy type 2 (BSCL2), or a combination thereof. Especially, the analyzer analyzes, whether the subject's sample comprises an MC4R variant selected from M292fs, R236C, S180P, A175T, and T11A, but not I170V; and/or whether the subject's sample comprises a genetic variant of ZMPSTE24, AGPAT2, LIPE, BSCL2, or a combination thereof.
The disclosure further relates to computer readable medium comprising computer-executable instructions, which, when executed by a processor, cause the processor to carry out a method or a set of steps for diagnosing obesity in a subject. In some embodiments, the computer readable media carry out a method or a set of steps for diagnosing obesity in a subject, comprising detecting levels or activities of a plurality of metabolites (or derivatives thereof) (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 39, 40, 50, 80, 100, 150, 200, 250, 300, 307, or more, e.g., 500 metabolites or derivatives thereof) from Table 11, Table 12A, Table 12B, or Tables 1-7 (preferably Tables 2-7) in a subject's biological sample, wherein the computer readable medium comprises machine learning techniques to determine obesity of subject based on the metabolite profile.
In some embodiments, the computer readable media carry out a method or a set of steps for diagnosing obesity in a subject, comprising detecting a metabolite profile in a metabolome dataset received from a subject's sample, wherein the metabolite profile comprises levels or activities of at least three metabolites of Table 1 or derivatives thereof and the computer readable medium comprises machine learning techniques to determine obesity of subject based on the metabolite profile, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, 5-methylthioadenosine, and glutamate.
In some embodiments, the computer readable media carry out a method or a set of steps for diagnosing obesity in a subject, comprising detecting a metabolite profile in a metabolome dataset received from a subject's sample, wherein the metabolite profile comprises levels or activities of at least three metabolites of Table 2 or derivatives thereof and the computer readable medium comprises machine learning techniques to determine obesity of subject based on the metabolite profile, wherein the at least 3 metabolites comprises, in the order of rank of relative correlation to the subject's obesity, urate, glutamate and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC (P-16:0/18:1).
The aforementioned embodiments of the disclosure are further described in view of the following non-limiting examples.
The structures, materials, compositions, and methods described herein are intended to be representative examples of the disclosure, and it will be understood that the scope of the disclosure is not limited by the scope of the examples. Those skilled in the art will recognize that the disclosure may be practiced with variations on the disclosed structures, materials, compositions and methods, and such variations are regarded as within the ambit of the disclosure.
There are recent calls to improve phenotyping in very large numbers of people with obesity with the goals of understanding factors that make people susceptible to (or protected from) obesity, accompanied by a better elucidation of the factors that account for variability in success of different obesity treatments. Here, longitudinal body mass index (BMI), anthropomorphic measurements, whole body DXA scans and genetic risk and metabolite data were analyzed from 2,601 individuals. The metabolome assay covered up to 1,007 metabolites at up to three time-points per person. Associations between nearly a third of the metabolome and BMI were identified, and it was revealed that metabolite levels can explain ˜40% of the variation in BMI and can predict obesity status with ˜80-90% specificity and sensitivity. The metabolome profile is a strong indicator of metabolic health compared to the polygenic risk and anthropomorphic measurement of BMI.
Methods
Samples and study design—The study included 1,969 European ancestry twins enrolled in the TWINSUK registry, a British national register of adult twins. A detailed study of the genetic variants influencing the human metabolome in this cohort has been previously reported in Long et al. (Nature Genetics, 49, 568-578, 2017). Serum samples were collected at three visits, 8-18 (median 13) years apart. The cohort is mainly composed of females (96.7%), and the sample set included 388 monozygotic twin pairs, 519 dizygotic twin pairs, and 155 unrelated individuals. The age of participants at the first time point ranged from 33 to 74 years old (median 51); 36 to 81 years old (median 59) at the second time point; and 42 to 88 years old (median 65) at the third time point. The BMI values measured at each metabolome time point were taken within two years of the blood draw date. At baseline, 36.3% of the female participants and 53.8% of the male participants were overweight, and 16.9% of the females and 10.8% of the males were obese. The twins study was approved by Ethics Committee, and all participants provided informed written consent. BMI data were available for 1743 participants within two years of the time point for metabolome time point 1, 1834 for within two years of time point 2, and 1777 for up to 2 years before time point 3 or 4 years after this time point; 1,458 individuals had all three data points. For independent validation and studies of phenotypes correlated with metabolic BMI outliers, 617 unselected adults more than 18 years old who were available for a clinical research protocol were enrolled. Participants underwent a verbal review of the institutional review board-approved consent. Participants ranged in age from 18-89 years old (median 53), were 32.9% female, and had BMI data measured at one time point: 16.7% of the female participants and 47.5% of the male participants were overweight, and 7.2% of the females and 23.7% of the males were obese.
Phenotyping—Individuals in the TWINSUK cohort and Health Nucleus both underwent DEXA imaging. The data from these scans were used to calculate android/gynoid ratio, percent body fat, visceral fat, and subcutaneous fat. DEXA is very accurate in the measurement abdominal visceral adipose tissue (VAT). High levels of VAT are associated with atherogenic dyslipidemia, hyperinsulinemia, and glucose intolerance (Neeland et al., Circulation, 137, 1391-1406, 2018). TWINSUK cohort participants were additionally measured for circumference at the waist and hip using a measuring tip to calculate the waist/hip ratio. TWINSUK participants self-reported information about whether they were taking high blood pressure medication at their first visit and about cardiovascular events and their timing via a survey at the final visit. MRI images were available for a selected number of Health Nucleus participants. Insulin resistance was defined by HOMA score >3 (available on the world-wide-web at gihep(dot)com/calculators/other/homa/).
Metabolite Profiling—The non-targeted metabolomics analysis of 901 metabolites in the TWINSUK cohort and 1,007 metabolites in the Health Nucleus cohort was performed at Metabolon, Inc. (Durham, N.C., USA) on a platform consisting of four independent ultra-high performance liquid chromatography-tandem mass spectrometry (UPLC-MS/MS) methods. The detailed descriptions of the platform can be found in previous publications (Long et al., Nature Genetics, 49, 568-578, 2017). For the TWINSUK cohort, blood serum after fasting was used for analysis, and the resulting raw values were transformed to z scores using the mean and standard deviation. For the Health Nucleus cohort, blood plasma after fasting was used for analysis, and values from multiple experimental batches were normalized into Z-scores based on a reference cohort of either 42 (n=457) or 300 (n=176) self-reported healthy individuals run with each batch. The 42 and 300-normalized batches were converted to the same scale using linear transformation based on the values obtained from 7 runs that included both the 42 and 300 controls. Samples with metabolite measurements that were below the detection threshold were imputed as the minimum value for that metabolite.
Genome sequencing and analysis—As previously described, DNA samples were sequenced on an Illumina HISEQX sequencer utilizing a 150 base paired-end single index read format. Reads were mapped to the human reference sequence build HG38. Variants were called using ISIS Analysis Software (v. 2.5.26.13; Illumina). A linear mixed model was applied to account for family structure in the cohort while testing for associations between genetic variants and the different phenotypes: BMI; BMI prediction model values and residuals after accounting for BMI, age, sex; and levels of the 49 BMI-associated metabolites. A genetic similarity matrix (GSM) was constructed from 301,556 variants that represented a random 20% of all common (MAF>5%) variants genome-wide after linkage-disequilibrium (LD) pruning (r2 less than 0.6, window size 200 kb) and was used to model the random effect in the linear mixed model via a “leave-out-one-chromosome” method for each tested variant. Each of 97 known BMI-associated variants was tested independently using customized Python scripts wrapping the FAST-LMM package. Principal component axes were calculated to check ethnicity using plink, and the first principal component for those of European ancestry was used as a covariate in analyses of unrelated individuals in R described below. Polygenic risk scores were calculated using genotypes for 97 variants whose associations and betas had been published previously. For rare variant analysis in the gene MC4R, coding and splice variants with MAF<0.1% were analyzed with a gene-based collapsing analysis where all qualifying variants in the gene were grouped together, again using customized Python scripts wrapping the FAST-LMM package and the same GSM described above to account for relatedness. Rare lipodystrophy variants were defined as those achieving a pathogenic or likely pathogenic categorization in ClinVar or HGMD.
Statistical analysis—R was used for the analysis and data manipulation. Bonferroni correction was used for all analyses, and most statistical analyses were restricted to unrelated individuals of European ancestry, in accordance with field standards for ensuring that ancestry differences do not cause bias or skew in the results. For each quantitative analysis of BMI or other traits, the subset of BMI values or other outcome variables used were rank-ordered and forced to a normal distribution. Analyses comparing metabolites to BMI were performed in R using the 1 m function, and age, sex, and the first genetic principal component were included as covariates. The obesity prediction model was built using ridge regression (alpha=0) with glmnet in R. The residuals used to separate participants into the five categories shown in
For the analysis of change in BMI across visits 1, 2 and 3, the slope of the change for each person was calculated (change in BMI vs. change in years). For the analyses of BMI recovery, participants were separated into four categories based on having values that were at least one standard deviation above or below the mean for the BMI change at that time point (
Results
Profound Perturbation of Metabolome by Obesity (Metabolites Associated with Body Mass Index).
The levels of 901 metabolites to the BMIs of 832, 882, and 861 unrelated individuals of European ancestry in the TWINSUK cohort at three time points spanning a total range of 8-18 years were compared. Initially, 284 metabolites that were significantly associated (p<5.5×10-5) with BMI at one or more time points were identified (Table 11, Table 12A and Table 12B). The study focused on 110 metabolites that were significantly associated with BMI at all 3 time points and sought to replicate the associations in an independent sample of 427 unrelated individuals of European ancestry out of the 617 participants in the Health Nucleus cohort. Of the 84 metabolites that had been measured in both cohorts, 83 showed directions of effect that were consistent between the two cohorts, and 49 were statistically significant replications (
The 49 metabolites that associated with BMI were primarily lipids (n=23, accounting for 7.5% of all lipids assayed across both cohorts) and amino acids (n=14, 9.3% of all amino acids) as well as nucleotides (n=3, 12.0% of all nucleotides), peptides (n=3, 12% of all peptides), and other categories (n=6, see
Patterns in Metabolite Change According to BMI.
The majority of the 49 BMI-associated metabolites increased with increasing BMI (n=35) (
Of particular interest was the association with cortisone, a metabolite of the steroid hormone cortisol. The results show lower levels among the obese individuals, which is consistent with previous reports. There are, however some inconsistent relationships between cortisol and metabolic parameters in the literature. Additionally, each of the 49 metabolites in just those of normal weight, overweight or obese separately were examined. The directionality of the effect was found to be largely consistent with those seen in the group as a whole (Table 12A or Table 12B).
Modelling the Metabolome of Obesity—
Ridge regression was used to build a model that would predict BMI from the 49 BMI-associated metabolites (see
Richer models using the full set of available metabolites (n=650 measured in both cohorts) improved the accuracy of the model (47-49% of the variance explained) and could be considered as the optimal approach by accepting the additional cost of a full untargeted metabolome as compared to the more targeted panel of 49 metabolites. This performance should be contrasted to the results of models using routine clinical laboratory determinations: regression analysis predicting BMI from age, sex, HDL, LDL, total cholesterol and total triglycerides explained 31% of the variation in BMI, whereas a model using age, sex and the 49 metabolite signature explained 43% of the variation. In fact, even though mBMI was modeled by training on BMI, this metabolite signature had a better correlation than BMI with most health-related phenotypes measured here (Table 2).
Identification and Characterization of Metabolic BMI Outliers—
Having established a model to predict BMI using the metabolome, the participants were split into 5 groups (
Having characterized these outliers, metabolome differences were revisited. As expected, those with mBMI substantially higher than BMI significantly differed in their metabolite levels from those with mBMI substantially lower than BMI for most of the 49 BMI-associated metabolites. However, two of the BMI-associated metabolites did not differ between these two groups: asparagine and cortisone. The association between each of the BMI-associated metabolites and insulin resistance was additionally investigated, which as many previously reported markers of obesity have also been markers of diabetes. Insulin resistance measurements were taken for 515 unrelated, European-ancestry participants. After controlling for BMI, it was found that 12 of the 49 BMI-associated metabolites were significantly associated (correcting for 49 tests requires p<0.001) with insulin resistance, all with positive direction of effect: tyrosine, alanine, kynurenate, gamma-glutamyltyrosine, 1-oleoyl-3-linoleoyl-glycerol (18:1/18:2), and six phospholipids, and glucose (see Tables 11, 12A, Table 12B).
Principal component analysis of the main 49 BMI-associated metabolites indicated that the first principal component, which was most heavily influenced by nucleotides and amino acids, explained 19.7, 19.8, and 21.4% of the overall variation in these metabolites at time points 1, 2 and 3 and 22.5% in the Health Nucleus. The first principal component also explained 17.9%, 28.6%, and 29.9% of the variation in BMI at these time points and 48.9% in the Health Nucleus. This difference in explanatory power across cohorts likely reflected differences in cohort composition, especially the difference in sex ratios across studies. The TWINSUK cohort was 96.7% female, while the Health Nucleus cohort was 32.9% female; when restricting to females within the Health Nucleus cohort, the first principal component only explained 32.9% of the variation in BMI. The first principal component was not only useful for distinguishing obese from non-obese: even among those who were obese, this component respectively explained 4.0, 13.5, 10.9, and 16.1% of the variation in obese BMI. This first principal component was a robust and reliable predictor of BMI, with the majority of the 49 metabolites having strong influences on this component at all three visits. Some of the most important contributors to this axis included metabolites involved in nucleotide metabolism, such as urate and pseudouridine, and diverse amino acids, especially branched-chain amino acids. The subsequent axes were more prone to changing their contributing metabolites across visits, but they were consistently strongly influenced by key metabolites as shown in
BMI Changes Over Time—
BMI data from TWINSUK were available for all three-time points for 1,458 participants. On average, participants gained 0.91 BMI between the first and second visits, when the mean age increased from 51 to 58, and lost 0.09 BMI between the second and third visits, when the mean age increased to 64 (
Predictors of Changes in BMI—
BMI change over the course of the study as a phenotype in analyses was used to identify metabolites or demographic factors that could predict weight change in the 695 TWINSUK participants with weight at all three time points who were unrelated and genetically of European ethnicity. It was found that age at the start of the study was by far the most significant predictor of weight change, explaining 9.4% of the variation in slope of BMI change. Menopause status at the beginning and end of the study explained an additional 1.5% of the variation and time between visits explained 0.5% more, while initial BMI and sex were not predictors of change in BMI over time. No single metabolite at time point 1 was significantly (p<5.5×10-5) associated with the slope of subsequent BMI change after controlling for initial BMI, age and time between visits. Likewise, the BMI prediction made using 49 metabolites from visit 1 was not significantly associated with subsequent weight change. The lack of association with change in BMI thus show that the perturbation in metabolite patterns was likely a consequence of the BMI changes as opposed to a contributing factor.
Metabolite Recovery after BMI Change—
To confirm this direction of effect, a study was conducted to investigate whether longitudinal changes in weight were reflected by longitudinal changes in metabolite levels within the same person. It was found that when tracking an individual's weight across visits, their metabolite changes generally reflected their weight changes. For example, 73% of the 41 participants who were classified as having gained weight between time point 1 and 2 and then losing weight between time points 2 and 3 had metabolome BMI model predictions increase at time point 2 and then decrease again at time point 3, demonstrating that metabolite changes associated with a BMI change could be reversed. Overall, participants who had substantial weight change between time points (as defined in the methods and
MC4R Variant Carriers with Low Polygenic Risk Scores—
Members of the study populations who were carrying rare (MAF<0.01%) coding variants in the known obesity gene MC4R were identified. Specifically, eight such carriers were identified in the subset of unrelated participants, with an enrichment in participants who were obese despite a low polygenic risk score. Out of 31 participants who were obese with polygenic risk scores in the lowest quartile, 6.1% were MC4R variant carriers, while the carrier frequency was just 0.3% in those of normal weight (
Evolution of Obesity and Metabolome Clinical Profiles—
Given recent research showing that obese individuals who are metabolically healthy may remain at higher risk of negative health outcomes than are normal weight individuals who are metabolically healthy, a study was conducted to address whether the outlier groups were more likely to become obese over time. Focusing on the 1,458 individuals from TWINSUK who had weight measurements at all three time points, it was found that those who had a mBMI that was higher than their BMI were marginally more likely to gain weight and convert to an obese phenotype (BMI>30) over the 8-18 years of follow up. For example, 32.8% of those of normal weight but with an overweight or obese metabolome converted to being overweight or obese by time point 3 compared to 24.8% of those who were of normal weight and had a healthy metabolome (p=0.02,
Cardiovascular Disease Outcomes—
Obesity is a well-recognized risk factor for cardiovascular disease and ischemic stroke. The longitudinal nature of the TWINSUK study allowed the collection of clinical endpoints in these unselected participants. The age of participants at the first visit ranged from 33 to 74 years old (median 51); and 42 to 88 years old (median 65) at the last visit. During the follow up (median 13 years), the study recorded 53 cardiovascular events (myocardial infarct, angina, angioplasty) or strokes for 1573 individuals. Participants with a healthy metabolome (normal BMI or obese) had 2 events per hundred individuals. Individuals with an obese metabolic profile had 3.7 (normal BMI) and 4.2 events (in obese individuals) per hundred individuals. Separated analysis of the various endpoints confirmed the trends, more accentuated for cardiovascular than for diagnosis of stroke (
Correlations Between Twins—
Because twin studies are important to analyze the heritability of traits, the BMI model predictions and obesity status of 350 sets of twins were reassessed, wherein either both twins had normal BMI (n=244), both twins were obese (n=67), or one was obese and the other had normal BMI (n=39). To keep the categories clear, individuals with BMIs between 25 and 30 (overweight) and their twins were excluded. As asserted by the model's high specificity and sensitivity, the metabolite-based obesity predictions tended to reflect the actual obesity statuses of the individuals. This was even the case when only one twin was obese: the obese twin was generally predicted by their metabolome to be obese, while the normal weight twin was not (
These outliers were thought to represent the healthy obese and normal weight, metabolically unhealthy individuals described above.
Genetic Analyses
Known Genetics of Obesity—
The study first investigated the known genetic factors contributing to high BMI. Polygenic scores for BMI were calculated using known associations from the considerable literature of obesity and BMI GWAS. As previously reported, it was found that polygenic risk score only explained 2.2% of the variation in BMI at each of the three TWINSUK time points and in Health Nucleus for unrelated participants of European ancestry (
Specifically, the study identified 8 such carriers in the subset of unrelated participants (Table 2). Each variant was observed in one unrelated individual, and 5 of the 8 had already been annotated as causing obesity in HGMD or ClinVar (Table 2). As a group, MC4R carriers had significantly higher BMI (p=0.02) than did non-carriers as well as non-significant trends toward a higher diastolic blood pressure, insulin resistance, and percent body fat (
Genetics of the Healthy Obese—
Additional support for the decoupling of the genetics of high BMI versus the basis of obesity and predicted mBMI was derived from the analysis of outliers. Individuals with an mBMI that was substantially lower than their actual BMI had a higher polygenic risk score for BMI than did other groups. In contrast, those whose mBMI was substantially higher than their actual BMI had low polygenic risk scores (
Genetics of metabolome differences—
The study further investigated whether obese individuals with different genetic backgrounds had different metabolomes from other obese individuals. First, metabolites that could distinguish individuals with different BMI polygenic risk scores or MC4R variant carriers were searched. Linear regression showed no significant associations between any single metabolites and polygenic risk or MC4R carrier status either in the entire population or in only the obese individuals. This result implies that metabolites are unlikely to be intermediate phenotypes that explain the underlying genetics of obesity. To check for more specific signals beyond the compiled polygenic risk score, separate analyses of each of the 97 variants that are used to calculate the polygenic risk score were also performed. There was no evidence for any of these known GWAS variants to be more strongly associated with a metabolite than with BMI itself, though the power for discovery was limited given the very small effect sizes of most individual GWAS variants. In summary, although it is known that there is a strong genetic component to metabolite levels, most of the metabolic perturbations that occur in the obese state are a response to obesity as opposed to shared genetic mechanisms.
The results of the present study highlight the profound disruption of the metabolome that is caused by obesity and identifies a metabolome signature that serves to examine metabolic health beyond anthropomorphic measurements (
It is well known that uric acid increases with obesity, due to insulin resistance reducing the kidneys' ability to eliminate uric acid, but previous work has not emphasized the power of urate to predict BMI. It was also found that 23 of the lipids in the assay were definitively associated with BMI, with an enrichment of associations found for glycerol lipids. These results are consistent with previous studies showing that sphingomyelins and diacylglycerols increase with BMI while lysophosphocholines decrease with BMI, with other various phosphatidylcholines having effects in both directions. Other previously reported metabolite associations with BMI, including positive associations with choline, cysteine, pantothenate, fructose, palmitate, stearate, fructose, and xylose, and negative associations with citrulline, methionine, and uridine are not apparent in the large study. These metabolites have largely been implicated in studies specifically addressing diabetes in the setting of obesity, and their effects may be limited to that context. Given this landscape, it will be of interest to perform studies that more specifically dissect the associations of metabolites with various traits. For example, few of the BMI-associated metabolites were associated with insulin resistance after controlling for BMI, despite the overlap between obese patients and patients with insulin resistance. As previously observed, the metabolome abnormalities associated with high BMI corrected with loss of weight. However, the present study found that metabolite levels did not provide predictive power for future weight changes. Overall, the metabolome perturbations appear as a consequence of changes in weight as opposed to being a contributing factor.
The present study does not support a strong association between metabolome changes and the genetics of BMI defined by a 97-variant polygenic risk score. This may be explained by the fact that known BMI GWAS loci explain only a small fraction (˜3%) of BMI heritability. However, as discussed below, the BMI polygenic risk may also influence body build and not only obesity. Taken together, it does not appear that metabolites are intermediate phenotypes between the genetics of BMI and obesity itself. The study also identified individuals who carried rare functional variants in the known obesity gene MC4R. The carriers of these variants were often obese individuals, but their metabolome was not categorically different from that of other obese individuals. The lack of metabolome differences for carriers of variants in this gene is not surprising given that MC4R variants cause obesity by increasing appetite. However, the results did not show that obese carriers of MC4R variants often had low polygenic risk scores for obesity; out of 31 participants who were obese with polygenic risk scores in the lowest quartile, 6.1% were MC4R variant carriers, while the carrier frequency was just 0.3% in those of normal weight. Polygenic risk scores are calculated using common variants and association statistics from existing genome-wide association studies. Their impact on phenotype is generally modest, and the present study demonstrates part of why this is true: rare variants with larger effects on phenotypes are not captured in polygenic risk scores.
The present study shows the potential to sequence obese individuals who are outliers with low polygenic risk scores because of the apparent enrichment for monogenic contributions. As of the completion of the study, a large consortium provided additional detail on the role of variants in pathways that implicate energy intake and expenditure in obesity. Finally, the metabolome signature identified individuals whose predicted mBMI was either substantially higher or lower than their actual BMI. These individuals include the metabolically healthy obese, but also emphasize the importance of the metabolome in unhealthy individuals with a normal BMI. These profiles were surprisingly stable over the prolonged follow-up. This suggests that there is a durable benefit of maintaining a healthy metabolome signature and points to an ongoing risk for the individuals that have an unhealthy metabolome despite stability of BMI. An abnormal metabolome signature, irrespective of BMI, was associated in the present study with three-fold increase in cardiovascular events. Thus, while these findings are in line with the known relationships between metabolically healthy obese status and health-related traits like metabolic syndrome and body fat, the relationship was extended to a broader category of metabolically healthy and unhealthy individuals on the basis of the disparity between mBMI and BMI. The fact that the metabolically healthy obese have a high BMI polygenic risk score also supports the concept that some of the genetic studies may capture anthropomorphic associations—body size—rather than obesity sensu stricto. These findings are in line with previous studies identifying genetic variants specifically associated with the metabolically healthy obese, or favorable adiposity. While the common variants associated with favorable adiposity thus far have had subtle effects, a thorough investigation of the full genomes mBMI/BMI outliers can be expected to identify rare variants with large effects on healthy obesity and unhealthy metabolome with normal weight.
The biological differences between these outlier categories would benefit from further study as well. For example, differences in waist/hip ratio, percent visceral fat, and blood pressure between mBMI/BMI outliers were observed despite having the same BMI distribution (
While the present teachings are described in conjunction with various embodiments, it is not intended that the present teachings be limited to such embodiments. On the contrary, the present teachings encompass various alternatives, modifications, and equivalents, as will be appreciated by those of skill in the art.
Further, in describing various embodiments, the specification may have presented a method and/or process as a particular sequence of steps. However, to the extent that the method or process does not rely on the particular order of steps set forth herein, the method or process should not be limited to the particular sequence of steps described. As one of ordinary skill in the art would appreciate, other sequences of steps may be possible. Therefore, the particular order of the steps set forth in the specification should not be construed as limitations on the claims. In addition, the claims directed to the method and/or process should not be limited to the performance of their steps in the order written, and one skilled in the art can readily appreciate that the sequences may be varied and still remain within the spirit and scope of the various embodiments.
The embodiments described herein, can be practiced with other computer system configurations including hand-held devices, microprocessor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers and the like. The embodiments can also be practiced in distributing computing environments where tasks are performed by remote processing devices that are linked through a network.
It should also be understood that the embodiments described herein can employ various computer-implemented operations involving data stored in computer systems. These operations are those requiring physical manipulation of physical quantities. Usually, though not necessarily, these quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated. Further, the manipulations performed are often referred to in terms, such as producing, identifying, determining, or comparing.
Any of the operations that form part of the embodiments described herein are useful machine operations. The embodiments, described herein, also relate to a device or an apparatus for performing these operations. The systems and methods described herein can be specially constructed for the required purposes or it may be a general purpose computer selectively activated or configured by a computer program stored in the computer. In particular, various general purpose machines may be used with computer programs written in accordance with the teachings herein, or it may be more convenient to construct a more specialized apparatus to perform the required operations.
Certain embodiments can also be embodied as computer readable code on a computer readable medium. The computer readable medium is any data storage device that can store data, which can thereafter be read by a computer system. Examples of the computer readable medium include hard drives, network attached storage (NAS), read-only memory, random-access memory, CD-ROMs, CD-Rs, CD-RWs, magnetic tapes, and other optical, FLASH memory and non-optical data storage devices. The computer readable medium can also be distributed over a network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.
This application claims the benefit of priority under 35 U.S.C. § 119 from U.S. Provisional Application No. 62/652,864, filed Apr. 4, 2018; and from U.S. Provisional Application No. 62/724,515, filed Aug. 29, 2018, which are hereby incorporated by reference in their entirety as set forth in full.
Number | Date | Country | |
---|---|---|---|
62652864 | Apr 2018 | US | |
62724515 | Aug 2018 | US |