The present invention relates in general to the field of peripheral blood DNA methylation, and more particularly, a novel method of predicting knee osteoarthritis radiographic progression from peripheral blood DNA methylation.
Without limiting the scope of the invention, its background is described in connection with Knee osteoarthritis (OA).
Osteoarthritis (OA) is the leading cause of chronic disability in the United States, and is the third most rapidly rising chronic medical condition associated with disability worldwide [1,2]. Despite its importance and economic impact, there are no disease-modifying anti-osteoarthritic drugs (DMOADs) approved by the US Food and Drug Administration (FDA) or European Medicines Agency, in stark contrast to the multitude of biologic and nonbiologic disease-modifying treatments available in other forms of arthritis [3]. This lag in the development of DMOADs is due in no small part to a lack of easily accessible radiographic and/or biochemical biomarkers to diagnose OA and discriminate patient phenotypes, including prediction of future progressors.
Accordingly, much attention has recently been focused on the development of diagnostic and prognostic biomarkers for OA. Several groups have described biomarker candidates for OA diagnosis, discriminating OA patients from healthy controls based on peripheral blood analytes. In 2014, Ramos et al. described an mRNA-based peripheral blood signature which could discriminate OA patients from matched controls [4], with a c-statistic of 0.97. In 2018, Li et al. published a reanalysis of these data using a different machine learning technique that performed at an equally high sensitivity and specificity on a reduced subset genes [5]. In 2018, Huang and colleagues used data and biospecimens from the DOXY (doxycycline for treatment of OA) clinical trial to examine four plasma biomarkers [6] and found baseline levels of lipopolysaccharide binding protein (LBP) were associated with future radiographic progression (TIC over 18 months OR=1.418).
A more difficult but perhaps more clinically relevant biomarker task, however, is the classification of OA patients into distinct phenotypes. The largest study yet to take on the task of OA phenotype discrimination was an analysis of serum and urine biochemical biomarkers performed by the Foundation for the National Institutes of Health (FNIH) OA Biomarkers Consortium (OABC-FNIH), using data and biospecimens from the longitudinal, US-based Osteoarthritis Initiative (OAI) study [7,8]. In this study, the authors produced a model combining three serum and urine biomarkers that could discriminate radiographic and pain progressors from nonprogressors with a receiver operator characteristic area under the curve (AUC-ROC, c-statistic) of 0.631[9]. Baseline values of these parameters did not offer substantial predictive capability; rather, models utilized a time-integrated concentration (TIC) approach tracking analyte measurements over a period of years.
A drawback of traditional biochemical biomarkers such as those used in the OABC-FNIH is their inherent variability. Epigenetic assays offer theoretical advantages as biomarkers. Most notably, an epigenetics-based assay has the potential to offer prognostic information based on a single time point reflecting early, relatively stable gene regulatory changes that precede gene transcription and subsequent protein translation. Early epigenetic changes in peripheral blood have been found to be useful biomarkers in several rheumatic diseases including rheumatoid arthritis [10] and systemic lupus erythematosus [11], as well as several chronic low-level inflammatory diseases, including type-2 diabetes mellitus risk [12] and cardiovascular disease [13]. Several recent studies, including our own, have demonstrated alterations in joint tissue epigenetic patterns associated with OA development and progression [14-18]; however, no analyses of blood epigenetic changes have yet been published.
Despite these advances, what is needed is a rapid, inexpensive, and widely available method and system to identify OA disease progression from easily accessible biological samples.
In one embodiment, the present invention includes a method for determining whether a subject diagnosed with osteoarthritis is at increased risk for progression to severe osteoarthritis within 12-24 months, the method comprising the steps of: (a) preparing a blood sample; (b) measuring in the blood sample prepared in step (a) a level of DNA methylation, e.g., by differentiation of bisulfite-generated polymorphisms; (c) comparing the DNA methylation level(s) measured in step (b) to a control blood sample level(s) of DNA methylation in a subject without osteoarthritis, and (d) identifying the subject as being at increased risk for progression to severe osteoarthritis within 12-24 months when the DNA methylation is increased by at least 2 fold as compared to the control blood level(s) of DNA methylation. In one aspect, the blood sample comprises peripheral blood mononuclear cells or cell-free DNA. In another aspect, the control blood level of the DNA methylation in a similarly processed blood sample from a control subject or a mean level of several control subjects. In another aspect, the control subject (i) is matched to the tested subject by at least one of: age category, sex, BMI category, ethnicity, or baseline KL radiographic grade; and (ii) does not have osteoarthritis as determined by both radiographic and symptomatic examination. In another aspect, the control blood level of the DNA methylation is a predetermined standard. In another aspect, the DNA methylation is determined with a 450k, or 850k array, or CpG sites associates with at least one of: an antigen presentation pathway, adenosine monophosphate kinase (AMPK) signaling pathway, or a sonic hedgehog signaling pathway, or one or more top upstream regulators identified by IPA selected from transcription factor PITX2, histone H3, histone H4, miR-141, miR-9, miR-137, bone morphogenic protein 2, a TGF-β superfamily member, or 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, or 13 markers selected from cg04195161, cg22064129, cg04985016, cg12692919, cg04043957, cg08872579, cg02019955, cg01333532, cg23705082, cg05042110, cg00715363, cg01307007 and cg09239099. In another aspect, the subject is human. In another aspect, the osteoarthritis is a knee osteoarthritis. In another aspect, the subject has a Kellgren-Lawrence (KL) score of 1-3 in at least one knee. In another aspect, the method further comprises the step of treating the osteoarthritis wherein the treatment is at least one of acetaminophen, steroids, hyaluronic acid, nonsteroidal anti-inflammatory drugs (NSAIDs), or Duloxetine.
In another embodiment, the present invention includes a method for treating osteoarthritis in a subject, wherein the subject has been diagnosed with osteoarthritis but does not have severe osteoarthritis, the method comprising the steps of: (a) measuring in a blood plasma sample obtained from the subject the plasma level of DNA methylation, e.g., by differentiation of bisulfite-generated polymorphisms; and (b) if the plasma level DNA methylation is increased by at least 2 fold as compared to a control plasma level(s) of the DNA methylation in a subject without osteoarthritis, administering an osteoarthritis treatment to the subject. In one aspect, the subject has been diagnosed with knee osteoarthritis and has a Kellgren-Lawrence (KL) score of 1-3 in at least one knee. In another aspect, the subject is human. In another aspect, the blood plasma sample obtained from the subject has been prepared in the presence of heparin. In another aspect, the DNA methylation is determined with a 450k, or 850k array, or CpG sites associates with at least one of: an antigen presentation pathway, adenosine monophosphate kinase (AMPK) signaling pathway, or a sonic hedgehog signaling pathway, or one or more top upstream regulators identified by IPA selected from transcription factor PITX2, histone H3, histone H4, miR-141, miR-9, miR-137, bone morphogenic protein 2, a TGF-β superfamily member, or 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, or 13 markers selected from cg04195161, cg22064129, cg04985016, cg12692919, cg04043957, cg08872579, cg02019955, cg01333532, cg23705082, cg05042110, cg00715363, cg01307007 and cg09239099. In another aspect, the control blood level of the DNA methylation is a level of DNA methylation in a similarly processed blood sample from a control subject or a mean level of several control subjects. In another aspect, the control subject (i) is matched to the tested subject by at least one of: age category, sex, BMI category, ethnicity, or baseline KL radiographic grade; and (ii) does not have osteoarthritis as determined by both radiographic and symptomatic examination. In another aspect, the control plasma level of the DNA methylation is a predetermined standard. In another aspect, the osteoarthritis treatment is at least one of acetaminophen, steroids, hyaluronic acid, nonsteroidal anti-inflammatory drugs (NSAIDs), or Duloxetine.
In another embodiment, the present invention includes a method for treating osteoarthritis in a subject, the method comprising administering an osteoarthritis treatment to a subject with elevated plasma levels of DNA methylation, wherein the subject does not have severe osteoarthritis. In one aspect, the subject has been diagnosed with knee osteoarthritis and has a Kellgren-Lawrence (KL) score of 1-3 in at least one knee. In another aspect, the subject is human. In another aspect, the osteoarthritis treatment is at least one of acetaminophen, steroids, hyaluronic acid, nonsteroidal anti-inflammatory drugs (NSAIDs), or Duloxetine.
For a more complete understanding of the features and advantages of the present invention, reference is now made to the detailed description of the invention along with the accompanying figures and in which:
While the making and using of various embodiments of the present invention are discussed in detail below, it should be appreciated that the present invention provides many applicable inventive concepts that can be embodied in a wide variety of specific contexts. The specific embodiments discussed herein are merely illustrative of specific ways to make and use the invention and do not delimit the scope of the invention.
To facilitate the understanding of this invention, a number of terms are defined below. Terms defined herein have meanings as commonly understood by a person of ordinary skill in the areas relevant to the present invention. Terms such as “a”, “an” and “the” are not intended to refer to only a singular entity, but include the general class of which a specific example may be used for illustration. The terminology herein is used to describe specific embodiments of the invention, but their usage does not delimit the invention, except as outlined in the claims.
The present invention uses peripheral blood mononuclear cell (PBMC)-based epigenetic models to predict future knee OA radiographic progression in a well-matched cohort of patients from the Osteoarthritis Initiative (OAI), using similar definitions to those previously published in OABC-FNIH studies[9].
Baseline patient demographic, disease, and PBMC composition characteristics were well matched. Models developed using only patient characteristics were not predictive of future progression.
The inventors first identified a group of 58 OA radiographic progressors with baseline Kellergren-Lawrence (K/L) radiographic grade 2-3, symptomatic knee OA who exhibited ≥0.7 mm of joint space width (JSW) loss over the first 24 months of follow-up and persistent JSW loss at 48 months from the longitudinal, US-based Osteoarthritis Initiative (OAI) study. The inventors then matched 58 nonprogressors with ≤0.5 mm of joint space width loss over 48 months of follow-up by age category, sex, BMI category, ethnicity, and baseline KL radiographic grade (Table 1). Although not included in the initial matching criteria, there were no statistical differences in baseline JSW, NSAID use, or smoking history between the two groups. The inventors did note a statistically significant increase in baseline Western Ontario and McMaster (WOMAC) pain subscale among progressors (21.5±2 mean±SEM vs. 17.0±1.5 points on a 0-100 point normalized scale, P=0.05).
Recent studies have reported that clinical characteristics alone can be used to model future GA progression[19]. To ensure that these models were not being affected by patient characteristics, the inventors first developed models using baseline patient characteristics data alone, without including DNA methylation data. These models included age, sex, BMI, baseline JSW, baseline WOMAC pain, smoking history, and NSAID use. Baseline models were not able to discriminate the two groups (receiver operator characteristic area under the curve ROC-AUC, c-statistic=0.49±0.01 mean±SEM, accuracy=50±0.7%) (
DNA methylation data from mixed PBMC samples can be skewed by underlying differences in cellular composition among groups[20]. As the OAI dataset does not include data on individual blood sample cellular composition, the inventors estimated this using a computational approach[21]. There were no statistically significant differences in estimated PBMC composition between the two groups; however, the inventors did note a trend towards increased monocyte counts in cases (progressors 7.4±0.3% mean±SEM vs. nonprogressors 6.7±0.3%, P=0.12) (Table 1). Accordingly, the inventors corrected the dataset for PBMC composition differences using frozen surrogate variable analysis (FSVA), a technique previously demonstrated to be robust in correcting cellular composition differences in genome-wide DNA methylation data[22,23], before developing these epigenetic models.
Models developed based on PBMC DNA methylation data are predictive of radiographic progression. Traditionally, epigenome-wide association studies have reported DNA methylation data as beta values, defined as the fraction methylation (0-1 scale) of each CpG site included in the array. However, beta values are characterized by high heteroscedasticity (most beta values fall within extreme high- and low-percent methylation levels). Therefore, the inventors also analyzed M values (the log base-2 ratio of methylated:unmethylated probe intensities for a given CpG site), which are approximately homoscedastic, in these models[24]. In this analysis, models based on PBMC DNA methylation data were consistently capable of discriminating those patients who would go on to experience radiographic progression from nonprogressors (
The inventors then added patient characteristic covariate data to model development, including age, sex, BMI, baseline JSW, baseline WOMAC pain, smoking history, and NSAID use. Including these data did not alter model performance (M value models c=0.80±0.01, beta value models c=0.78±0.01, both nonsignificant compared to models without covariates). Finally, as this analysis included data from two methylation assay types, the inventors then analyzed model performance for 450k and 850k chips separately; both types demonstrated equivalent performance in these models (p=0.27).
A subset of DNA methylation CpG sites were selected frequently in multiple rounds of model development and were enriched in functional pathways previously associated with OA development. Although 969 CpGs were selected in at least one round of model development in either M or Beta based models, a subset of CpG sites were selected several times (Table 3, complete list of CpGs selected in Table 4). Many of these most frequently selected CpGs in both M value-based and Beta value-based model development were shared; for example, 7 of the top 20 most frequently selected CpGs in both M- and Beta-models were identical. The majority of the top 20 CpGs selected during modeling were associated with CpG islands (13 of 20 M- and 16 of 20 Beta-based models), including all but one of the CpG sites shared among the two methylation measures, suggesting these sites may play a role in gene regulation.
cg11865413
15
N
—
Shelf
cg26384229
14
ALG10B
TSS200
Island
cg21643086
15
cg21643086
12
cg17745251
11
C10orf140
Body
Island
cg03687650
11
OSBPL5
Body
S
—
Shelf
cg06409741
11
RASA3
Body
Island
cg06409741
11
RASA3
Body
Island
cg03870777
10
KRT18
TSS200
N
—
Shore
cg26384229
9
ALG10B
TSS200
Island
cg11865413
9
N
—
Shelf
cg03687650
OSBPL5
Body
S
—
Shelf
cg17745251
7
C10orf140
Body
Island
cg03870777
6
KRT18
TSS200
N
—
Shore
Although a differential methylation analysis was not performed in the traditional sense, it was reasoned that CpG sites chosen for inclusion in these models might still clustered in functional pathways important in OA. To investigate this, the inventors performed gene ontology analysis using the Ingenuity Pathway Analysis (IPA) system of genes associated with CpG sites chosen in at least one round of model development. The most significantly enriched canonical pathways enriched in this gene set included the antigen presentation pathway (n=6 genes, p=7E-4), adenosine monophosphate kinase (AMPK) signaling (n=15, p=1E-3, and sonic hedgehog signaling (n=4 genes, p=9E-3), among others (Table 4). The top upstream regulators identified by IPA include the transcription factor PITX2 (n=14 genes, p=1E-5), histone H3 (n=29 genes, p=7E-5) and H4 (n=14,p=9.6E-5), miR-141 (n=9, p=1.9E-4), miR-9 (n=5, p=3.4E-4), miR-137 (n=2, p=8.2E-4), and bone morphogenic protein 2 (BMP2, n=15, p=9.0E-4), a TGF-β superfamily member, among others (Table 4).
Biomarkers hold great potential for improving clinical outcomes in common chronic diseases including OA. The early identification of OA patients who are likely to progress radiographically, clinically, or both, will allow for the enrichment of clinical trials of disease-modifying antiosteoarthritic drugs (DMOADs) with appropriate patients. Biomarkers of progression may also allow clinicians to stratify early OA patients, offering personalized care by taking into account an individual's likelihood to respond to a particular therapy. In this pilot study, the inventors sought to test whether baseline peripheral blood mononuclear cell DNA methylation data could be used to develop models to discriminate those patients who would go on to experience radiographic progression from nonprogressors. While based on a relatively small number of patients from a single cohort, these results nonetheless suggest that peripheral blood-based epigenetic models may be useful for OA subtype discrimination and should be evaluated in future larger studies of heterogeneous OA patients from additional cohorts.
Several groups have previously investigated biomarkers predictive of future OA progression. The largest biochemical study in this regard was the OAI biomarker studies including the osteoarthritis biomarker consortium of the Foundation for the National Institutes of Health (OABC-FNIH) which investigated a targeted set of biochemical biomarkers as predictors of OA pain and radiographic progression. Their analysis identified a combination of three biochemical biomarkers for discrimination of pain and radiographic progression from nonprogressors with a c-statistic of 0.631[9], although they used a time-integrated concentration (TIC) approach, requiring multiple biomarker measurements over time as input into their prognostic model. This same study also evaluated the predictive capability of baseline biomarkers alone, but these not demonstrate robust differentiation of progressors (c=0.586)[9].
Radiographic biomarkers predicting future disease progression have also been evaluated. In 2016, Collins et al. reported models based on changes in MRI semi-quantitative markers over 24 months to predict the likelihood of radiographic progression at 48 months as part of the OABC-FNIH study[26]; the best model achieved a c-statistic of 0.74. Like the OABC-FNIH biochemical biomarker study, this study leveraged changes in imaging parameters over time to predict future progression. Trabecular bone texture (TBT) is perhaps the best-studied single imaging biomarker for prediction of future OA progression. Kraus et al evaluated this in FNIH patients, with a best composite model using baseline TBT to predict future radiographic progression with a c-statistic of 0.624[27]. This is somewhat lower than previous reports using TBT, Janvier and colleagues, for example, reported a c-statistic of AUC 0.77 in 2017[28].
There have also been recent reports using machine learning to model future progression based on combinations of baseline imaging and demographic data. Joseph and colleagues in 2018 published the results of three such models, with a maximum c-statistic of 0.72 to predict progression to moderate-severe OA over 8 years including demographic data, risk factors, K/L score, cartilage WORMS score, meniscal tear, and cartilage MRI T2 imaging data[29] from the OAI cohort. Also in 2018, Halilaj and colleagues published the results of a mixed-effects mixture model within the OAI cohort to predict joint space narrowing in 1243 subjects. They described an impressive c-statistic of 0.86 for prediction of future progression using a combination of baseline and year 1 follow-up variables, using a 10-cycle, 90% development/10% lockbox validation data splitting approach with 10-fold internal cross-validation, not unlike the methodology of the present invention. Models based solely on baseline data, however, did not predict future radiographic progression (c≤0.6)[30].
Several recent studies have also identified mitochondrial haplotypes as biomarkers which can discriminate OA phenotypes. For example, in 2012 Fernandez-Moreno et al. identified significant increases in cartilage-specific biomarkers of OA in patients carrying mitochondrial haplogroup H associated with the onset of OA [31]. Similarly, this group later identified a decrease in incident OA risk in patients with mitochondrial haplogroup J compared to haplogroup H (HR 0.7, p<0.05) in both the OAI and CHECK cohorts [32], as well as slowing of the rates of radiographic progression rate among OA patients with haplogroup T compared to other haplogroups within the OAI cohort [33]. Future studies should investigate potential complementarity between mitochondrial haplogroups and DNA methylation as predictive OA biomarkers.
Much work has been done over the past several years linking epigenetic changes, particularly DNA methylation, with OA. The present inventors and others have noted differential DNA methylation in inflammatory cell pathways in both cartilage and subchondral bone from OA patients[14-17,34,35]. A subgroup of end-stage OA patients' cartilage demonstrates a fingerprint of epigenetic changes within inflammatory genes[18,36]. Given chondrocyte and subchondral bone DNA methylation aberrancy in both knee and hip OA, along with the known contributions of chronic inflammation and inflammatory gene epigenetic changes to OA, the DNA methylation patterns within PBMCs associated with OA progression the inventors describe in this report are intuitively consistent with previously published data.
The PBMC DNA methylation-based approach taught herein has several advantages over previously published biomarkers. Most notably, this approach offers predictive capability based on a single baseline blood draw. Further, this model demonstrated higher c-statistic values than previously described biochemical biomarkers. A traditional DNA methylation analysis of progressors and nonprogressors was performed and did not identify significantly differentially methylated positions (DMPs) with a false data rate-corrected p-value ≤0.05. This is not particularly surprising given the mixed PBMC population. This approach utilized the combined effects of multiple CpG sites when developing DNA methylation-based predictive models.
Gene ontology analysis revealed several canonical gene pathways and upstream regulators. Unfortunately, there have not been previous epigenetic analyses of peripheral blood cells in OA in humans or mice to compare these data to. However, many pathways and regulators previously described in articular tissues overlap with these findings. For example, AMP kinase (AMPK) deficiency in chondrocytes accelerates both age- and trauma-associated OA in mice[44], whereas a pharmacological stimulator of AMPK attenuates post-traumatic OA in rats[45]. The endocannabinoid system within synovial tissue has been associated with OA pain [46]. Autophagy has been widely associated with OA. Age-related loss of autophagy has been linked with OA severity in human [47] and murine [48] joint tissue. Furthermore, cartilage-specific deletion of mammalian target of rapamycin (mTOR) increases autophagy signaling and protects against post-traumatic OA induction in mice [49]. Among the upstream regulators identified in this ontology analysis was PITX2, a component of the Wnt signaling pathway key to mesenchymal stem cell function which is altered in OA patients [50]. MicroRNA-141 is dysregulated in human OA and contributes to pathogenesis by augmentation of lipid metabolism [51], and is important in maintaining the appropriate expression of key osteoblast differentiation proteins including Runx2, Sclerostin, ALP, and Dlx5[52]. MicroRNA-9 has been shown to directly augment the expression of the key catabolic enzyme matrix metalloproteinase-13 (MMP13) in OA patients [53] and animal models[54]. MicroRNA-137 regulates chondrocyte metabolism via targeting Runx2[55]. Finally, bone morphogenic protein-2 (BMP2) upregulation has been linked to OA development and increased matrix turnover [56]. It is a key regulator of chondrocyte metabolism [57] and has been suggested as a novel OA therapeutic[58].
A major concern with machine learning is generally the problem of overfitting. The inventors reduced this possibility through extensive cross-validation and report the accuracy of models when tested on data unseen during model development. The inventors included in the analysis data from both Illumina 450k and 850k chips. All data from 850k chips were subset to only those sites included on the 450k array and were batch corrected. The inventors saw no evidence for differences in predictive capability with the two chip types (see above). The inventors did see differences comparing M value-based to beta value-based models; however, this is not unsurprising given previous reports of variations in statistical measures of these values [24] owing to differences in scedasticity. There is an emerging literature in the OA field which has demonstrated the importance of activated macrophages in disease development and progression [63-65]. It is important to highlight that the gene ontology analyses performed herein were based on CpG sites selected following PBMC cell composition correction. Thus, the present invention can be used to perform predictive and/or prognostic biomarker analysis, which can include DNA from mixed PBMCs and/or whole blood.
These results demonstrate for the first time the use of peripheral blood epigenetic patterns as predictive models for knee OA radiographic progression. The use of linear modeling applied to a large genome-wide DNA methylation dataset from well-matched case and control cohorts offer the first glimpse into the potential of future OA epigenetic classification. The present invention offers significant advantages over traditional biochemical biomarkers, including requiring a single baseline blood sample. Thus, the present invention provides an easily accessible predictive biomarkers to personalize and improve the care of adults with knee OA.
Study Design. This nested case-control study (116 total patients, 58 progressors, 58 nonprogressors) used data and biospecimens from the Osteoarthritis Initiative (OAI) database, which is available for public access at https://data-archive.nimh.nih.gov/oai/. All Osteoarthritis Initiative (OAI) participants provided written informed consent, and the study was carried out in accordance with the OAI data user agreement, approved by the Committee on Human Research of the Institutional Review Board (IRB) for the University of California, San Francisco (UCSF).
The IRBs of the University of Oklahoma Health Sciences Center and Oklahoma Medical Research Foundation also reviewed and approved the project. Detailed case and control criteria are presented herein below. Participants had baseline and yearly follow-up knee radiographs and PBMC DNA available. Kellgren-Lawrence Grade (KLG) and quantitative joint space width (JSW) [59] were assessed by the central reading site using non-fluoroscopic fixed-flexion knee radiographs with a Synaflexer positioning device (Synaflexor, Synarc, Newark, Calif.). All participants had baseline KLG of 2-3 in at least one knee without a history of total knee joint replacement through 48-months. Participants with a tibial plateau rim distance of 6.5 mm at baseline, or with a change in the rim distance of >2.0 mm between baseline and follow-up were excluded due to unreliable radiographic positioning.
Definition of radiographic progression, case and control group characteristics. The definition of radiographic progression was similar to the case definitions of the OABC-FNIH[9]. Cases had radiographic progression in the medial tibiofemoral compartment by a longitudinal loss in the minimum JSW of at least 0.7 mm from baseline to 24-month follow-up in one index knee, with persistent narrowing in the same index knee at 48 months based on radiographs obtained from a non-fluoroscopic fixed flexion protocol (Synaflexor, Synarc, Newark, Calif.). In each case, the contralateral (non-index) knee had less progression than the index knee, or no progression over the follow-up period. Participants with a tibial plateau rim distance of 6.5 mm at baseline, or with a change in the rim distance of >2.0 mm between baseline and follow-up were excluded due to inappropriate and/or unreliable radiographic positioning. Non-progressors were defined as those with ≤0.5 mm of JSW loss from baseline to 48 months in either knee. These thresholds were set to coincide with previous OAI biomarker studies[9,60,62,66], as defined in the Foundation for the National Institutes of Health biomarker consortium project (OABC-FNIH) and were set based on the distribution of 1-month change in minimum JSW in normal knees of OAI control participants and estimated to have ≤10% probability of change due to measurement error[8,67].
Controls were frequency matched with cases by age category, sex, race, and BMI category as shown in Table 1 and were also frequency matched with the larger OAI cohort to broaden the applicability of these findings. Group differences were calculated using a Student t-test and statistically analyzed using a 2-tailed P value. There were no differences among the two groups in any demographic category with the exception of mean baseline Western Ontario and McMaster (WOMAC) pain subscale (Table 1), although it should be noted that there was a trend towards fewer females included among cases (53% vs. 60%, p=0.13), more NSAID use at baseline among cases (29% vs. 17%, p=0.13), and an increased mean baseline K/L grade among cases (2.2 vs. 2.0, p=0.09), although no difference in mean baseline JSW was noted in cases (3.9 vs. 4.0, p=0.34). The inventors also evaluated the presence of various comorbid conditions among data collected by the OAI, including history of heart attack, failure, stroke, pulmonary disease, diabetes, and cancer (Table 1). The inventors adjusted these models for these variables as part of the analysis (see below).
DNA methylation assays, PBMC composition assessment and adjustment. Five hundred nanograms of DNA was treated with sodium bisulfite (EZ DNA methylation kit, Zymo) and loaded onto Illumina Infinium HumanMethylation450k (n=62 samples: 30 cases, 32 controls) or 850k (n=54 samples: 28 cases, 26 controls) arrays. As no direct measures of PBMC cell type composition were available from the OAI, these data were estimated using the estimate Cell Counts function of minfi. No statistically significant differences in estimated PBMC cell composition were found between groups (Table 1, Table 5), although there was a trend towards increased monocytes among cases (7.4±0.3% vs. 6.7±0.3%, cases vs. controls mean±SEM, p=0.12). DNA methylation data were corrected for cell count variation using frozen surrogate variable analysis [22] via the sva package (v. 3.28.0). This method has been previously shown to robustly correct for cell count variation and other batch effects in large-scale epigenomic studies [22,23,68].
Data preprocessing. Statistical analysis was performed using R (v. 3.5.0). Raw .IDAT files were processed using the minfi package (v. 1.26.2). Illumina 850k (EPIC) array data were subset to only those sites also included in the 450k array. Preprocessing and normalization were performed using the preprocess Funnorm function. All samples passed internal controls included on each chip and had detection p≤0.01 for ≥97% of cytosine-guanosine dinucleotide (CpG) positions included on each array. Epigenome-wide association studies traditionally have used DNA methylation beta values, defined as the fraction methylation (0-1 scale) for a particular CpG site. Beta values, however, are characterized by high heteroscedasticity (most beta values fall within extreme high- and low-percent methylation levels), and questions have been raised regarding statistical validity of beta value analysis [24]. Therefore, the inventors also analyzed M values (the log2 ratio of methylated:unmethylated probe intensities for a given CpG site), which are approximately homoscedastic. The inventors excluded from analysis CpG probes located on sex chromosomes, probes with known single nucleotide polymorphisms (SNPs) with minor allele frequency of ≥5%, and probes not detected in all samples, leaving a final n=435,118 CpG sites for analysis in each sample.
Modeling. Elastic-net regularized generalized linear models were developed using the glmnet package (v. 2.0-16). As model overfitting is a frequently-occurring problem when developing classifiers on high-dimensional data, the inventors implemented three strategies to reduce potential overfitting. First, the inventors performed 7-fold internal cross-validation during development via the cv.glmnet function, utilizing the ‘one standard error rule’ when selecting lambda values [69]. Second, the inventors tested models on lockbox data not used for training (
Ontology analysis. Genes associated with CpG sites selected at least once during model development (n=969) were analyzed using Ingenuity Pathway Analysis (IPA, Qiagen) v. 48207413 build 2019-06-16 using default settings.
Case and control eligibility and Selection Process.
1. Subject Inclusion:
1.1. A subject with one or two knees with symptomatic OA that are eligible to be a Fast Progressor and/or a Non Progressor**.
An eligible knee is defined as:
1.1a. Mild-moderate symptomatic OA at baseline: in the same knee, KLG 2-3 and frequent pain (pain on most days of a month in past year) at baseline.
1.1b. Medial compartment JSW>=1.5 mm (VooMCMJSW>=1.5) at baseline since at <1.5 mm knee is already close to end stage and further fast progression is less unlikely to be preventable at that point; in addition JSW measurement is less reliable when knee approaches bone on bone.
1.1c. JSN in medial compartment at baseline is equal to or worse than JSN in lateral compartment.
1.1d. JSW data at BL, 24 m and 48 m time points. (Exception: Fast Progressor knees can be missing JSW due to a TKR at 36 or 48 m.).
1.1e. Adequate radiographic positioning at BL, 24 and 48 m time points defined as VxxTPCFDS<7.0 mm and change in VxxTPCFDS<2 mm in magnitude BL to 24 m or BL to 48 m.
1.2. Subjects with only one eligible knee at baseline**:
1.2a. The contralateral knee KLG is less than or equal to the eligible knee at baseline. (e.g., subjects with a KLG 2 symptomatic eligible knee can have a KLG 0-2 symptomatic or non-symptomatic. If the eligible knee is KLG 3, then the contralataeral knee can be a KLG 0-3 symptomatic or non-symptomatic knee. The contralateral knee must have a baseline medial JSW of >=1.5 mm.
2. Subject Exclusion.
2.1 TKR in either knee at BL, 12 m or 24 m.
2.2 RA diagnosis at any visit thru 72 m (RAVISIT is 0, 1, 2, 5, 6 or 8). What about subjects with self-reported psoriatic arthritis, AS, or reactive arthritis? How many are there?
2.3 Oral Glucocorticoid use reported at >=2 visits from BL to 72 m. How many are there?
3. Classify eligible knees of eligible subjects as Fast Progressor or Non-Progressor Index knees.
3.1 Fast Progressor Index knees: JSW loss of more than 0.7 mm between BL and 24 m and remains narrowed more than 0.7 mm at 48 m or has a TKR by 48 m.
3.2 Non-Progressor Index knees: JSW loss less than 0.5 mm between BL and 24 m and remains narrowed less than 0.5 mm at 48 m. Knee does not have TKR at 36 m or 48 m and does not have lateral JSN progression through 48 m.
4. Fast Progressor Subjects.
A Fast Progressor subject has one or two Fast Progressor Index knees.
For those with one Fast Progressor Index knee the contralateral knee can be:
4.1 A Non Progressor Index knee with a KLG≤to the KLG of the progressor knee;
4.2 An ineligible knee (e.g. KLG 0-1, KLG 2-3 nonsymptomatic and KLG≤to the KLG of Progressor knee) that meets Fast Progressor JSW criteria;
4.3 An ineligible knee that does not meet Fast Progressor criteria and KLG≤to the KLG of Progressor knee; or
4.4 An ineligible knee that does not meet conditions 1.1b to 1.1e and KLG≤to the KLG of Progressor knee.
5. Non Progressor Subjects
A Non Progressor Subject has one or two Non Progressor Index knees.
If a subject has only one Non Progressor Index knee:
5.1. The contralateral knee cannot be a Fast Progressor Index knee;
5.2. If the contralateral knee is not an eligible knee, then the contralateral knee must a) have KLG≤to the KLG of Progressor knee, b) allow assessment for Fast Progression (meet conditions 1.1b. to 1.1e.) and c) have JSW loss less than 0.5 mm between BL and 24 m and remain narrowed less than 0.5 mm at 48 m, d) does not have lateral JSN progression by 48 m and e) does not have TKR at 36 m or 48 m.
Designed to differentiate rapid radiographic and/or pain progressors from nonprogressors. This study utilized the Foundation for the NIH cohort of the OAI using biochemical biomarker data (PMID 27296323).
Development and validation on 464 (to date, estimated 600 upon completion) patients, including 299 rapid radiographic- and/or pain-progressors and 165 nonprogressors. All participants had baseline Kellegren-Lawrence radiographic grade (KLG) of 1-3 in at least one knee without a history of total knee joint replacement through 24-months. Patients with radiographic and/or pain progression by 12-months were excluded.
Case definition. Radiographic progressors had radiographic progression in the medial tibiofemoral compartment by a longitudinal loss in the minimum JSW of at least 0.7 mm from baseline to 24-month follow-up in one index knee, with persistent narrowing in the same index knee at 48 months based on radiographs obtained from a non-fluoroscopic fixed flexion protocol (Synaflexor, Synarc, Newark, Calif.). Participants with a tibial plateau rim distance of 6.5 mm at baseline, or with a change in the rim distance of >2.0 mm between baseline and follow-up were excluded due to inappropriate and/or unreliable radiographic positioning. Non-radiographic progressors were defined as those with ≤0.5 mm of JSW loss from baseline to 48 months in either knee.
Pain progressors were defined as >=9 points at 2 or more timepoints on the Western Ontario McMasters (WOMAC) pain subscale from the 24-month to 60-month pain assessment.
Controls were frequency matched with cases by age category, sex, race, and BMI category (<25, 25-27.5, 27.5-30, 30-35, and >=35 kg/m2.
DNA methylation assays, PBMC composition assessment and adjustment. Five hundred nanograms of DNA was treated with sodium bisulfite (EZ DNA methylation kit, Zymo) and loaded onto Illumina Infinium HumanMethylation850k arrays. DNA methylation data were corrected for cell count variation using frozen surrogate variable analysis via the sva package (v. 3.28.0).
Data preprocessing. Statistical analysis was performed using R (v. 3.5.0). Raw .IDAT files were processed using the ChAMP package v.2.20.1. Preprocessing and normalization were performed using Beta-Mixture Quantile (BMIQ) normalization. Beta values, defined as the fraction methylation (0-1 scale) for a particular CpG site were used for model development. The inventors excluded from analysis CpG probes located on sex chromosomes, probes with known single nucleotide polymorphisms (SNPs) with minor allele frequency of ≥5%, and probes not detected in all samples, leaving a final n=710,589 CpG sites for analysis in each sample. The simplified method eliminated the need to correct for PBMC composition and batch correction using frozen surrogate variable analysis (fsva). This method also increased the random split to 1:10, and is 10-fold validated, with an increase to 664,736 CpGs+/−gimnet models.
Modeling. Elastic-net regularized generalized linear models were developed using the glmnet package (v. 2.0-16). First, 10-fold internal cross-validation was performed during development via the cv.glmnet function, utilizing the ‘one standard error rule’ when selecting lambda values. Second, the models were tested on lockbox data not used for training (
Differences between the two approaches in Example 1 and 2. Smaller number of development and validation groups in the initial approach. Inclusion of both pain and radiographic progressors in the new approach vs. radiographic progressors only. It is now possible to differentiate pain vs. radiographic vs. dual progressors using a stepwise learning approach. Different/more extensive DNA methylation arrays used for the new approach. Increased cross-validation folds used in the new approach (10 vs. 7). Different development/validation ratio can also be used in the new approach (80/20 vs. 70/30). The CpG sites selected by the new modeling approach were substantially different than the original study (owing to inclusion of pain progressors and much larger number of CpG sites made available to the modeling system with the use of 850k chips).
Additional variations between Example 1 and Example 2. OAI/FNIH samples were DNA obtained from Buffy coat (peripheral blood mononuclear cells+neutrophils). A second set of sample DNA (Johnson County (JoCo)) was obtained from peripheral blood mononuclear cells (no neutrophils). The difference in initial cell type, along with differences in radiographic measurements, contribute to reduced discrimination capability of the models; however, the accuracy the models display even given these differences speak to the robustness of the process and final models. The Beta- and M-models performed equivalently in this new modeling cohort; therefore, models based on beta values (more commonly used in DNA methylation analyses) were utilized.
Johnson County (JoCo) cohort. 121 total participants, 70 progressors, 51 control nonprogressors. Definition of progression (different than previous cohorts given lack of quantitative joint space measurements). Radiographic progressors: Increase in overall radiographic grade (Kellegren-Lawrence Grade, KLG) of at least 1 from baseline to 48 months. Non-radiographic-progressors: No increase, or within-grade increase, in KLG from baseline to 48 months.
0.94 ± 0.004
As outlined above, baseline peripheral blood cell DNA methylation patterns from a group of patients from the Osteoarthritis Initiative (OAI) and demonstrated methylation-based models distinguish future radiographic progressors from nonprogressors. In this example, the method to the FNIH OsteoArthritis Biomarkers Consortium (OABC) subcohort of the OAI that included pain-, radiographic-, and dual- (both pain and radiographic) progressors compared to nonprogressors controls was used.
Buffy coat DNA was obtained from the OAI from baseline visits of OABC patients (n=600). 500 ng of DNA was bisulfite treated and loaded onto Illumina EPIC arrays then imaged by the Clinical Genomics Center at the Oklahoma Medical Research Foundation. Raw data were extracted and Beta-Mixture Quantile (BMIQ) normalized using the Chip Analysis Methylation Pipeline (ChAMP) package. 46 samples were eliminated due to low quality or sex mismatch leaving 81 radiographic-, 91 pain-, 193 dual-, and 189 non-progressors. Elastic net-penalized generalized linear models (GLMs) were then developed. Following a 70%-development, 30%-validation data split, GLMs underwent further internal 10-fold cross validation and optimized models were tested on validation data. Model development was repeated for a total of 40 randomized data splits and mean performance calculated. Supervised ‘reduced’ models were developed by reducing the dataset to include those CpG sites selected in ≥10/40 development rounds in the any- vs. nonprogressor comparison (n=13 CpGs). An independent radiographic progressor validation cohort was obtained from the Johnston County Osteoarthritis Project (JoCo), including 89 future progressors (≥1 K/L grade worsening at 48 mo or joint replacement) and 56 nonprogressors (no- or within-grade K/L worsening). Peripheral blood mononuclear cells methylation was quantified, 33 samples were removed due to low quality. Models were also tested on DNA methylation data from our previous peripheral blood methylation publication, including 27 radiographic-only and 28 nonprogressors from the OAI cohort.
Baseline buffy coat DNA methylation patterns accurately predicted future radiographic (accuracy 87%±0.8% mean±SEM, see Table 7 for AUC-ROC c-statistic, pain (89%±0.9%), dual radiographic+pain (72%±0.7%), and ‘all’ progressors (radiographic, pain, or dual) (78%±0.4%). Intriguingly, pain-only and radiographic-only progressors were not reliably distinguished (accuracy 58%±1%). The inclusion of demographic characteristics did not alter model performance, nor the addition of baseline serum/urine analytes, nor the inclusion of methylation array batch data. Models included a mean of 13±1 CpG sites. Supervised reduced models including 13 CpG sites (cg04195161, cg22064129, cg04985016, cg12692919, cg04043957, cg08872579, cg02019955, cg01333532, cg23705082, cg05042110, cg00715363, cg01307007 and cg09239099) had similar discriminatory capability (Table 7). Despite differences in the definition of radiographic progression and the cell type input (buffy coat vs. PBMC) in the JoCo cohort, models accurately discriminated radiographic progressors from nonprogressors (81%±0.3%). Models also performed accurately when applied to our previous OAI DNA methylation dataset (80% 0.3%).
These results confirm the predictive capability of peripheral blood-based DNA methylation models in a large cohort of OA patients and confirmed these findings in two independent cohorts. These data show that a pain and structural progression share early systemic immune epigenotypes.
It is contemplated that any embodiment discussed in this specification can be implemented with respect to any method, kit, reagent, or composition of the invention, and vice versa. Furthermore, compositions of the invention can be used to achieve methods of the invention.
It will be understood that particular embodiments described herein are shown by way of illustration and not as limitations of the invention. The principal features of this invention can be employed in various embodiments without departing from the scope of the invention. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, numerous equivalents to the specific procedures described herein. Such equivalents are considered to be within the scope of this invention and are covered by the claims.
All publications and patent applications mentioned in the specification are indicative of the level of skill of those skilled in the art to which this invention pertains. All publications and patent applications are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.
The use of the word “a” or “an” when used in conjunction with the term “comprising” in the claims and/or the specification may mean “one,” but it is also consistent with the meaning of “one or more,” “at least one,” and “one or more than one.” The use of the term “or” in the claims is used to mean “and/or” unless explicitly indicated to refer to alternatives only or the alternatives are mutually exclusive, although the disclosure supports a definition that refers to only alternatives and “and/or.” Throughout this application, the term “about” is used to indicate that a value includes the inherent variation of error for the device, the method being employed to determine the value, or the variation that exists among the study subjects.
As used in this specification and claim(s), the words “comprising” (and any form of comprising, such as “comprise” and “comprises”), “having” (and any form of having, such as “have” and “has”), “including” (and any form of including, such as “includes” and “include”) or “containing” (and any form of containing, such as “contains” and “contain”) are inclusive or open-ended and do not exclude additional, unrecited elements or method steps. In embodiments of any of the compositions and methods provided herein, “comprising” may be replaced with “consisting essentially of” or “consisting of”. As used herein, the phrase “consisting essentially of” requires the specified integer(s) or steps as well as those that do not materially affect the character or function of the claimed invention. As used herein, the term “consisting” is used to indicate the presence of the recited integer (e.g., a feature, an element, a characteristic, a property, a method/process step or a limitation) or group of integers (e.g., feature(s), element(s), characteristic(s), propertie(s), method/process steps or limitation(s)) only.
The term “or combinations thereof” as used herein refers to all permutations and combinations of the listed items preceding the term. For example, “A, B, C, or combinations thereof” is intended to include at least one of: A, B, C, AB, AC, BC, or ABC, and if order is important in a particular context, also BA, CA, CB, CBA, BCA, ACB, BAC, or CAB.
Continuing with this example, expressly included are combinations that contain repeats of one or more item or term, such as BB, AAA, AB, BBC, AAABCCCC, CBBAAA, CABABB, and so forth. The skilled artisan will understand that typically there is no limit on the number of items or terms in any combination, unless otherwise apparent from the context.
As used herein, words of approximation such as, without limitation, “about”, “substantial” or “substantially” refers to a condition that when so modified is understood to not necessarily be absolute or perfect but would be considered close enough to those of ordinary skill in the art to warrant designating the condition as being present. The extent to which the description may vary will depend on how great a change can be instituted and still have one of ordinary skilled in the art recognize the modified feature as still having the required characteristics and capabilities of the unmodified feature. In general, but subject to the preceding discussion, a numerical value herein that is modified by a word of approximation such as “about” may vary from the stated value by at least ±1, 2, 3, 4, 5, 6, 7, 10, 12 or 15%.
Additionally, the section headings herein are provided for consistency with the suggestions under 37 CFR 1.77 or otherwise to provide organizational cues. These headings shall not limit or characterize the invention(s) set out in any claims that may issue from this disclosure. Specifically and by way of example, although the headings refer to a “Field of Invention,” such claims should not be limited by the language under this heading to describe the so-called technical field. Further, a description of technology in the “Background of the Invention” section is not to be construed as an admission that technology is prior art to any invention(s) in this disclosure. Neither is the “Summary” to be considered a characterization of the invention(s) set forth in issued claims. Furthermore, any reference in this disclosure to “invention” in the singular should not be used to argue that there is only a single point of novelty in this disclosure. Multiple inventions may be set forth according to the limitations of the multiple claims issuing from this disclosure, and such claims accordingly define the invention(s), and their equivalents, that are protected thereby. In all instances, the scope of such claims shall be considered on their own merits in light of this disclosure, but should not be constrained by the headings set forth herein.
All of the compositions and/or methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods of this invention have been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the compositions and/or methods and in the steps or in the sequence of steps of the method described herein without departing from the concept, spirit and scope of the invention. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims.
To aid the Patent Office, and any readers of any patent issued on this application in interpreting the claims appended hereto, applicants wish to note that they do not intend any of the appended claims to invoke paragraph 6 of 35 U.S.C. § 112, U.S.C. § 112 paragraph (f), or equivalent, as it exists on the date of filing hereof unless the words “means for” or “step for” are explicitly used in the particular claim.
For each of the claims, each dependent claim can depend both from the independent claim and from each of the prior dependent claims for each and every claim so long as the prior claim provides a proper antecedent basis for a claim term or element.
Subchondral tibial bone texture analysis predicts knee osteoarthritis progression: data from the Osteoarthritis Initiative: Tibial bone texture & knee OA progression. Osteoarthritis Cartilage. 2017; 25: 259-266.
Application of biomarkers in the development of drugs intended for the treatment of osteoarthritis. Osteoarthritis Cartilage. 2011; 19: 515-542.
This application claims priority based on U.S. Provisional Application No. 63/112,973, filed Nov. 12, 2021. The contents of which is incorporated by reference in its entirety.
This invention was made with government support under K08-AR070891 and P30-AR053483 awarded by the National Institutes of Health. The government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
63112973 | Nov 2020 | US |