Cerebral Palsy (CP) refers to a group of movement disorders with symptoms including poor coordination, stiff or weak muscles and tremors. There is currently no biomarker-based test to confirm or rule out CP or sub-classifications of CP.
In one aspect, the invention provides a method of detecting cerebral palsy in a subject. The method includes: determining a methylation state for each of a plurality of cytosine-guanine dinucleotide (CpG) sites in a sample obtained from the subject, calculating a differential methylation level based on the methylation states of the plurality of CpG sites, and comparing the differential methylation level to a predetermined reference level. When the differential methylation level deviates from the predetermined reference level, the presence of cerebral palsy is indicated in the patient.
In various embodiments, the plurality of CpG sites comprises at least one selected from the CpG sites listed in Table 4, Table 5, Table 6 or Table 7.
In various embodiments, the plurality of CpG sites comprises at least five selected from the CpG sites listed in Table 4, Table 5, Table 6 or Table 7.
In various embodiments, the plurality of CpG sites comprises at least ten selected from the CpG sites listed in Table 4, Table 5, Table 6 or Table 7.
In various embodiments, the plurality of CpG sites comprises at least m % selected from the top n most predictive CpG sites listed in Table 4, Table 5, Table 6 or Table 7. wherein: m is selected from the group consisting of: 50, 60, 70, 80, 90, 95, and 99; and n is selected from the group consisting of: 100, 500, 1,000, 2,000, and 2,500.
In various embodiments, the method further includes providing treatment for cerebral palsy to the patient when cerebral palsy is indicated. In various embodiments, the treatment for cerebral palsy includes the administration of medication, therapy or surgery.
In various embodiments, the plurality of CpG sites includes a plurality of up sites and a plurality of down sites. Methylation at the up sites indicates that the likelihood the patient has cerebral palsy is increased. Methylation at the down sites indicates that the likelihood the patient has cerebral palsy is decreased. Calculating a differential methylation level includes adding in a linear weighted summation values based on the methylation states of the up sites from a value based on the methylation states of the down sites.
In various embodiments, the sample is selected from the group consisting of a peripheral blood mononuclear cell sample, a blood sample, a muscle sample or a satellite cell sample. The blood sample may either be a whole blood sample or a buffy coat sample.
For a fuller understanding of the nature and desired objects of the present invention, reference is made to the following detailed description taken in conjunction with the accompanying figures.
The instant invention is most clearly understood with reference to the following definitions.
As used herein, the singular form “a,” “an,” and “the” include plural references unless the context clearly dictates otherwise.
Unless specifically stated or obvious from context, as used herein, the term “about” is understood as within a range of normal tolerance in the art, for example within 2 standard deviations of the mean. “About” can be understood as within 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.1%, 0.05%, or 0.01% of the stated value. Unless otherwise clear from context, all numerical values provided herein are modified by the term about.
As used in the specification and claims, the terms “comprises,” “comprising,” “containing,” “having,” and the like can have the meaning ascribed to them in U.S. patent law and can mean “includes,” “including,” and the like.
As used herein, “cerebral palsy” or “CP” refers to a superfamily of disorders generally considered to be caused by damage to or abnormalities within the developing brain and associated with impaired motor function including movement, posture, and balance.
As used herein, “cytosine-guanine dinucleotide site” or “CpG site” means a cytosine nucleotide followed by a guanine nucleotide in the genome of an organism. CpG sites can be designated with the number of the chromosome of the organism on which they are located and a number designating the position. The flanking sequences can be used to generate the position number. For example, “12.108079458” or “chr12 108079458” refer to a CpG site on chromosome 12 at position 108079458. The position number refers to the nucleotide index starting from 1 on the coding or plus (+) strand of the DNA molecule and specifically references the position of the 5′ cytosine in a CpG dinucleotide pair. In addition, this CpG location has a complementary sequence pair on the non-coding minus (−) strand and the position number also refers to that complementary strand cytosine which is located plus one nucleotide from the indicated coding strand position. Thus, for CpG 12.108079458, the methylation score values indexed to this specific site cover the cytosine on the coding strand of chromosome 12 at position number 108079458 and the cytosine on the non-coding strand of chromosome 12 at position number 108079459.
As used herein, “methylation” or “methylated” as applied to CpG sites refers to the addition of a methyl group to cytosine, forming either 5′-methyl-cytosine or 5′-hydroxymethyl-cystosine.
As used herein, the term “percent methylation” or “% MET” refers to the frequency with which a particular set of CpG sites are methylated. Here, CpG methylation is expressed as a percentage of methylated copies found in the DNA sample for each individual CpG site relative to the total number of copies found for each site.
A “reference level” with respect to some measurement used in diagnosis is indicative of the presence or absence of a particular phenotype or characteristic. When the level of the measurement in a subject deviates from the reference level it is indicative of the presence of, or relatively heightened level of, a particular phenotype or characteristic.
Unless specifically stated or obvious from context, the term “or,” as used herein, is understood to be inclusive.
Ranges provided herein are understood to be shorthand for all of the values within the range. For example, a range of 1 to 50 is understood to include any number, combination of numbers, or sub-range from the group consisting 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 (as well as fractions thereof unless the context clearly dictates otherwise).
Conclusive diagnosis of CP is valuable because it is a prerequisite for intervention and affords peace of mind to the patient and family. Accordingly, there is a need for an efficient and accurate test for CP. This disclosure addresses that need.
One aspect of the invention provides a method of detecting cerebral palsy in a subject. The method can include: determining a methylation state for each of a plurality of CpG sites in a sample obtained from the subject, calculating a differential methylation level based on the methylation states of the plurality of CpG sites, and comparing the differential methylation level to a predetermined reference level. Deviation of the methylation load level from the predetermined reference level indicates the presence of cerebral palsy in the patient.
The methylation state of the plurality of CpG sites may be determined by any means known in the art. In various embodiments, the methylation state of the plurality of CpG sites may be determined by methyl-sensitive restriction enzyme digestion followed by Next-Gen Sequencing on an appropriate instrument, or they may be determined by targeted qPCR assays to quantify cut and uncut CpG sites following methyl-sensitive restriction enzyme digestion, or they may be determined by bisulfite oxidation treatment with DNA sequencing (either direct or via NGS), or they may be determined by hybridization of labelled oligonucleotide probes (called “hybridization arrays”) to measure methylation following methyl-sensitive restriction enzyme digestion, or they may be determined by hybridization of anti-5′-methyl-cytosine antibodies to measure methylation after hybridization capture on a targeted gene panel.
Without wishing to be limited by theory, in various embodiments, the method relies on the concept of differential methylation level (ΔML)—site-specific differences in CpG methylation summed across a gene or genome domain, structure or element—in order to characterize functional shifts in methylation patterns. This method is illustrated in the example below and in Equation (1).
As shown in
In various embodiments, the plurality of CpG sites includes a plurality of “up sites” and a plurality of “down sites”. Up sites are CpG sites where methylation at the site indicates an increased methylation load in CP patients relative to non-CP matched control patients. Down sites are CpG sites where methylation at the site indicates a decreased methylation load in CP patients relative to non-CP matched control patients. Table 3 illustrates one possible approach by which these sites may be scored in order to determine ΔML.
Various embodiments of the invention are directed to methods of examining sets of up sites and down sites and determining a differential methylation level based on their methylation state. Due to the predictive value of the plurality of sites, the scores will differ when examining patients with and without CP.
A predetermined reference level may be calculated by determining the methylation state of a set of sites in patients who are known not to have CP. Accordingly, when a differential methylation level is determined using the same plurality of site, deviation from the reference level indicates an additional evidentiary datum point supporting increasing the probability that the patient likely has CP.
A skilled person will understand that the specifics of the calculation used for generating a differential methylation level are not critical and various processes may be employed to generate these levels. All of them are within the scope of various embodiments of the invention.
Various embodiments of the invention rely on pluralities of CpG sites of various sizes. In various embodiments, the plurality of CpG sites may contain about 5, 10, 100, 1000, 5000 or more CpG sites.
Even among the CpG sites that have shown predictive power, different sites contribute different weightings to the overall predictive probability of the CP status of a patient. Various embodiments of the invention calculate differential methylation levels based on various combinations of predictive CpG sites.
The predictive power of the sites may be quantified in various ways. As shown in
The predictive power of CpG sites may also be quantified based on P-value, adjusted for False Discovery Rate (FDR). Table 5 lists the top 40 CpG sites for peripheral blood mononuclear cell samples by P-value. In various embodiments, the plurality of CpG sites may include one or more sites selected from Table 5. In some embodiment, the plurality of CpG sites may include the top n sites or m of the top n sites from Table 5 or other ordinal lists (wherein m and n are positive integers). Table 7 lists the top 1000 CpG sites for blood, muscle and muscle satellite cell samples by P-value.
In various embodiments, the method further includes providing treatment for CP to patients in whom CP is indicated. The treatment may include any form of standard of care treatment for CP accepted by the extended medical community. CP as a group of disorders includes many variations that require various interventions to improve the quality of life of the CP patient and any of these may be included in various embodiments of the invention.
The method may be practiced by analyzing a variety of sample types. In various embodiments, the sample may be a peripheral blood mononuclear cell sample, a blood sample, a muscle cell sample or muscle satellite cell sample. In various embodiments, the sites selected from Table 4, Table 5, Table 6 or Table 7 can be matched to the sample type used.
In various embodiments the treatment for CP may include surgery. In various embodiments, the surgery may include gastroenterology surgery, hearing correction surgery, medicine related surgery, e.g. insertion of a baclofen pump, orthopedic surgery, neurosurgery, urologic surgery and vision correction surgery.
In various embodiments the treatment for CP may be therapy. In various embodiments the therapy may include occupational therapy, physical therapy and speech therapy.
In various embodiments, the treatment for CP may include the administration of medication. In various embodiments the administration of medication may include medication to treat seizures, involuntary movement, spasticity, incontinence and gastroesophageal reflux. In various embodiments the treatment for CP may include anticonvulsants, muscle relaxants, benzodiazepines, nerve blocks, botulinum toxin (BTX) (available, e.g., under the BOTOX® trademark from Allergan, Inc. of Irvine, Calif.), baclofen and anticholinergics.
The invention is further described in detail by reference to the following experimental example. This example is provided for purposes of illustration only, and is not intended to be limiting unless otherwise specified. Thus, the invention should in no way be construed as being limited to the following example, but rather, should be construed to encompass any and all variations which become evident as a result of the teaching provided herein.
A cohort of 34 patients was identified and genomic DNA was extracted from each patient. Samples were prepared for sequencing analysis using a methyl-sensitive restriction enzyme digestion and library preparation followed by Next-Gen Sequencing (NGS) on an ILLUMINA® HISEQ® X10™ system with >30× coverage. Sequencing data was uploaded to secure cloud-based computational infrastructure where it was processed and analyzed for site-specific DNA methylation profiles.
The samples used in this study are listed in Table 1 with the codes employed for analyses and the genomic DNA QC values. The sequencing vendor Macrogen, Inc. performed a second round of QC upon sample receipt. Genomic DNA samples received from Nemours Biomedical Research Center were coded internally with GP numbers. DNA quantity and quality used for sequencing is reported in the data columns.
Statistical analyses were conducted for pairwise contrasts using a code system of “xGrp1.xGrp2” where Grp1 and Grp2 designate the sample groups as: non-CP (CN) and Cerebral Palsy (CP). When differential methylation loads (ΔML) are presented, those values are calculated as the difference of Grp1 minus Grp2. Thus, if ΔML>0, methylation in Grp1 is greater than in Grp2 and vice versa. The specific contrasts with sample information are presented in Table 2 (below).
Note that the statistical results reported in this document utilized a total patient cohort size of 32 (16 CN vs 16 CP) after two samples were omitted due to young age of patients (<5 yo).
ΔML was calculated as the summation of the difference in % MET scores for each CpG site within the defined region or structure being scored, averaged by the number of CpG sites present:
where first and last CpG indexes are defined by the gene unit across which the summation score is being calculated. As the calculation is applied in Example 1, Grp1 is a control (non-CP) cohort and Grp2 contains patients with CP. Thus, positive ΔML values indicate more methylation present in Grp1 and negative values indicate more methylation in Grp2.
In this study, results from 5 levels of analyses were compared: (a) single CpG sites, (b) CpG Islands, (c) micro-CpG Islands, (d) 2 kb upstream promoter domains, and (e) Transcriptional Start Site (TSS) domains. The single CpG sites are simply the comparison of the individual site methylation score values. The CpG Islands are defined by the UCSC Genome Browser bed file and contain domains with greater than 50% G+C nucleotide content, greater than 0.60 ratio of Observed to Expected frequency of CpG sites, and a minimum length of 200 nucleotides. A subset of smaller domain definitions was also employed. The TSS domains are demarcated as a 2 kb window centered on the TSS (+/−1 kb down/up-stream). The methylation signal across the larger domain structures were all non-informative. This means that a discriminating signal was only observed at the level of single CpG site scores. Only the CpG data is included here in this report.
The pairwise analysis in Table 2 provides a direct contrast and allows for several graphical visualization plots to be generated these are presented in
Table 3 presents summation counts of the significant changes in methylation scores based on P-values adjusted for site-specific false discovery rates (FDR). The comparisons are executed as non-CP-minus-CP cohorts in the treatment code header, such that positive values mean methylation in the non-CP cohort is greater (UP) and negative values mean methylation in the CP cohort is greater (DOWN). Site-specific dispersion was estimated to equalize CpG variances. A Likelihood Ratio Test was used with a defined one-way ANOVA model for pairwise tests.
The following tables rank CpG sites according to various metrics associated with their predictive power or their location in various genes or pathways.
The predictive power of each CpG site may be ranked by the P-value of the individual sites in differentiating between non-CP and CP patients. Table 5 ranks the top 40 sites by P-value.
The predictive power of each CpG site may be ranked by the P-value of the individual sites in differentiating between non-CP and CP patients. Table 7 ranks the top 1000 sites by P-value.
Although preferred embodiments of the invention have been described using specific terms, such description is for illustrative purposes only, and it is to be understood that changes and variations may be made without departing from the spirit or scope of the following claims.
The entire contents of all patents, published patent applications, and other references cited herein are hereby expressly incorporated herein in their entireties by reference.
The application claims priority to U.S. Provisional Patent Application No. 62/603,486, filed Jun. 1, 2017, which is incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2018/035335 | 5/31/2018 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62603486 | Jun 2017 | US |