THIS INVENTION relates to methods of determining in a subject a predisposition to, or increased risk for, developing a tendon, ligament, or other soft tissue injury or pathology. The invention also relates to molecular markers; isolated nucleic acid molecules; primers and oligonucleotide sets; and detection reagents capable of detecting one or more single nucleic acid polymorphisms; for use therein.
Tendon and ligament pathologies as well as exercise associated muscle cramping (EAMC) can affect subjects participating in a range of sporting pursuits, as well as occurring in the less physically active (Kader et al. Br J Sports Med 2002; 36:239-49; Young et al. Foot Ankle Clin 2005; 10: 371-382). These pathologies affect soft tissues, such as skeletal muscles, tendons and ligaments, and their surrounding structures (Puddu et al. Am J Sports Med 1976; 4:145-50), and include, for example, Achilles tendinopathy (AT), acute spontaneous rupture, and injury to the anterior cruciate ligament (ACL).
Achilles tendinopathy (AT) is a degenerative condition involving inflammation of the Achilles tendon and is often caused by overuse or mechanical overload of the Achilles tendon. Acute spontaneous rupture also commonly affects the Achilles tendon, particularly in the middle-aged, male athlete. Injury to the anterior cruciate ligament (ACL) is one of the more severe sporting injuries, the risk of which is increased by movements involving a sudden deceleration or change in direction.
A number of intrinsic and extrinsic factors have been implicated in raising the risk of these pathologies (Jarvinen et al. Foot Ankle Clin 2005; 10: 255-266). Intrinsic factors include genetic variability in several genes that are known to be associated with increased risk of these pathologies. These genes include, for example, the α chain of type V collagen (COL5A1), tenascin C (TNC), enzymes that breakdown the matrix such as matrix metalloproteinases (MMP-3), and inflammatory process genes such as the inflammatory cytokine, growth differentiating factor, and IL-1β, IL-1RN and IL-6 (September et al., Br J Sports Med 2011; 45:1040-1047). Polymorphisms in some of these genes have been found to be associated with exercise related phenotypes (Collins and Posthumus, Exercise and Sport Sciences Reviews, 2011 39(4), 191-198) including AT (Mokone et al. Scand J Med Sci Sports 2006; 16:19-26; September et al. Br J Sports Med 2009; 43:357-365; Jarvinen et al. J Cell Sci 2003; 116(Pt 5):857-866; Jarvinen et al. J Cell Sci 1999; 112 (Pt 18):3157-3166); anterior curaciate ligament ruptures (ACL) (Posthumus et al. Am J Sports Med 2009; 37(11), 2234-2240). More recently polymorphisms with COL5A1 were associated with ROM (Collins et al. Scand J Med Sci Sports 2009, 19(6), 803-810; Brown et al. Scand J Med Sci Sports, 2011 21(6), e266-72.) and athletic performance (Posthumus et al. Med Sci Sports Exerc 2011, 43(4), 584-589; Brown et al., IJSPP 2011 in press).
Extrinsic factors include, for example, repetitive loading which may impede repair of damages tendons. For instance, tenocytes are required to maintain homeostasis of the extracellular matrix (ECM) by regulating the balance between ECM synthesis and degradation (Clancy American Orthopedic Society for Sports Medicine: Park Ridge, II. 1989), During the normal tendon healing process damaged tenocytes are removed by cytokine-mediated apoptosis. Repetitive loading may, however, change the extracellular matrix (ECM) composition and result in excessive tenocyte apoptosis (Yuan et al. J Orthop Res 2002; 20:1372-1379; Egerbacher et al. Clin Orthop Relat Res 2008; 466:1562-1568). Excessive tenocyte apoptosis which has been observed in tendinopathy (Scott et al. Br J Sports Med 2005; 39:e25) may compromise the ability of the tendon to regulate repair processes.
There is a need for improved methods of determining in a subject a predisposition to, or increased risk for, developing a tendon, ligament, or other soft tissue injury or pathology.
According to one aspect of the invention, there is provided a method of determining in a subject a predisposition to, or increased risk for, developing a tendon, ligament, or other soft tissue injury or pathology, the method comprising the step of screening the subject for the presence of at least one polymorphism in at least one gene selected from the group comprising any one or more of:
The tendon, ligament, or other soft tissue injury or pathology, may be selected from the group including tendon injuries, ligament injuries, EAMC, ROM, and endurance running performance.
According to another aspect of the invention, there is provided a method of determining in a subject a predisposition to, or increased risk for, developing a tendon, ligament, or other soft tissue injury or pathology, the method comprising the step of screening the subject for the presence of at least one polymorphism within a collagen V gene COL5A1, and at least one polymorphism in at least one gene selected from the comprising:
The method may further include the step of screening the subject for gender.
The polymorphism of the COL5A1 gene may be rs71746744, rs16399 and/or rs1134170 within the 3′-untranslated region (UTR) of the alpha 1 chain of the COL5A1 gene. The polymorphism of the MIR608 gene may be rs4919510. The polymorphism of the CASP8 gene may be rs1045485 and rs3834129.
More particularly, the polymorphism of the COL5A1 gene may be rs71746744 (-/AGGG), rs16399 (ATCT/-) and/or rs1134170 (A/T) within the 3′-untranslated region (UTR) of the alpha 1 chain of the COL5A1 gene. The polymorphism of the MIR608 gene may be rs4919510 (C/G). The polymorphism of the CASP8 gene may be rs1045485 (G/C, D302H) and rs3834129 (CTTACT/del).
More specifically, the method may include the step of detecting or screening for the presence of a polymorphism in the COL5A1 gene which has modified, augmented, or mitigated interaction with a MIR608 polymorphism product or a CASP8 gene product, when compared to a wild-type interaction. More particularly, the COL5A1 gene polymorphism may be a polymorphism which has a modified, augmented, or mitigated interaction with the rs4919510 (C/G) MIR608 polymorphism, and/or the rs1045485 (G/C, D302H) CASP8 polymorphism; and/or the rs3834129 (CTTACT/del) CASP8 polymorphism, and/or any other linked polymorphism, and the product encoded thereby.
The polymorphism of the COL5A1 gene may be rs71746744, rs16399 and/or rs1134170 within the 3′-untranslated region (UTR) of the alpha 1 chain of the COL5A1 gene. The polymorphism of the GDF5 gene may be rs143383. The polymorphism of the CASP8 gene may be rs1045485 and/or rs3834129. The polymorphism of the IL6 gene may be rs1800795. The polymorphism of the IL1B gene may be rs1143627 and/or rs16944. The polymorphism of the MIR608 gene may be rs4919510.
More particularly, the polymorphism of the COL5A1 gene may be rs71746744 (-/AGGG), rs16399 (ATCT/-) and/or rs1134170 (A/T) within the 3′-untranslated region (UTR) of the alpha 1 chain of the COL5A1 gene. The polymorphism of the GDF5 gene may be rs143383 (T/C). The polymorphism of the CASP8 gene may be rs1045485 (G/C, D302H) and/or rs3834129 (CTTACT/del). The polymorphism of the IL6 gene may be rs1800795 (G/C). The polymorphism of the IL1B gene may be rs1143627 (T/C) and/or rs16944 (C/T). The polymorphism of the MIR608 gene may be rs4919510 (C/G).
According to another aspect of the invention, there is provided a molecular marker for use in diagnosing a predisposition to, or increased risk for, developing tendon, ligament, or other soft tissue pathology or injury in a subject, the molecular marker comprising any one or more of:
The tendon, ligament, or other soft tissue pathology or injury may be EAMC.
The molecular marker may be DNA-based, RNA-based, or other combinations of nucleic acids or modified bases.
The molecular marker may comprise an isolated nucleic acid fragment that is a part of, or a fragment derived from, the group comprising a COL5A1 gene, a MIR608 gene, a CASP8 gene, a GDF5 gene, a IL6 gene, and a IL1B gene, the fragment being between 10 and 40, preferably between 15 and 35, more preferably between 20 and 30 nucleic acids in length, and which hybridizes under stringent hybridization conditions to at least a portion of the COL5A1 gene, the MIR608 gene, the CASP8 gene, the GDF5 gene, the IL6 gene, or the IL1B gene. This may include sequences complementary to the marker, and sequences having substitutions, deletions or insertions, sequences which can hybridize under strict hybridization conditions thereto, and functional discriminatory truncations thereof.
In one embodiment, the molecular marker is a polymorphic sequence variant or a polymorphism. The polymorphism may be any one or more of the polymorphisms selected from the group comprising rs71746744, rs16399 and rs1134170 within the 3′-untranslated region (UTR) of the alpha 1 chain of the COL5A1 gene; rs4919510 of the MIR608 gene; rs1045485 and rs3834129 of the CASP8 gene; rs143383 of the GDF5 gene; rs1800795 of the IL6 gene; and rs1143627, rs16944 of the IL1B gene; together with any other polymorphism closely linked (i.e. which is in high linkage disequilibrium) with any of the specific polymorphisms listed above.
More particularly, the polymorphisms may be selected from the group comprising:
More particularly, the molecular marker may be, or may be detectable using, any one or more isolated oligonucleotides selected from the group comprising: SEQ. ID. NO. 1 to SEQ. ID. NO. 18; sequences complementary thereto, sequences which can hybridize under stringent hybridization conditions thereto, and functional discriminatory truncations thereof.
Accordingly, the invention extends to a primer or oligonucleotide sets for use in detecting or diagnosing a predisposition to, or increased risk for, developing tendon, ligament, or other soft tissue pathologies or injuries in a subject, the primer or oligonucleotide sets comprising isolated nucleic acid sequences selected from the group comprising: Set 1: SEQ. ID. NO. 1 and SEQ. ID. NO. 2; Set 2: SEQ. ID. NO. 3 and SEQ. ID. NO. 4; Set 3: SEQ. ID. NO. 5 and SEQ. ID. NO. 6; Set 4: SEQ. ID. NO. 7 and SEQ. ID. NO. 8; Set 5: SEQ. ID. NO. 9 and SEQ. ID. NO. 10; Set 6: SEQ. ID. NO. 11 and SEQ. ID. NO. 12; Set 7: SEQ. ID. NO. 13 and SEQ. ID. NO. 14; Set 8: SEQ. ID. NO. 15 and SEQ. ID. NO. 16; Set 9: SEQ. ID. NO. 17 and SEQ. ID. NO. 18; sequences complementary thereto, sequences which can hybridize under stringent hybridization conditions thereto, and functional discriminatory truncations thereof.
According to a further aspect of the invention, there is provided an isolated nucleic acid molecule for detecting at least one SNP provided hereinbefore, wherein the nucleic acid molecule comprises less than 40, less than 30, less than 20, or even preferably less than 10 contiguous nucleotides selected from the group comprising SEQ ID NOS 1 to 18, and fragments, complementary sequences, sequences which can hybridize under stringent hybridization conditions thereto, and functional discriminatory truncations thereof.
The invention extends also to a detection reagent capable of detecting one or more single nucleic acid polymorphisms selected from the group comprising the polymorphisms listed hereinbefore, fragments thereof, sequences complementary thereto, sequences which can hybridize under stringent hybridization conditions thereto, and functional discriminatory truncations thereof.
According to another aspect of the invention, there is provided a diagnostic assay comprising any one or more of the markers described hereinbefore, fragments thereof, sequences complementary thereto, sequences which can hybridize under stringent hybridization conditions thereto, and functional discriminatory truncations thereof.
According to yet another aspect of the invention, there is provided a method of determining a predisposition for, or increased risk of, developing a tendon, ligament and/or soft tissue pathology or injury in a subject, the method comprising the steps of screening a subject for a polymorphism in one or more of the following genes:
According to another aspect of the invention, there is provided a method of determining a predisposition for, or increased risk of, developing a tendon, ligament and/or soft tissue pathology or injury in a subject, the method comprising the step of screening the subject for the presence of at least one polymorphism in the collagen V gene, COL5A1, and at least one polymorphism in at least one gene selected from the group comprising:
According to a further aspect of the invention, there is provided a method of diagnosing a predisposition to, or increased risk for, developing a tendon, ligament, or other soft tissue injury or pathology, the method comprising the steps of:
The tendon, ligament, or other soft tissue injury or pathology, may be selected from the group including tendon injuries, ligament injuries, EAMC, ROM, and endurance running performance.
According to another aspect of the invention, there is provided a method of diagnosing a predisposition to, or increased risk for, developing a tendon, ligament, or other soft tissue injury or pathology, the method comprising the steps of
The polymorphism may be any one of more of the polymorphisms listed hereinbefore, polymorphisms in high linkage disequilibrium with the listed polymorphisms, or a polymorphism detectable using any one or more of the sequences listed hereinbefore, fragments thereof, sequences complementary thereto, sequences which can hybridize under stringent hybridization conditions thereto, and functional discriminatory truncations thereof.
The method may further include the step of screening the subject for gender.
The method may include the additional steps of:
According to another aspect of the invention, there is provided use of a molecular marker of the invention in diagnosing a predisposition to a soft tissue pathology in a subject.
According to a still further aspect of the invention, there is provided a kit for use in diagnosing a predisposition to a soft tissue pathology in a subject, the kit comprising:
The kit may further include any one or more of reagents, such as buffers, DNases, RNAses, polymerases, instructions, and the like.
The molecular markers may be any one or more markers selected from the markers listed hereinbefore.
The soft tissue may be a connective tissue injury, and may include tendon and/or ligament injuries such as, for example, Achilles tendon, knee ligament and ankle ligament pathologies. The sample may comprise an animal tissue or blood sample, such as a human tissue or blood sample.
Further features of the invention will now be described with reference to the following non-limiting examples and figures.
In the drawings:
SEQ. ID. NO. 1: is the forward primer for COL5A1 (A/T) rs1134170;
SEQ. ID. NO. 2: is the reverse primer for COL5A1 (A/T) rs1134170;
SEQ. ID. NO. 3: is the forward primer for COL5A1 (-/AGGG) rs71746744;
SEQ. ID. NO. 4: is the reverse primer for COL5A1 (-/AGGG) rs71746744;
SEQ. ID. NO. 5: is the forward primer for IL-1β (T>C) (rs1143627);
SEQ. ID. NO. 6: is the reverse primer for IL-1β (T>C) (rs1143627);
SEQ. ID. NO. 7: is the forward primer for IL-6 (G/C) (rs1800795);
SEQ. ID. NO. 8: is the reverse primer for IL-6 (G/C) (rs1800795);
SEQ. ID. NO. 9: is the forward primer for COL5A1 (ATCT/-) rs16399;
SEQ. ID. NO. 10: is the reverse primer for COL5A1 (ATCT/-) rs16399;
SEQ. ID. NO. 11: is the forward primer for CASP8 (CTTACT/del) (rs3834129);
SEQ. ID. NO. 12: is the reverse primer for CASP8 (CTTACT/del) (rs3834129);
SEQ. ID. NO. 13: is the forward primer for CASP8 (G/C) D302H (rs1045485);
SEQ. ID. NO. 14: is the reverse primer for CASP8 (G/C) D302H (rs1045485);
SEQ. ID. NO. 15: is the forward primer for IL-1β (C/T) (rs16944);
SEQ. ID. NO. 16: is the reverse primer for IL-1β (C/T) (rs16944);
SEQ. ID. NO. 17: is the forward primer for GDF5 (T/C) (rs143383);
SEQ. ID. NO. 18: is the reverse primer for GDF5 (T/C) (rs143383);
SEQ. ID. NO. 19: is the sequence of Has-miR-608 with a C at the 22nd position;
SEQ. ID. NO. 20: is the sequence of Has-miR-608 with a G at the 22nd position.
SEQ. ID. NO. 21: is the sequence of the MiR608 gene (ENSE00001499827); and
SEQ. ID. NO. 22: is the sequence of the rs4919510 polymorphism.
For the purposes of this specification, a “polymorphism” may include a change or difference between two related nucleic acids. A “nucleotide polymorphism” refers to a nucleotide which is different in one sequence when compared to a related sequence when the two nucleic acids are aligned for maximal correspondence. A “probe” or “molecular marker” is an RNA sequence(s) or DNA sequence(s) or analogues, modified versions, or the complement of the sequences shown. This may include a “genetic marker”, which is a region on a genomic nucleic acid mapped by a molecular marker or probe. A “probe” is a composition labeled with a detectable label. A “probe” is typically used herein to identify a marker nucleic acid. A polynucleotide probe is usually a single-stranded nucleic acid sequence that can be used to identify complementary nucleic acid sequences, or may be a double- or higher order-stranded nucleic acid sequence which can be used to bind to, or associate with, a target sequence or area, generally following denaturing. The sequence of the polynucleotide probe may or may not be known. An RNA probe may hybridize with its corresponding DNA gene, or to a complementary RNA, or to other type of nucleic acid molecules. As used herein the term “functional discriminatory truncations” mean nucleic acid sequences, modified nucleic acid sequences, or other nucleic acid variants which, although they are truncated forms of sequences presented herein or variants thereof, can still bind in a discriminatory manner to target gene or nucleic acid sequences described herein and forming part of the present invention. The terms “isolated” or “biologically pure” refer to material which is substantially or essentially free from components which normally accompany it as found in its native state. An “amplified mixture” of nucleic acids includes multiple copies of more than one (and generally several) nucleic acids. “Stringent hybridization conditions” in the context of nucleic acid hybridization are sequence dependent and are different under different environmental parameters. Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Highly stringent conditions are selected to be equal to the Tm point for a particular probe. An example of stringent wash conditions for, say, a Southern blot of such nucleic acids is a 0.2×SSC wash at 65° C. for 15 minutes. Such a high stringency wash may be preceded by a low stringency wash to remove background probe signal. An example of a low stringency wash is 2×SSC at 40° C. for 15 minutes. In general, a signal to noise ratio of 2× (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization event. For highly specific hybridization strategies such as allele-specific hybridization, an allele-specific probe is usually hybridized to a marker nucleic acid (e.g., a genomic nucleic acid, an amplicon, or the like) comprising a polymorphic nucleotide under highly stringent conditions.
Briefly, a total of 358 unaffected control (CON) participants [159 South Africa (SA CON) and 199 Australia (AUS CON)] and 166 affected AT (TEN) participants (87 SA TEN and 79 AUS TEN) were genotyped for the variants CASP8 (rs384129) and CASP8 (rs1045485). Logistic regression was used to derive risk models for AT. A receiver operator characteristic (ROC) curve was plotted to determine the effectiveness of a model to capture TEN risk. This study indicates the independent association of CASP8_rs1045485 and CASP8_rs3834129 as well as their haplotype with TEN risk and the identification of an optimal model which included genetic loci CASP8_rs384129 and CASP8_rs1045485 together with gender to capture TEN risk in both SA and AUS.
The South African (SA) and Australian (AUS) participants were of self-reported European Caucasian ancestry. A total of 159 asymptomatic control participants (designated as SA CON) and 87 with diagnosed Achilles tendinopathy (designated as SA TEN) together with 199 asymptomatic control participants (designated as AUS CON) and 79 diagnosed Achilles tendinopathy (designated as AUS TEN) participants were recruited for the study as previously described (Mokone et al Am J Sports Med 2005; 33:1016-1021; Mokone et al. Scand J Med Sci Sports 2006; 16:19-26; September et al. Br J Sports Med 2011; 45:1040-1047; September et al. Int J Sports Med 2008; 29:257-263; September et al. Br J Sports Med 2009; 43:357-365).
Participants signed informed consent forms according to the Declaration of Helsinki, provided personal particulars and completed a questionnaire regarding medical history (September et al. Int J Sports Med 2008; 29:257-263). Approval for the study was obtained from the Research Ethics Committee of the Faculty of Health Sciences, The University of Cape Town (reference number 172/2005) and Human Ethics Committee of La Trobe and Deakin Universities, Melbourne, Australia.
DNA was extracted for all participants as previously described (Mokone et al Am J Sports Med 2005; 33:1016-1021; September et al. Br J Sports Med 2009; 43:357-365).
CASP8 (Srivastava et al. Mol Carcinog 2010; 49:684-692) rs3834129 and rs1045485 were investigated. Genotyping of rs384129, rs1045485 and rs2779249 was conducted using the Taqman method according to standard techniques and rs1799983 was genotyped using restriction fragment length polymorphism analysis.
Basic characteristics of the study groups were presented and summarized previously (Mokone et al Am J Sports Med 2005; 33:1016-1021; Mokone et al. Scand J Med Sci Sports 2006; 16:19-26; September et al. Br J Sports Med 2011; 45:1040-1047; September et al. Int J Sports Med 2008; 29:257-263; September et al. Br J Sports Med 2009; 43:357-365; Fu et al. J Hypertens 2009; 27:991-1000).
The relationship between the genotypes and AT susceptibility was tested and found not to differ significantly between the countries. The data from the population groups were combined for all further analyses. Age, gender, country and whether the individual was born in the specific country were considered confounders and were adjusted for in all analyses by including them in the models as fixed effects. Logistic regression was used to compare the TEN and CON groups, as well as the countries with respect to genotype, allele and allele-combination frequencies. Significant genotype associations were further examined to determine whether it was the result of heterozygote, recessive or a dominant effect, by recoding the genotypes appropriately in the logistic regression models. Haplotype and allele combination associations were tested for additive, dominant and recessive models on the logit scale.
Logistic regression was used to derive risk models for AT. Three models were constructed; the first incorporated the four known confounders and the genotypes at the four loci implicated in the apoptosis signalling cascade (rs384129, rs1045485, rs1799983, 2779249), The second contained the same factors as the first, plus the interleukin loci previously genotyped (rs1800795; rs16944; rs1143627). The optimal model was backwards selected from the first, using Akaike criterion.
A receiver operating characteristic (ROC) curve18 was constructed for each of the three logistic regression models to compare the effectiveness of each model to predict TEN risk. The area under the ROC curve (AUC) was used to quantify the overall ability of the model to discriminate between diagnostic groups based on genotype risk.
Results corresponding to a p-value of less than 0.05 were described as significant. The programming environment, R (R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria 2010.) and R packages were used for all analyses. The R package, genetics (Warnes et al. R package version 1 3 4 2008) was used to estimate genotype and allele frequencies and Hardy-Weinberg equilibrium probabilities. Frequencies of allele combinations were inferred and analysed using the R package, haplo.stats. (Schaid et al. Am J Hum Genet 2002; 70:425-434; Sinnwell et al. R package version 1 4 4 2009). ROC curves were created using the R package Epi (Carstensen et al. R package version 1.1.20.2011)
Genotype and minor allele frequency distributions for each of the polymorphisms together with the HWE p-values are shown in
CASP8 rs3834129
A significant difference in the genotype (p=0.0294) but not the allelic distribution (p=0.2528), was detected for rs3834129 between the CON and TEN groups, after adjusting for the confounders. A heterozygote advantage model provided the best fit (OR=0.61; p=0.0141; 95% CI: 0.40-0.90); the odds of TEN with D/I is 39% less than the odds with either (I/I or D/D) homozygote. A dominant model for the minor allele (I) also provided a significant fit (OR=0.60; p=0.0215; 95% CI: 0.38-0.93); with a D/D genotype the odds of TEN is 67% more (OR=1.67; CI 1.08 to 2.60) than the D/I or I/I genotypes. The distribution for rs3834129 were similar between SA and AUS (p=0.860 and p=0.3578) after adjusting for the confounders.
CASP8 rs1045485
Highly significant differences in the genotype (p=0.0009) and allele (p=0.00027) distributions were detected for rs1045485 between the two countries after adjusting for the confounders. Significant differences were detected in the genotype (p=0.0213) and allele (p=0.0097) distributions between the CON and TEN groups, after adjusting for the confounders. The highly significant allelic effect can be interpreted as: Each C allele reduces the odds of TEN by 41% (OR=0.59; 95% CI: 0.39-0.87). Investigating the significant genotype effect showed two transmission models with highly significant fits, the heterozygote advantage (only the G/C genotype) reduces the odds of TEN (OR=0.56; p=0.0094; 95% CI: 0.35-0.86) compared to both homozygotes; and the dominant model (any minor C allele) either C/C or C/G genotype reduces the odds of TEN (OR=0.55; p=0.0065; 95% CI: 0.35-0.84) compared to G/G.
Frequencies were inferred for the allele combinations rs384129, rs1045485, rs1799983 and rs2779249. The most common allele combinations were D-G-G-C (CON=17%; TEN=21%) and I-G-G-C (CON=18%; TEN 16%); whereas both I-C-T-A and I-C-T-C were detected at frequencies below 1% in CON. The four-way allelic combination was not significantly associated with AT susceptibility after adjusting for the confounders and similarly nor were any of the 3-way allelic combinations.
The CASP8 inferred haplotype was significantly associated with AT risk for additive (p=0.0210), dominant (p=0.0052) and recessive (p=0.0036) allelic combination models. The D-C inferred allele combination was present in 15% of CON and 9% of TEN and showed a dominant protective effect such that an individual needs only one of those combinations to be protected against AT. While the D-G inferred allele combination was present in 35% of CON and 45% of TEN; showing a recessive risk effect such that you need to be homozygous for D-G allele combination to be at increased risk of AT.
The model which contained genotypes for rs18000795, rs16944, rs1143627, rs384129, rs045485, rs1799983 and rs2779249) and the confounders had an AUC=0.705 (244 CON and 116 TEN); sensitivity=45.7 and specificity=84%.
The factors which jointly contributed to the optimal model for evaluating risk assessment of AT were the genetic loci rs384129 and rs1045485, and gender (
The inventors have surprising found an association of the CASP8 polymorphisms and their haplotype, and have identified an apoptosis polygenic profile for indicating an increased risk of AT.
The recessive model for rs3834129 suggests that individuals with a D/D genotype have a 68% higher risk of AT than those with either I/I or D/I genotypes. This finding is unexpected since the del allele destroys a Sp1 binding element which results in decreased caspase-8 expression (Sun et al. Nat Genet 2007; 39:605-613). Reduced caspase-8 expression was expected to protect against excessive apoptosis and the deletion allele was therefore expected to protect against AT. Interestingly, the heterozygote advantage model predicts that subjects that are heterozygous, D/I genotype, at this locus have a reduced risk compared to subjects having either homozygote (D/D and I/I).
At the rs1045485 locus, there was a highly significant protective effect of the heterozygote genotype, C/G compared to G/G genotypes. The C/C genotype was rare (2%) so it was not surprising that a heterozygote (G/C reduces AT risk by 44% compared to G/G+C/C) and an additive allelic (each C allele reduces risk by 41%) and a dominant model (C/C+C/G reduces AT risk by 45% compared to G/G) all provided highly significant fits.
The CASP8 polymorphism associations were mirrored in the CASP8 haplotype. The CASP8 D-C haplotype was associated with reduced AT risk in the additive, dominant and recessive allelic combination models. Genotyping other SNPs in the region implicated by the haplotype may provide more informative haplotypes in identifying the critical casual region.
Lastly, this study suggests that the more biomarkers incorporated into the design of a risk profile, the greater the effectiveness to predict risk, as would be expected in a polygenic condition (AUC=0.705). A preferred risk model suggests that the two loci CASP8 together with gender is sufficient to predict AT risk (AUC=0.667).
Another preferred model estimates that the minimum risk for AT occurs in females who are homozygous C/C and heterozygous D/I for rs384129 and rs045485 respectively and on the contrary males with the G/G and D/D genotypes at the two CASP8 loci are at maximum risk for AT. Although all inferred allele combinations were not significantly associated with AT risk, the ROC analysis suggests that the loci are collectively able to discriminate between affected and unaffected individuals. This suggests that the cumulative effect of these protein products contribute to AT risk.
Collectively, these results further implicate the apoptosis signalling cascade as one of the biological pathways involved in the development of AT. The associations observed in this study should be explored in larger independent groups to elucidate the biological significance of the apoptosis signalling cascade in musculoskeletal soft tissue injuries.
Three hundred and forty-two asymptomatic control participants (CON) and 160 with diagnosed Achilles tendinopathy (TEN) were included in this study. The TEN and CON participants were recruited from South African (SA TEN, N=81 and SA CON=149) and Australia (AUS TEN, N=79 and AUS CON=193) as previously described Mokone et al. Am J Sports Med 2005; 33:1016-1021, Mokone et al. Scand J Med Sci Sports 2006; 16:19-26, September et al. Br J Sports Med 2009; 43:357-365, September et al. Br J Sports Med 2011; 45:1040-1047). All participants were of self-reported European Caucasian ancestry. The profiles of the CON and TEN SA and AUS participants were previously described in detail (September et al. Br J Sports Med 2011; 45:1040-1047).
All the participants signed an informed consent form prior to participation in this study. Approval for the study was obtained from the Human Research Ethics Committee of the Faculty of Health Sciences at the University of Cape Town and the Human Ethics Committee of La Trobe and Deakin Universities, Melbourne, Australia.
DNA was extracted for the SA (Mokone et al. Am J Sports Med 2005) and AUS participants as previously described (September et al. Br J Sports Med 2009; 43:357-365).
The participants were genotyped for the rs4919510 (-/AGGG; 91 TEN and 198 CON), rs16399 (ATCT/-; 120 TEN and 254 CON) and rs1134170 (A/T; 107 TEN and 241 CON) polymorphisms within the 3′-UTR of the COL5A1 gene. Genotyping was performed using custom designed Fluorescence-based Taqman® PCR assays (Applied Biosystems, Foster City, Calif., USA). Allele specific probes and flanking primer sets (sequences available on request) were used along with a pre-made PCR mastermix containing ampliTaq® DNA polymerase Gold (Applied Biosystems, Foster City, Calif., USA) in a final reaction volume of 8 μl. The two-step PCR consisted of a 10 min heat activation step (95° C.) followed by 40 cycles of 15s at 92° C. and 1 min at 60° C. using the XP Thermal Cycler, Block model XP-G (BIOER Technology CO., LTD, Tokyo, Japan). End-point fluorescence using a 7900 HT Fast Real-Time PCR System and the SDS Software version 2.3 (Applied Biosystems, Foster City, Calif., USA) was used to determine the genotypes of each polymorphism.
In addition, high resolution melting (HRM) analysis was performed for rs16399 (ATCT/-) by the Central Analytical Facility (University of Stellenbosch, Stellenbosch, South Africa). A designed primer set (FWD: 5′ CAC TTC TCT CTT GTG GCT C 3′, REV: 5′ CAG TGC GCC TTC AAG GAG AC 3′) was used for that purpose. DNA template was quantified using The NanoDrop ND1000 (NanoDrop Technologies, Wilmington, Del., USA) and normalized to 5 ng/μl. Reactions were set up in an ABI Fast 96-well optical plate (Applied Biosystems, Foster City, Calif., USA) using the following reaction: 1×ABI MeltDoctor HRM Master Mix (Applied Biosystems, Foster City, Calif., USA), 6 pmol of each primer, 20 ng of DNA template with a final volume of 20 μl. The HRM-PCR was performed in the StepOne Real-time PCR System (Applied Biosystems, Foster City, Calif., USA) with the following cycling and melting conditions: An activation step at 95° C. for 10 mins followed by 40 cycles with a denaturing step at 95° C. for 15 sec and annealing step at 60° C. for 1 min. This was followed by a melt curve comprising the sequential steps: a denaturing step at 95° C. for 10 sec, an annealing step at 60° C. for 1 min, a HRM step at 95° C. for 15 sec (ramping rate of 1%) ending with an annealing step at 60° C. for 15 sec. Sequenced controls representative of each genotype were included in each experiment. Data collection and primary analysis including amplification plots were performed with StepOne Software Version 2.2.1 (Applied Biosystems, Foster City, Calif., USA). The high-resolution melt analysis was performed using the High Resolution Melt Software Version 3.0.1 (Applied Biosystems, Foster City, Calif., USA). Variants were called automatically and the pre-melt region was between 70.9° C. and 71.3° C. while the post-melt region was between 78.0° C. and 78.3° C. Aligned melt curves and difference plots are generated as well as silhouette scores for each sample. Any samples with low amplification or with outlier melt profiles were removed from the HRM analysis.
The participants (143 TEN and 312 CON) were genotyped for the G>C SNP (rs4919510) present in the MIR608 geneusing a custom designed Fluorescence-based Taqman® polymerase chain reaction (PCR) assay (Applied Biosystems, Foster City, Calif., USA) as described above. The mature Hsa-miR-608 has the following sequence: 5′-AGGGGTGGTGTTGGGACAGCT SCG T-3′, where S is a C or G.
mRNA Secondary Structure and Binding Energy
All secondary structures of the wild-type and mutated C and T functional forms of the COL5A1 3′-UTR were generated using the Sfold online RNA folding tool (available at http://sfold.wadsworth.org) (Ding, et al. Nucleic acids research 2003, 31(24), 7280-7301; Ding et al. RNA 2005 (New York, N.Y.), 11(8), 1157-1166). The Sfold RNA folding algorithm generates RNA secondary structures using a statistical sample from the Boltzmann ensemble of secondary structures. All structures are folded at 37° C. and 1M NaCl in the absence of divalent ions.
The predicted change in Gibb's Free Energy in the Hsa-miR-608:COL5A1mRNA complex was predicted using the miRanda algorithm (v3.0) (http://cbio.mskcc.org/microrna_data/miRanda-aug2010.tar.gz-) (Enright et al. Genome Biology, 2003 5(1), R1. doi:10.1186/gb-2003-5-1-r1).
Data were analysed using STATISTICA Version 10.0(StatSoft, Tulsa, Okla., USA) and GraphPad Prism version 5.0d for Mac OS X (GraphPad Software, San Diego, Calif., USA, www.graphpad.com) programs. A one-way analysis of variance was used to determine any significant differences between the characteristics of the TEN and CON groups within the AUS and SA cohorts. A Chi2-analysis or Fisher's exact test was used to analyse any differences in the genotype frequencies and other categorical data between the groups. Significance was accepted when P<0.05 and P<0.025 when combined gene-gene interactions or effects were analysed. Combined genotype frequencies were analyzed using the Monte Carlo test (CLUMP program, version 2.0) (Sham et al. Ann Hum Genet. 1995; 59:97-105). Hardy-Weinberg equilibrium (HWE) was established using the program Genepop web version 3.4 (http://genepop.curtin.edu.au/). Linkage disequilibrium (LD) was calculated using CubeX: cubic exact solution (www.oege.org/software/cubex/) (Gaunt et al BMC bioinformatics 2007, 8, 428).
The SA and AUS TEN and CON groups used in study have been previously described in detail (September et al. Br J Sports Med 2009; 43:357-365). In summary, there were significantly more males within the combined SA and AUS TEN (73.0%, N=159) groups when compared to the combined CON groups (50.6%, N=340, P<0.001). The combined TEN and CON groups were matched for age (TEN age of initial injury 39.8±14.5 years, N=153vs CON age at recruitment 37.7±11.7 years, N=331; P=0.091) and height (TEN 176±9 cm, N=147vs CON 172±13 years, N=335; adjusted for sex P=0.960). The combined TEN groups were significantly heavier (TEN 79.6±14.1 kg, N=153vs CON 72.5±13.4 kg, N=339; adjusted for sex P=0.003; adjusted for sex and recruitment age P=0.039) and had an increased body mass index (BMI) (TEN 25.7±3.8 kg·m−2, N=147vs CON 24.2±3.6 kg·m−2, N=330; adjusted for sex P=0.002; adjusted for sex and recruitment age P=0.152) when compared to the combined CON groups. The combined TEN groups were however recruited on average 8.6±9.6 years (N=153) after their initial injury.
With the exception of significant differences between BMI and the three rs16399 (ATCT/-) genotype groups, there were no significant genetic interactions with any of the physiological variables (age, height, weight and sex) for any of the COL5A1 3′-UTR variants (results not shown). Participants with a 2/2 ATCT genotype (BMI 27.0±6.1 kg·m−2) were significant larger than those with a 2/1 ATCT (BMI 24.0±3.2 kg·m−2, P value adjusted for age at recruitment and sex=0.001) or a 1/1 ATCT (BMI 24.6±3.4 kg·m−2, P value adjusted for age at recruitment and sex=0.009) genotype.
The genotype distributions of rs71746744, rs16399 and rs1134170 were similar within the SA and AUS cohorts (
MIR608 Genotype Frequencies and Interactions with its COL5A1 3′-UTR Binding Site
There were no significant genetic interactions with any of the physiological variables for the rs4919510 polymorphism within the MIR608 gene (results not shown). The genotype distributions of rs4919510 were similar within the SA and AUS cohorts (
The Hsa-miR-608 binding site within the COL5A1 3′-UTR is polymorphic (September et al. Br J Sports Med 2009; 43:357-365). We investigated the combined genotype frequencies M/R608SNP rs4919510 and SNP rs3196378 (C/A, AciI RFLP) within the miRNA binding site. In addition, the A allele of rs3196378 was identified within the T functional form of the COL5A1 3′-UTR which was predominately cloned from TEN subjects (Laguette et al. Matrix biology: journal of the International Society for Matrix Biology, 30(5-6), 338-345. doi:10.1016/j.matbio.2011.05.001). The combined genotype distributions of rs4919510 and rs3196378 were similar within the SA and AUS cohorts (
The most favourable binding energy was calculated to be between the C allele of the mature Hsa-miR-608 and the C allele of its COL5A1 binding site (−24.5 kcal/mol). The least favourable calculated binding energy was between the G allele of Hsa-miR-608 and either variants (C or A) of its binding site (−22.2 kcal/mol). The binding energy between the C allele of Hsa-miR-608 and the A allele of its COL5A1 binding site was calculated to be −23.5 kcal/mol.
The CC MIR608 rs4919510, 2/2 AGGG COL5A1 rs71746744, 1/1 ATCTCOL5A1 rs16399 and TT COL5A1 rs1134170 genotypes were all independently associated with increased risk of chronic Achilles tendinopathy (
The 2/2 AGGG rs71746744, 1/1 ATCT rs16399 and TT rs1134170 COL5A1 genotypes were significantly over-represented (P=0.006; odds ratio=2.3; 95% confidence interval 1.3 to 4.3) within the combined TEN participants (60.0%, N=36 of 60) when compared to the combined CON participants (39.4%, N=61 of 155). In contrast, participants with none of the three at risk COL5A1 genotypes were significantly over-represented (P=0.002; odds ratio=2.7; 95% confidence interval 1.4 to 5.0) within the CON participants (55.5%, N=86 of 155) when compared to the TEN participants (31.7%, N=19 of 60). When the CC MIR608 rs4919510 at risk genotype was included in the analyses the participants with all four risk genotypes (genotype score of 8) were significantly over-represented within the TEN participants (P=0.004; odds ratio=2.6; 95% confidence interval 1.3 to 4.9), while those with none of the four risk genotypes (genotype score of 0) was significantly under-represented within the TEN participants (P=0.019; odds ratio=3.1; 95% confidence interval 1.2 to 8.5) (
There were structural differences in the most stable C and T functional forms of the COL5A1 3′-UTR (
To date, only seven polymorphic sites have been identified within a 2.5 kb region of the COL5A1 3′-UTR to influence the predicted secondary structures of the C and T functional forms. In attempted to identify which of the seven variants were responsible for determining the “gross” structural differences between the two functional forms, the secondary structures of the COL5A1 3′-UTR were determined after in silico sit-directed mutagenesis. The structure of region A (
The first main finding of this study was that three additional sequence variants, rs71746744 (AGGG/-), rs16399 (-/ATCT) and rs1134170 (T/A), downstream from the previously associated BstUI RFLP (rs12722) within the COL5A1 3′-UTR was associated with chronic Achilles tendinopathy (refer to
Although the miRNA binding site within this region is polymorphic we have previously shown that SNP rs11103544 (MboII RFLP) was not associated with chronic Achilles tendinopathy (September et al. Br J Sports Med 2009; 43:357-365). In addition this SNP was not one of the major sequence variants that differentiated between the C- and T-functional forms of the COL5A1 3′-UTR (Laguette et al. Matrix biology: journal of the International Society for Matrix Biology, 2011:30(5-6), 338-345. doi:10.1016/j.matbio.2011.05.001). Furthermore, investigations have concluded that SNP rs11103544 did not interact with the AGGG VNTR to modify the association with chronic Achilles tendinopathy (
The second main finding of this study was that the polymorphic MIR608 gene (SNP rs4919510) was also associated with chronic Achilles tendinopathy. The CC genotype of this variant was significantly over-represented within the Tendiopathic participants. The MIR608 gene encodes for miRNA, Hsa-miR-608, which binds to a functional polymorphic cis-acting element within the COL5A1 3′-UTR (September et al. Br J Sports Med 2009; 43:357-365; Laguette et al. Matrix biology: journal of the International Society for Matrix Biology, 2011; 30(5-6), 338-345. doi:10.1016/j.matbio.2011.05.001). Since the A allele of rs3196378 within the Hsa-miR-608 binding site was identified within the T functional form of the COL5A1 3′-UTR which was predominately cloned from TEN participants (Laguette et al. Matrix biology: journal of the International Society for Matrix Biology, 2011:30(5-6), 338-345. doi:10.1016/j.matbio.2011.05.001), we investigated the combined genotype frequencies MIR608 SNP rs4919510 and SNP rs3196378 (C/A, AciI RFLP) within the miRNA binding site. Although the M/R608CC and COL5A1 rs3196378 AA genotypes distributions were similar between the AUS TEN and AUS CON groups, the combined MIR608 CC genotype and COL5A1 rs3196378 C allele (CA and AA genotypes) were significantly over-represented in all the TEN participants when compared to all the CON participants.
The binding energy between the C allele of the mature miRNA and the A allele of its binding site was calculated to be the second most favourable. The most favourable was between the C alleles of both the Hsa-miR-608 and its binding sites. These calculations are calculated in silico and do not necessarily mimic the in vivo situation. The C form of Hsa-miR-608 bound the A rather than the C nucleotide of the SNP with higher affinity resulting in a corresponding decreased mRNA stability of the T allele.
The CC genotype of SNP rs12722 (BstUI RFLP), previously associated with chronic Achilles tendinopathy (Mokone et al. Scand J Med Sci Sports 2006; 16:19-26, September et al. Br J Sports Med 2009; 43:357-365), was over-represented in the participants with the genotypes of variants rs71746744 (2/1 AGGG and 1/1 AGGG), rs16399 (2/1 ATCT and 2/2 ATCT) and rs1134170 (TA and AA) not associated with Achilles tendinoapthy (
The third main finding of this study was the clear structural differences in the most stable C and T functional forms of the COL5A1 3′-UTR (refer to
The data presented herein demonstrate that the MIR608 polymorphism investigated in this study interacts with the COL5A1 polymorphisms described herein in modifying the risk of AT. Although AT is likely to be a complex condition involving a number of gene-gene and gene-environment interactions (September et al. Br J Sports Med. 2007; 41:241-246), there have, to the Applicant's knowledge, been no such reports of a gene-gene interaction that relates to increased risk of AT. Furthermore the CASP8 polymorphisms described herein were unexpectedly found to increase risk of AT.
The invention relates to the association of the interactions of (i) rs16399 (ATCT/-) VNTR within the COL5A1 3′-untranslated region and the rs143383 (T/C) SNP within GDF5 (
In one embodiment, the invention provides a method of determining in a subject a predisposition to, or increased risk for, developing a tendon, ligament, or other soft tissue injury or pathology related to other exercise related phenotypes, but not limited to, including ROM, endurance running performance and EAMC, the method comprising the step of screening the subject for the presence of at least one polymorphism in the MIR608 gene which encodes a miRNA which binds to a recognition sequence within the 3′-UTR of the collagen V gene COL5A1; and at least one polymorphism the collagen V gene COL5A1, which polymorphism is a polymorphism which results in a modified, augmented, or mitigated interaction with one or more other genes selected from the group, when compared to a wild-type interaction and wherein the presence of the polymorphism is indicative of a predisposition to, or increased risk for, developing a musculoskeletal soft tissue injury in the subject.
In a preferred embodiment, the polymorphism of the COL5A1 gene is selected from the group including rs71746744 (-/AGGG), rs16399 (ATCT/-) and rs1134170 (A/T) within the 3′-untranslated region (UTR) of the alpha 1 chain of the COL5A1 gene; and the polymorphism of the MIR608 gene is rs4919510 (C/G).
In another embodiment, the invention provides a method of determining in a subject a predisposition to, or increased risk for, developing a tendon, ligament, or other soft tissue injury or pathology, the method comprising the step of screening the subject for the presence of at least one polymorphism in the CASP8 gene, which polymorphism is a polymorphism which results in a modified, augmented, or mitigated interaction with one or more other genes selected from the group, when compared to a wild-type interaction and wherein the presence of the polymorphism is indicative of a predisposition to, or increased risk for, developing a musculoskeletal soft tissue injury in the subject.
The tendon, ligament, or other soft tissue injury or pathology may be a pathology related to other exercise related phenotypes, such as ROM, endurance running performance and EAMC.
In a preferred embodiment, the polymorphism of the CASP8 gene may be rs1045485 (G/C, D302H) and rs3834129 (CTTACT/del).
In a further embodiment of the invention, there is provided a DNA-based polymorphic marker molecular marker for use in diagnosing a predisposition to, or increased risk for, developing tendon, ligament, or other soft tissue pathology or injury in a subject, the molecular marker comprising at least one isolated nucleic acid fragment derived from a COL5A1 gene, flanking sequences thereof, cis-regions associated therewith, 5′UTR regions, 3′UTR regions thereof, sequences complementary thereto, sequences which can hybridize under strict hybridization conditions thereto, and functional discriminatory truncations thereof and at least one isolated nucleic acid fragment derived from a MIR608 gene, flanking sequences thereof, cis-regions associated therewith, 5′UTR regions, 3′UTR regions thereof, sequences complementary thereto, sequences which can hybridize under strict hybridization conditions thereto, and functional discriminatory truncations thereof.
More particularly, the molecular marker is as polymorphism selected from the group comprising rs71746744, rs16399 and rs1134170 within the 3′-untranslated region (UTR) of the alpha 1 chain of the COL5A1 gene and rs4919510 of the MIR608 gene.
In another embodiment of the invention, there is provided a DNA-based polymorphic marker molecular marker for use in diagnosing a predisposition to, or increased risk for, developing tendon, ligament, or other soft tissue pathology or injury in a subject, the molecular marker comprising at least one isolated nucleic acid fragment derived from a CASP8 gene, flanking sequences thereof, cis-regions associated therewith, 5′UTR regions, 3′UTR regions thereof, sequences complementary thereto, sequences which can hybridize under strict hybridization conditions thereto, and functional discriminatory truncations thereof.
The tendon, ligament, or other soft tissue injury or pathology may be a pathology related to other exercise related phenotypes, such as ROM, endurance running performance and EAMC.
In one embodiment, the molecular marker is a polymorphic marker, preferably a polymorphism including SNP rs1045485 and rs3834129 of the CASP8 gene.
Number | Date | Country | Kind |
---|---|---|---|
2012/00047 | Jan 2012 | ZA | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2013/050083 | 1/4/2013 | WO | 00 |