The present invention relates to methods of detecting and/or quantifying a RNA, particularly a tRNA, using padlock probes comprising terminal regions complementary to said RNA. The present invention also relates to methods of detecting, diagnosing and/or assessing the clinical severity of an RNA-associated disease.
RNA and RNA-protein complexes (RNAPs) are interesting targets for molecular diagnostics. However, it can be technically difficult to specifically quantify an RNA molecule by the commonly used reverse-transcription polymerase chain reaction (RT-PCR) or RNA sequencing (RNAseq) techniques. For instance, the reverse-transcription of the RNA to complementary DNA (cDNA) or the subsequent PCR amplification of the cDNA can be difficult if the RNA (or the cDNA) exhibits stable structural features that prevent specific binding of oligonucleotide primers to the region of interest, or prevent progression of the reverse transcriptase or the DNA polymerase. Chemical modifications can hinder RT-PCR, and the proteins of RNAPs can induce structures in the nucleic acids or stabilize pre-existing structure, and thus prevent processive action of the essential enzymes.
Due to these difficulties, RNAs and RNAPs are sometimes treated as a secondary choice in molecular diagnostics; if possible the DNA molecule is usually preferred. Moreover, DNA is more stable than RNA and reverse-transcription is not required if DNA is used as molecular target. Due to the RNA fragility, RNA samples extracted from medical samples often comes in the form of short fragments, exhibiting a distribution of short sizes, which makes analysis difficult. Short sizes are particularly common in urine, which otherwise is an attractive tissue for diagnostics since it is obtained in a fully non-invasive manner.
One type of RNA, transfer RNA (tRNA) has received increasing attention recently (S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015)). A method to specifically quantify tRNAs will have applications in research and development in biology, biotechnology and medicine, and in molecular diagnostics. However, tRNA is particularly short, strongly structured and contains post-transcriptional modifications.
Human mitochondrial tRNA (mt-tRNA) is a type tRNA of particular interest. Many mutations in mitochondrial DNA transcribe into mutations in mt-tRNA and are linked to defects in oxidative energy metabolism. It is now emerging that these mutations are also linked to other complex traits, including neurodegenerative diseases, ageing and cancer. MELAS (Mitochondrial Encephalopathy, Lactic Acidosis and Stroke-like episodes) and MERRF (Myoclonic Epilepsy with Ragged Red Fibers) are two classic diseases associated with mitochondrial tRNA mutations in mt-tRNA Leu(UUR) and mt-tRNA Lys, respectively.
MELAS syndrome is a mitochondrial cytopathy associated with mutations of mitochondrial DNA. It can be caused by several mutations, but by far the most frequent one is m.3243A>G, located on the MT-TL1 gene coding for leucine(UUR) mt-tRNA. This mutation was found with a prevalence of 0.14% in the general population of North Cumbria (England); interval of 4-36% with 95% confidence (H. R. Elliott et al, Am. J. Hum. Genet. 83, 254-260 (2008)). It is a heteroplasmic mutation, i.e. mutated and non-mutated mtDNA molecules coexist in the cell. Diseases caused by m.3243A>G represent one of the most frequent groups of OXPHOS deficiencies.
MERRF syndrome is a mitochondrial encephalomyopathy characterized by myoclonic seizures. Four point mutations in the genome can be identified that are associated with MERRF: m.A8344G, m.T8356C, m.G8361A, and m.G8363A. The point mutation m.A8344G is associated with MERRF in 80% of patients (DiMauro, S. and Hirano, M., 2015. Merrf. In GeneReviews. University of Washington, Seattle). This point mutation disrupts the mitochondrial gene for lysine mt-tRNA, which disrupts the synthesis of proteins Thus, there is a demand for methods of detecting and quantifying a RNA, particularly a tRNA.
The present inventors have developed a method of detecting and/or quantifying a RNA, particularly a tRNA, using padlock probes comprising terminal regions complementary to said RNA. The present inventors have surprisingly found that the method can be used to reliably detect and quantify the percentage of mutant tRNAs, even though the tRNA is short, strongly structured and exhibits some chemical modifications.
The inventors have also developed a method of detecting, diagnosing and/or assessing the clinical severity of a RNA-associated disease. In particular, the present inventors have surprisingly found that the mutation load of the mt-tRNA can differ significantly from the mt-DNA mutation load. Thus, mt-tRNA mutation load may correlate with the clinical severity of a mitochondrial disease and be used as a prognostic tool and/or as an indicator for the appropriate medical treatment.
In one aspect, the present invention provides a padlock probe for detecting and/or quantifying a RNA. Suitably, the padlock probe comprises terminal regions complementary to the RNA. Suitably, each terminal region is complementary to a region of the RNA.
In some embodiments the RNA is a tRNA, suitably a mitochondrial tRNA (mt-tRNA). Suitable mt-tRNAs include leucine(UUR) mt-tRNA, a lysine mt-tRNA, a methionine mt-tRNA, a tryptophan mt-tRNA, a aspartate mt-tRNA, an isoleucine mt-tRNA, a glycine mt-tRNA, an arginine mt-tRNA, a histidine mt-tRNA, a serine(AGY) mt-tRNA, a leucine(CUN) mt-tRNA, a threonine mt-tRNA, a phenylalanine mt-tRNA, a valine mt-tRNA, a glutamine mt-tRNA, an alanine mt-tRNA, an asparagine mt-tRNA, a cysteine mt-tRNA, a tyrosine mt-tRNA, a serine(UCN) mt-tRNA, a glutamate mt-tRNA, or a proline mt-tRNA. Optionally, the mt-tRNA is a leucine(UUR) mt-tRNA, a lysine mt-tRNA, a histidine mt-tRNA, a leucine(CUN) mt-tRNA, a phenylalanine mt-tRNA, a valine mt-tRNA, a glutamine mt-tRNA, a serine(UCN) mt-tRNA, or a proline mt-tRNA. Optionally, the mt-tRNA is a leucine(UUR) mt-tRNA or a lysine mt-tRNA, optionally wherein the mt-tRNA is a leucine(UUR) mt-tRNA. In some embodiments, the tRNA comprises a 3′ ligated oligonucleotide.
The regions of the RNA to which the terminal regions are complementary may each be 5-30 nucleotides in length, optionally 5-15 nucleotides in length. The terminal regions may be each 5-30 nucleotides in length, optionally 5-15 nucleotides in length. The terminal regions may be in total are 15-50 nucleotides in length, optionally 20-30 nucleotides in length. The terminal regions may be complementary to adjacent regions or non-adjacent regions, optionally wherein the non-adjacent regions are separated by 6 or fewer nucleotides, 5 or fewer nucleotides, 4 or fewer nucleotides, 3 or fewer nucleotides, 2 or fewer nucleotides, or one nucleotide.
The padlock probe may be 50-200 nucleotides in length, 50-150 nucleotides in length, 50-100 nucleotides in length, or 70-100 nucleotides in length. The padlock probe may comprises one or more primer binding sites, optionally wherein the padlock probe comprises a forward primer site and a reverse primer site. The forward primer site may be complementary to a forward primer. The complement of the reverse primer site may be complementary to a reverse primer. Suitably, the padlock probe comprises from 5′ to 3′: a first terminal region; a first primer binding site; a second primer binding site; and a second terminal region, optionally wherein the first primer binding site is a forward primer site and the second primer site is a reverse primer site, or vice versa. The primer binding sites may be 10-30 nucleotides in length. Optionally, the padlock probe comprises a tag sequence.
The padlock probe may be capable of detecting and/or quantifying a pathogenic mutation.
Suitably, a terminal region is complementary to a region of the RNA that is associated with a pathogenic mutation, optionally wherein the pathogenic mutation is a pathogenic single-nucleotide variant. In some embodiments, (i) the RNA is a leucine(UUR) mt-tRNA and the pathogenic mutation is A3243G or T3271C; or (ii) the RNA is a lysine mt-tRNA and the pathogenic mutation is A8344G.
Suitably, a terminal region is complementary to a region of the RNA which comprises a pathogenic mutation site, optionally wherein the pathogenic mutation is a pathogenic single-nucleotide variant. In some embodiments, (i) the RNA is a leucine(UUR) mt-tRNA and the pathogenic mutation site is m.3243 or m.3271; or (ii) the RNA is a lysine mt-tRNA and the pathogenic mutation site is m.8344.
Suitably, a terminal region is complementary to a region of the RNA that comprises a pathogenic mutation, optionally wherein the pathogenic mutation is a pathogenic single-nucleotide variant. In some embodiments, (i) the RNA is a leucine(UUR) mt-tRNA and the pathogenic mutation is A3243G or T3271C; or (ii) the RNA is a lysine mt-tRNA and the pathogenic mutation is A8344G.
In some embodiments, a terminal region is complementary to a region of a nucleotide sequence which comprises or consists of:
In some embodiments, a terminal region comprises or consists of the nucleotide sequences TTACCGGGCC or TTACCGGGCT (SEQ ID NOs: 56 and 58), or fragments thereof.
In some embodiments, the terminal regions comprise or consist of the nucleotide sequences CTGCCATCTTAAC and TTACCGGGCC (SEQ ID NOs: 55 and 56) or CTGCCATCTTAAC and TTACCGGGCT (SEQ ID NOs: 57 and 58), or fragments thereof.
In some embodiments, the padlock probe comprises or consists of a nucleotide sequence with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to
In some embodiments the padlock probe comprises or consists of:
The padlock probe may be capable of detecting and/or quantifying a modified nucleotide.
Suitably, a terminal region is complementary to a region which is associated with a modified nucleotide. Aberrant modification of the modified nucleotide may be pathological, optionally wherein the aberrant nucleotide modification is the absence of the nucleotide modification. In some embodiments, (i) the RNA is a leucine(UUR) mt-tRNA and the modified nucleotide is τm5U or (ii) the RNA is a lysine mt-tRNA and the modified nucleotide is τm5s2U.
Suitably, a terminal region is complementary to a region which comprises an aberrantly modified nucleotide, optionally wherein the aberrant nucleotide modification is the absence of the nucleotide modification and/or is pathological. In some embodiments, (i) the RNA is a leucine(UUR) mt-tRNA and the aberrantly modified nucleotide is the absence of τm5U or (ii) the RNA is a lysine mt-tRNA and the aberrantly modified nucleotide is the absence of τm5s2U.
In some embodiments, a terminal region is complementary to a region of a nucleotide sequence which comprises or consists of fragment of the nucleotide sequence
The modified nucleotide may be selected from: N2,N2-dimethyl guanosine (m22G), 5-methylcytosine (m5C), 7-methylguanosine (m7G), 5-methoxycarbonylmethyl-2-thiouridine (mcm5s2U), 5-methyl uridine (m5U), 1-methylguanosine (m1G), 5-methoxycarbonylmethyluridine (mcm5U), 2-methylthio-N6-threonyl carbamoyladenosine (ms2t6A), 5-taurinomethyluridine (τm5U), 5-taurinomethyl-2-thiouridine (τm5s2U), and 2-thiouridine (s2U). Optionally, the modified nucleotide is 5-taurinomethyluridine (τm5U) or 5-taurinomethyl-2-thiouridine (τm5s2U).
The padlock probe may be capable of detecting and/or quantifying amino acid charging of a tRNA. Suitably, the RNA is a tRNA comprising a 3′ ligated oligonucleotide and a terminal region is complementary to a region of the tRNA that comprises at least part of the 3′ ligated oligonucleotide.
In another aspect, the present invention also provides a kit or composition comprising one or more padlock probes according to the present invention.
In some embodiments, the one or more padlock probes comprise or consist of:
In some embodiments, the kit or composition further comprises:
Optionally, the kit or composition further comprises a fifth padlock probe comprising terminal regions complementary to a reference nuclear tRNA, optionally a nuclear methionine tRNA and/or the kit or composition further comprises a sixth padlock probe comprising terminal regions complementary to a reference mt-tRNA, optionally a methionine mt-tRNA.
The kit or composition may comprise any combination of the first, second, third, fourth, fifth, and sixth padlock probes.
The kit or composition may further comprise one or more primers, optionally one forward primer and one reverse primer. The one or more primers may be complementary to one or more primer binding sites on the one or more padlock probes. The kit or composition may further comprise one or more capture probes. The kit or composition may further comprise:
The kit or composition may further comprise:
In another aspect, the present invention provides a method of detecting a RNA using one or more padlock probes according to the present invention.
The method may comprise:
The sample may be an RNA sample. Suitably, the sample is obtained or obtainable from urine, muscle and/or blood. Optionally, the method may further comprise a step of extracting, purifying and/or isolating the sample from urine, muscle and/or blood.
Step (c) may be performed using a DNA ligase, optionally a Splint R ligase. Step (d) may be performed by magnetic bead-based purification and/or exonuclease digestion of non-circularised padlock probes. The amplification in step (e) may be carried out by rolling circle amplification (RCA), optionally hyperbranched RCA (HRCA). Step (f) may be performed by detecting an increase in fluorescence, optionally the increase in fluorescence is detected in real-time. The method may further comprise a step (g) of quantifying the number of RNAs, optionally wherein the number of RNAs is quantified relative to a reference sample.
In another aspect, the present invention provides a method of detecting, diagnosing and/or assessing the clinical severity of a RNA-associated disease by determining the RNA mutation load.
The method may comprise:
Suitably, the RNA-associated disease is a tRNA-associated disease and the RNAs are tRNAs, optionally wherein the RNA-associated disease is a mt-tRNA-associated disease and the RNAs are mt-tRNAs. Exemplary mt-tRNA-associated diseases are mitochondrial encephalopathy, lactic acidosis, and stroke-like episodes (MELAS) syndrome or myoclonic epilepsy with ragged red fibers syndrome (MERRF) syndrome.
In some embodiments, the concentration of mutant tRNAs is determined, and wherein, optionally: (i) the tRNAs are leucine(UUR) mt-tRNAs and the mutation is m.3243A>G; or (ii) the tRNAs are lysine mt-tRNAs and the mutation is m.8344A>G.
In some embodiments, the concentration of aberrantly-modified tRNAs is determined, and wherein, optionally: (i) the tRNAs are leucine(UUR) mt-tRNAs and the modified nucleotide is τm5U or (ii) the tRNAs are lysine mt-tRNAs and the modified nucleotide is τm5s2U.
Optionally, the method further comprises:
Suitably, the reference RNA is a nuclear tRNA, optionally a nuclear methionine tRNA, or a mt-tRNA, optionally a methionine mt-tRNA.
One or more of cwt, cmut, and cref may be determined by a method of the invention.
Optionally, the method further comprises:
In another aspect, the present invention provides for use of a padlock probe in detecting and/or quantifying a mutant RNA. In a related aspect, the present invention provides for use of a padlock probe in detecting and/or quantifying an aberrantly modified RNA. In a related aspect, the present invention provides for use of a padlock probe in detecting and/or quantifying amino acid charging of a tRNA.
In another aspect, the present invention provides for use of a padlock probe in detecting, diagnosing and/or assessing the clinical severity of a RNA-associated disease.
Suitably, the RNA-associated disease is a tRNA-associated disease and the RNAs are tRNAs, optionally wherein the RNA-associated disease is a mt-tRNA-associated disease and the RNAs are mt-tRNAs. Exemplary mt-tRNA-associated diseases are mitochondrial encephalopathy, lactic acidosis, and stroke-like episodes (MELAS) syndrome or myoclonic epilepsy with ragged red fibers syndrome (MERRF) syndrome.
In another aspect, the present invention provides for use of a padlock probe for detecting and/or quantifying tRNA amino-acid charging.
The signals are measured during enzymatic amplification. Signal intensity increases with time. RNA concentration is smallest for the measurement shown as (*), increased by a factor of 10 for the measurement shown as (E) and by a further factor of 10 for the measurement shown as (o). The fluorescence signals are small at small time, then increase and finally saturate. The increase occurs earlier for higher RNA concentration. The time at which the relative signal passes a predetermined threshold is used to quantify the RNA concentration.
For (A) the wt DNA case, (B) the mut DNA case, (C) the wt RNA case, (D) the mut RNA case.
(A, B) The vertical axis (linear scale) shows the characteristic time ct in minutes, while the horizontal axis (logarithmic scale) shows the amount (copy number) of the synthetic wt or mut DNA introduced before the ligation. Three technical replica were done for each amount.
(C, D) The vertical axis shows the characteristic time ct in minutes, while the horizontal axis shows the amount (copy number) of the synthetic wt or mut RNA introduced before the ligation.
Lane 2: WT Leu UUR mt-tRNA, Lane 3: Mut Leu UUR mt-tRNA. Lane 1 and 4: 50 bp DNA ladder (New England Biolabs).
Various preferred features and embodiments of the present invention will now be described by way of non-limiting examples.
It must be noted that as used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural referents unless the context clearly dictates otherwise.
The terms “comprising”, “comprises” and “comprised of” as used herein are synonymous with “including”, “includes” or “containing”, “contains”, and are inclusive or open-ended and do not exclude additional, non-recited members, elements or method steps. The terms “comprising”, “comprises” and “comprised of” also include the term “consisting of”.
Numeric ranges are inclusive of the numbers defining the range. Unless otherwise indicated, any nucleic acid sequences are written left to right in 5′ to 3′ orientation and amino acid sequences are written left to right in amino to carboxy orientation, respectively.
This disclosure is not limited by the exemplary methods and materials disclosed herein, and any methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments of this disclosure.
The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that such publications constitute prior art to the claims appended hereto.
Ribonucleic Acid (RNA)
The present invention relates to methods of detecting and/or quantifying RNA.
Ribonucleic acid (RNA) is assembled as a chain of nucleotides. The RNA may be any type of RNA. Suitably, the RNA is transfer RNA (tRNA), ribosomal RNA (rRNA), messenger RNA (RNA), and/or regulatory RNA.
Transfer RNA (tRNA)
In preferred embodiments, the present invention relates to methods of detecting and/or quantifying a transfer RNA (tRNA).
A transfer RNA (abbreviated tRNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length, that delivers amino acids to the ribosome to translate the genetic information in an mRNA template-directed manner into a corresponding polypeptide chain (S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015)).
tRNA molecules are synthesized as precursors that are then processed by a sequence of maturation events. These maturation events include removal of the 5′ leader, trimming of the 3′ trailer, splicing of introns, addition of the 3′-terminal CCA residues by a CCA-adding enzyme and covalent modification of multiple nucleoside residues (S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015)).
Once charged with its cognate amino acid and in a complex with the elongation factor, each tRNA reaches the A-site of the ribosome and base-pairs its anticodon with the corresponding mRNA codon. Thus, to fit the same ribosomal site, the architecture of all tRNAs conform to a narrow set of structural parameters constrained by common identity rules and structural features. tRNAs are short, strongly structured and exhibit some chemical modifications, making detection challenging.
The tRNA may be any tRNA. Suitably, the tRNA may be a naturally-occurring tRNA. Suitably, the tRNA may be a human tRNA.
A method to specifically quantify tRNAs will have applications in research and development in biology, biotechnology and medicine, and in molecular diagnostics.
tRNAs vary significantly in their concentrations and codons pairing to low-abundance tRNAs are read slower than codons pairing to major tRNAs. The ribosomal speed is precisely regulated along the mRNA through selection of slow-translating codon clusters at specific positions which locally delay the synthesis rate and provide a time window for proper co-translational folding of the peptide chain. Consequently, silent mutations through an exchange of a slow codon for a fast and vice versa alter the local speed of translation and thus can also introduce aberrancies in protein folding.
Multicellular organisms carry over 500 different tRNA genes which differ in the codon they read (isoacceptors) or in the sequence of their body (isodecoders). Few examples suggest that tRNAs vary among tissues in the human organism, which may explain why some organs (including muscles, neuronal system) seem to be affected more than others. Furthermore, among all RNA species, tRNAs undergo by far the most and chemically diverse post-transcriptional modifications which control tRNA stability and modulate translation fidelity. Hypomodifications or a lack of modifications are also linked to complex human pathologies, including diabetes type-2 and various forms of mitochondrial encephalomyopathies. Despite its importance for deep understanding of the molecular pathology of many diseases, information on the variations in tRNA expression, abundance and modification pattern among different tissues (or organisms) is largely missing.
All tRNAs fit the same ribosomal site and their architecture conform to a narrow set of structural parameters constrained by common identity rules, i.e. narrow length range (73-90 nt) with large conservation. This is an inherent obstacle for tRNA identification. High-throughput RNA sequencing can be only used in part, because post-transcriptional modifications interfere with the PCR-based cDNA synthesis and change nucleotide identities. tRNA-based microarray technology is an alternative methodology, but is limited to only relative concentrations and can only distinguish tRNAs with more than 8 nt difference. The majority of the tRNA species, however, have less than 8 nt variations in the whole sequence. Currently, a technique for tRNA identification and quantification does not exist. With knowledge on the composition and concentration of the tRNAome of each species, cell or tissue, we may move to a more accurate understanding of the molecular basis of silent mutations-based pathologies and tissue-specific aspects that modulate disease severity and progression.
In addition, tRNA sets largely differ among organisms, an obstacle for overexpression of proteins with therapeutic or industrial applications. Thus, knowledge of the tRNAome of a parental strain, donating the protein to be overexpressed, and of the expression host will allow synchronization of translation patterns between organisms. Specifically, using silent mutations the original translation pattern within the parental strain can be adapted to the tRNAome of the expression host and consequently will increase yield of a soluble and active heterologous protein.
Mitochondrial-Encoded tRNAs (Mt-tRNAs)
In preferred embodiments of the present invention, the RNA is a mitochondrial-encoded tRNA (mt-tRNA).
The mt-tRNA may be any mt-tRNA. Suitably, the mt-tRNA may be a naturally-occurring and/or human mt-tRNA.
Mitochondria are organelles found in most eukaryotic cells. Each cell contains hundreds to thousands of mitochondria and each contains several copies of the mitochondrial DNA (mtDNA) genome. Human mtDNA is a closed circular double-stranded DNA with 16,569 base pairs encoding 37 genes: 13 for the essential subunits of respiratory complexes I, III, IV, and V; 22 for tRNAs (mt-tRNAs); and two for rRNAs (mt-rRNAs). Therefore, all the RNA components necessary for mitochondrial translation are supplied in mitochondria, whereas all protein components, including ribosomal proteins, translational factors, aminoacyl-tRNA synthetases, and various factors required for the biogenesis of tRNAs and rRNAs, are encoded in the nucleus and transported into the mitochondria subsequent to their synthesis in the cytoplasm (T. Suzuki et al, Annu. Rev. Genet. 45, 299-329 (2011)).
Mitochondrial-encoded tRNAs (mt-tRNAs) show broader structural heterogeneity than nuclear-encoded tRNAs (S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015)). Further, for all the identified cases in which human disease are directly linked to mutations in tRNAs, the mutations occur in mt-tRNAs. A possible explanation is that a mutated mt-tRNA is unlikely to be compensated for by other tRNAs: each mitochondrial genome bears a single copy of only 22 mt-tRNAs, and importing a nuclear-encoded tRNA is rare (S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015)).
Pathogenic point mutations have been described that affect every mt-tRNA and are linked to defects in oxidative energy metabolism. It is now emerging that these mutations are also linked to other complex traits, including neurodegenerative diseases, ageing and cancer (E. A. Schon et al, Nat. Rev. Genet. 13, 878-890 (2012)). Mutations are catalogued in variety of databases, such as MITOMAP and Online Mendelian Inheritance in Man (OMIM). MELAS and MERRF are two classic diseases associated with mitochondrial tRNA mutations in mt-tRNA Leu(UUR) and mt-tRNA Lys, respectively. Other mtDNA mutations associated with the MERRF and MELAS phenotypes are reported in Yarham, J. W., et al., 2010. Wiley Interdisciplinary Reviews: RNA, 1(2), pp. 304-324.
If all mitochondrial genomes in one cell carry a mutation, a condition known as homoplasmy, the effect of this mutation is enhanced. However, a pathogenic mutation in mt-tRNA may also affect a proportion of the mt-tRNA copies, a condition known as heteroplasmy. Manifestation of a clinical phenotype depends on the threshold of mutation-affected mitochondria; this threshold varies among tissues. (S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015)).
Suitably, the mt-tRNA may be selected from one or more of: a leucine(UUR) mt-tRNA, a lysine mt-tRNA, a methionine mt-tRNA, a tryptophan mt-tRNA, an aspartate mt-tRNA, an isoleucine mt-tRNA, a glycine mt-tRNA, an arginine mt-tRNA, a histidine mt-tRNA, a serine(AGY) mt-tRNA, a leucine(CUN) mt-tRNA, a threonine mt-tRNA, a phenylalanine mt-tRNA, a valine mt-tRNA, a glutamine mt-tRNA, an alanine mt-tRNA, an asparagine mt-tRNA, a cysteine mt-tRNA, a tyrosine mt-tRNA, a serine(UCN) mt-tRNA, a glutamate mt-tRNA, and a proline mt-tRNA.
In some embodiments, the mt-tRNA is selected from one or more of: a leucine(UUR) mt-tRNA, a lysine mt-tRNA, a histidine mt-tRNA, a leucine(CUN) mt-tRNA, a phenylalanine mt-tRNA, a valine mt-tRNA, a glutamine mt-tRNA, a serine(UCN) mt-tRNA, or a proline mt-tRNA.
In some embodiments, the mt-tRNA is selected from a leucine(UUR) mt-tRNA or a lysine mt-tRNA.
In some embodiments, the mt-tRNA is a leucine(UUR) mt-tRNA.
Exemplary mt-tRNA consensus sequences are given in Table 1, below. These sequences do not show the 3′-terminal CCA residues. The mt-tRNA may comprise CCA residues at the 3′ terminal of the exemplary mt-tRNA sequences.
Suitably, the mt-tRNA comprises a nucleotide sequence selected from one or more of SEQ ID NOs: 1-22, or a variant with at least 90%, at least 95%, or at least 98% sequence identity to one or more of SEQ ID NOs: 1-22.
In some embodiments, the mt-tRNA comprises a nucleotide sequence selected from one or more of SEQ ID NOs: 1, 2, 9, 11, 13, 14, 15, 20, 22, or a variant with at least 90%, at least 95%, or at least 98% sequence identity to one or more of SEQ ID NOs: 1, 2, 9, 11, 13, 14, 15, 20, 22.
In some embodiments, the mt-tRNA comprises a nucleotide sequence selected from SEQ ID NO: 1 or SEQ ID NO: 2, or a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 1 or SEQ ID NO: 2.
In some embodiments, the mt-tRNA comprises the nucleotide sequence of SEQ ID NO: 1, or a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 1.
Suitably, a mt-tRNA Leu(UUR) consists of the nucleotide sequence of SEQ ID NO: 23, or a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 23;
Suitably, a mt-tRNA Lys consists of the nucleotide sequence of SEQ ID NO: 24, or a variant with at least 90%, at least 95%, or at least 98% sequence identity SEQ ID NO: 24.
tRNA Amino Acid-Charging
Mature eukaryotic tRNAs are prepared to function as an adaptor in translation by the covalent attachment of an amino acid to the adenosine at the invariant 3′ CCA tail. This reaction is catalysed by 20 different aminoacyl tRNA synthetases (ARSs), each of which is specific for the 20 different amino acids (S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015)). This process is known as known as Amino acid activation, aminoacylation or tRNA amino acid-charging.
Mutations in the tRNA itself (such as those discussed above) can disrupt the tRNA charging activity or lead to mischarging of the tRNA. Further, mutations in the ARS genes can result in reduced tRNA charging activity, complete loss of tRNA charging activity, and/or mischarging of tRNA.
In human cells, two distinct sets of ARSs are distinguished by their cytoplasmic or mitochondrial localization. Human cells contain 17 cytoplasmic ARS polypeptides, 18 mitochondrial ARSs, and 2 dual-localized ARSs present in both cytoplasm and mitochondria.
Mutations in cytoplasmic ARSs are associated with Charcot-Marie-Tooth (CMT) and related neuropathies. Mutations in mitochondrial ARSs are associated with a wider variety of syndromes and diseases (Yao, P. and Fox, P. L., 2013. EMBO molecular medicine, 5(3), pp. 332-343).
To detect and/or quantify tRNAs which are not amino acid-charged, an oligonucleotide can be ligated to the 3′ end of the tRNA. This ligation is possible only if the tRNA is not AA-charged, since otherwise the 3′ end of the tRNA is not free but bound to the amino acid. Subsequently, the presence of a nucleotide sequence located in the oligonucleotide can be detected and/or quantified.
The oligonucleotide is suitably a heteroduplex composed of an RNA oligonucleotide and a DNA oligonucleotide (i.e. an RNA/DNA heteroduplex) or a DNA oligonucleotide duplex. Suitably, the (hetero)duplex is formed from two complementary oligonucleotides and/or comprises a TGG-3′ overhang. Optionally, the oligonucleotide is an RNA/DNA heteroduplex comprising a TGG-3′ overhang of the DNA oligonucleotide. For example, the oligonucleotide may consist of: (i) a DNA oligonucleotide consisting of from 5′ to 3′: (N)x-TGG, where x is 1 to 100, 5 to 50, or 10 to 30; and (ii) a RNA oligonucleotide comprising or consisting of the reverse complement of (N)x.
Accordingly, in some embodiments the tRNA comprises a 3′ ligated oligonucleotide. Suitably, the tRNA comprises a sequence comprising or consisting of from 5′ to 3′: SEQ-CCA-(N)x, where x is 1 to 100, 5 to 50, or 10 to 30. SEQ may be any tRNA sequence, for example a nucleotide sequence selected from one or more of SEQ ID NOs: 1-22, or a variant with at least 90%, at least 95%, or at least 98% sequence identity to one or more of SEQ ID NOs: 1-22. The presence of the sequence (N)x (i.e. the 3′ ligated oligonucleotide) can be detected and/or quantified.
Other RNAs
In some embodiments, the present invention relates to methods of detecting and/or quantifying a ribosomal RNA (rRNA).
Ribosomal RNA (rRNA) is a type of non-coding RNA which is the primary component of ribosomes. The rRNA may be any rRNA. Suitably, the rRNA may be a naturally-occurring rRNA. Suitably, the rRNA may be a human rRNA. Suitably, the rRNA may be a mitochondrially-encoded rRNA.
In some embodiments, the present invention relates to methods of detecting and/or quantifying a messenger RNA (mRNA).
Messenger RNA (mRNA) is a single-stranded RNA molecule that corresponds to the genetic sequence of a gene and is read by the ribosome in the process of producing a protein. The mRNA may be any mRNA. Suitably, the mRNA may be a naturally-occurring mRNA.
Suitably, the mRNA may be a human mRNA. Suitably, the mRNA may be a mitochondrially-encoded mRNA.
In some embodiments, the present invention relates to methods of detecting and/or quantifying a regulatory RNA.
Regulatory RNAs are non-coding RNA molecules that play a role in cellular processes such as activation or inhibition processes. Regulatory RNAs include microRNA (miRNAs) and their precursors, small interfering RNAs (siRNAs), short hairpin RNAs (shRNAs), and antisense RNAs (asRNAs). Other regulatory RNAs are described in Morris, K. V. and Mattick, J. S., 2014. Nature Reviews Genetics, 15(6), pp. 423-437. The regulatory RNA may be any regulatory RNA. Suitably, the regulatory RNA may be a naturally-occurring regulatory RNA. Suitably, the regulatory RNA may be a human regulatory RNA.
RNA Mutations
The RNA may comprise one or more mutations, i.e. the RNA may be a mutant RNA. A “mutation” is an alteration in the nucleotide sequence compared to the consensus nucleotide sequence (e.g. SEQ ID NOs: 1-24). Suitably, the mutation may be a point mutation, an insertion, a deletion, or duplication.
The RNA may comprise one or more pathogenic mutations, i.e. the RNA may be a pathogenic mutant RNA. A “pathogenic mutation” is a mutation that increases an individual's susceptibility or predisposition to a certain disease or disorder.
Suitably, the mutation is a point mutation, i.e. the RNA may comprise one or more point mutations, optionally one or more pathogenic point mutations. A “point mutation” is a mutation that only affects a single nucleotide of the nucleotide sequence.
The RNA may comprise one or more heteroplasmic or homoplasmic mutations, particularly when the RNA is mitochondrially-encoded (e.g. an mt-tRNA). A cell can have some mitochondria that have a mutation in the mtDNA and some that do not. In addition, a mitochondria can have some copies of mtDNA that have a mutation and some copies that do not. This is termed heteroplasmy. A “heteroplasmic mutation” is a mutation which can be present in a proportion of the mtDNA, and thus in a proportion of the mt-RNA. Most common mt-tRNA mutations are heteroplasmic. Homoplasmy refers to a cell and/or mitochondria that has a uniform collection of mtDNA. A “homoplasmic mutation” is a mutation which can be present in all the mtDNA, and thus in all the mt-RNA. Suitably, the RNA (e.g. mt-tRNA) comprises a heteroplasmic mutation.
mt-RNA mutations may be referred to based on the corresponding position in the mt-DNA rather than the position in the mt-RNA. For example, the A3243G mt-DNA mutation corresponds to an A14G point mutation in leucine(UUR) mt-tRNA and the A8344G mt-DNA mutation corresponds to an A55G point mutation in lysine mt-tRNA. A conversion of nucleotide numbering in the human mitochondrial genome to conventional tRNA numbering is provided by Mamit-tRNA (Joern Putz, Bruno Dupuis, Marie Sissler and Catherine Florentz, (2007), RNA, 13, pp 1184-90).
Exemplary mt-tRNA mutations, including mt-tRNA pathogenic mutations are provided by MITOMAP.
Exemplary mt-tRNA pathogenic mutations for each mt-tRNA are given in Table 2, below. An mt-tRNA may comprise one or more pathogenic mutation recited in Table 2. Suitably, an mt-tRNA may comprise a nucleotide sequence selected from any of SEQ ID NOs: 1-24 and a corresponding pathogenic mutation recited in Table 2.
Exemplary confirmed mt-tRNA pathogenic mutations for each mt-tRNA are given in Table 3 below. Confirmed pathogenic mutations have confirming reports which address criteria including: (1) independent reports of two or more unrelated families with evidence of similar disease; (2) evolutionary conservation of the nucleotide (for RNA variants) or amino acid (for coding variants); (3) presence of heteroplasmy; (4) correlation of variant with phenotype/segregation of the mutation with the disease within a family; (5) biochemical defects in complexes I, III, or IV in affected or multiple tissues; (6) functional studies showing differential defects segregating with the mutation (cybrid or single fiber studies); (7) histochemical evidence of a mitochondrial disorder; and (8) for fatal or severe phenotypes, the absence or extremely rare occurrence of the variant in large mtDNA sequence databases. An mt-tRNA may comprise one or more pathogenic mutation recited in Table 3. Suitably, an mt-tRNA may comprise a nucleotide sequence selected from any of SEQ ID NOs: 1-24 and a corresponding pathogenic mutation recited in Table 3.
Exemplary mt-tRNA pathogenic mutations for different diseases are given in Table 4 below. An mt-tRNA may comprise one or more pathogenic mutation recited in Table 4. Suitably, an mt-tRNA may comprise a nucleotide sequence selected from any of SEQ ID NOs: 1-24 and a corresponding pathogenic mutation recited in Table 4.
An mt-tRNA may be selected from the following mutant mt-tRNAs, which are associated with MELAS and MERRF:
An mt-tRNA may be selected from the following mutant mt-tRNAs, which are associated with MELAS:
In some embodiments, the mt-tRNA is mt-tRNA Leu (UUR) with mutation A3243G or T3271C, or mt-tRNA Lys with mutation A8344G. In some embodiments, the mt-tRNA is mt-tRNA Leu (UUR) with mutation A3243G.
Suitably, an mt-tRNA Leu (UUR) with mutation A3243G may consist of SEQ ID NO: 25, or a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 25.
Suitably, an mt-tRNA Leu (UUR) with mutation T3271C may consist of SEQ ID NO: 26, or a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 26.
Suitably, an mt-tRNA Lys with mutation A8344G may consist of SEQ ID NO: 27, or a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 27.
In SEQ ID NOs: 25-27 the position of the mutation is shown in bold and underlined.
RNA Post-Transcriptional Modifications
The RNA may comprise one or more post-transcriptional modifications. A “post-transcriptional modification” is any chemical modification of the nucleosides in the RNA following transcription.
RNA can be post-transcriptionally modified with over 100 chemical moieties. For example, on average a tRNA can carry 14 modifications that contribute to its function. Modifications can directly influence RNA structure, by promoting or disrupting certain intramolecular interactions; they can make the RNA molecule more rigid or more flexible. They can also influence RNA interactions with other molecules, in particular proteins. Post-transcriptional RNA modifications are described in the MODOMICS database. (Boccaletto, P., et al., 2018. Nucleic acids research, 46(D1), pp. D303-D307.).
For example, to be fully active, tRNAs need to be heavily modified post-transcriptionally. Growing evidence indicates that tRNA post-transcriptional modifications may play important roles in complex human pathologies (A. G. Torres et al, Trends. Mol. Med. 20, 306-314 (2014); S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015); and Pereira, M., et al., 2018. International journal of molecular sciences, 19(12), p. 3738).
Post translational modifications found in tRNAs include τm5s2U, 5-taurinomethyl-2-thiouridine; τm5U, 5-taurinomethyluridine; Ψ, pseudouridine; ac4C, N4-acetylcytidine; Am, 2′-O-methyladenosine; Ar(p), 2′-O-ribosyladenosine (phosphate); Cm, 2′-O-methylcytidine; D, dihydrouridine; f5C, 5-formylcytidine; Gm, 2′-O-methylguanosine; I, inosine; I6A, N6-isopentenyladenosine; m1A, 1-methyladenosine; m1G, 1-methylguanosine; m1I, 1-methylinosine; m2G, N2-methylguanosine; m22G, N2,N2-dimethylguanosine; m3C, 3-methylcytidine; m5C, 5-methylcytidine; m5U, 5-methyluridine; m7G, 7-methylguanosine; mcm5s2U, 5-methoxycarbonylmethyl-2-thiouridine; mcm5U, 5-methoxycarbonylmethyluridine; ms2A, 2-methylthioadenosine; ms216A, 2-methylthio-N6-isopentenyladenosine; ms6t2A, 2-methylthio-N6-threonyl carbamoyladenosine; ncm5U, 5-carbamoylmethyluridine; ncm5Um, 5-carbamoylmethyl-2′-O-methyluridine; Q, queuosine; rT, ribothymidine; s2U, 2-thiouridine; t6A, N6-threonylcarbamoyladenosine; Un, 2′-O-methyluridine; yW, wybutosine (S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015)).
mt-tRNAs may contain one or more of the following post-transcriptional modifications (Positions in a tRNA are according to the nucleotide numbering system from tRNAdb): m1A9, m1G9, m2G10, D20, m2G26, m22G26, Ψ27, Ψ28, Ψ31, m3C32, ω32, f5C34, τm5U34, S2U34, τm5s2U34, Q34, m1G37, t6A37, i6A37, ms2A37, Ψ39, Ψ40, m5C49, Ψ50, m5U54, Ψ55, m1mA58, Ω67 (T. Suzuki et al, Annu. Rev. Genet. 45, 299-329 (2011)).
Exemplary consensus post-transcriptional modifications for mt-tRNAs are given in Table 5, below. An mt-tRNA may comprise one or more post-transcriptional modification recited in Table 5. Suitably, an mt-tRNA may comprise a nucleotide sequence selected from any of SEQ ID NOs: 1-24 and a corresponding post-transcriptional modification recited in Table 5.
Suitably, the mt-tRNA is a leucine(UUR) mt-tRNA and the post-transcriptional modification is τm5U34 or the mt-tRNA is a lysine mt-tRNA and the post-transcriptional modification is τm5s2U34.
In some embodiments, the RNA comprises one or more aberrant post-transcriptional modifications, i.e. the RNA is an aberrantly-modified RNA. An “aberrant post-transcriptional modification” is a post-transcriptional modification which differs from a consensus post-transcriptional modification. Suitably, the aberrant post-transcriptional modification is the absence of a consensus post-transcriptional modification.
In some embodiments, the RNA comprises one or more pathogenic aberrant post-transcriptional modification, i.e. the RNA is an aberrantly-modified pathogenic RNA. A “pathogenic aberrant post-transcriptional modification” is an aberrant post-transcriptional modification that increases an individual's susceptibility or predisposition to a certain disease or disorder. Pathogenic aberrant post-transcriptional modifications are described in de Crécy-Lagard, et al., 2019. Nucleic acids research, 47(5), pp. 2143-2159 and Pereira, M., et al., 2018. International journal of molecular sciences, 19(12), p. 3738.
Exemplary aberrant post-transcriptional modifications for mt-tRNAs are given in Table 6, below. An mt-tRNA may comprise one or more aberrant post-transcriptional modification recited in Table 6. Suitably, an mt-tRNA may comprise a nucleotide sequence selected from any of SEQ ID NOs: 1-24 and a corresponding aberrant post-transcriptional modification recited in Table 6.
In some embodiments:
In some embodiments, the RNA is a leucine(UUR) mt-tRNA and the aberrant post-transcriptional modification is the absence of τm5U34 or m1A58 or the RNA is a lysine mt-tRNA and the aberrant post-transcriptional modification is the absence of τm5s2U34. These aberrant modifications are associated with MELAS and MERRF, respectively (Richter, U., et al., 2018. Nature communications, 9(1), pp. 1-11).
In some embodiments, the RNA is a leucine(UUR) mt-tRNA and the aberrant post-transcriptional modification is the absence of τm5U34 or the RNA is a lysine mt-tRNA and the aberrant post-transcriptional modification is the absence of τm5s2U34.
Padlock Probe
In one aspect, the present invention provides a padlock probe comprising terminal regions complementary to a RNA. The padlock probe may be used to detect and/or quantify said RNA. The RNA may be any RNA described herein.
Padlock probes (PLPs) are oligonucleotides, for example single stranded DNA molecules, consisting of two target-complementary segments connected by a linker region. The target-complementary segments are referred to herein as “terminal regions” and are complementary to a “target sequence”. Upon hybridization to the target sequence, the two terminal regions are brought into contact, allowing PLP circularization by ligation (Nilsson, M., et al., 1994. Science, 265(5181), pp. 2085-2088). As used herein, the term “padlock probe” includes molecular inversion probes and connector inversion probes, in which the terminal regions are separated by a gap sequence when hybridised to the target sequence. A schematic showing the hybridization, and circularization of a padlock probe is shown in
Padlock probes may be used to detect and/or quantify single nucleotide variations in genomic DNA. The circularization of the padlock probe is highly specific as it requires that the terminal regions are brought into contact. Ligation is strongly inhibited by any mismatches at the ligation junction. Multiple padlock probes can be used at once because since only intramolecular probe ligation is scored, cross-reactions are unlikely to arise between many simultaneously added probes. Moreover, highly sensitive detection and/or quantification of circularized probes is possible, e.g. by amplifying reacted probes via a rolling circle replication mechanism.
Techniques for the design and synthesis of padlock probes will be well known to those of skill in the art. The padlock probe may be any length suitable for specific hybridization, circularization, and detection/quantification. Suitably, the padlock probe may be about 50 to about 200 nucleotides in length. Suitably, the padlock probe may be about 50 to about 150 nucleotides in length. Suitably, the padlock probe may be about 50 to about 100 nucleotides in length. Suitably, the padlock probe may be about 70 to about 100 nucleotides in length.
Target Sequence
The terminal regions are each complementary to a region of the RNA. Together the terminal regions are complementary to a target sequence in the RNA. The padlock probe is designed to be complementary to the target sequence and can circularize when the target sequence is present.
The target sequence may be any nucleotide sequence in the RNA which will allow for specific detection/quantification of the RNA. Suitably, the target sequence is a unique nucleotide sequence, e.g. does not occur in any other naturally-occurring RNA (or DNA).
Suitably, the target sequence may be 15-50 nucleotides in length. Suitably, the target sequence may be 20-30 nucleotides in length.
If the RNA is a tRNA comprising a 3′ ligated oligonucleotide, then the target sequence may comprise at least part of the nucleotide sequence of the 3′ ligated oligonucleotide. For example, if the tRNA comprises a sequence comprising or consisting of from 5′ to 3′: SEQ-CCA-(N)x, where x is 1 to 100, 5 to 50, or 10 to 30, the target sequence may comprise at least part of the nucleotide sequence (N)x.
The target sequence may be associated with one or more mutation and/or one or more post-transcriptional modification. The mutation may be any mutation described herein (e.g. in Tables 2-4). The post-transcriptional modification may be any aberrant post-transcriptional modification described herein (e.g. in Table 6).
The target sequence may comprise one or more mutation site and/or one or more post-transcriptional modification site. The mutation site may be the site of any mutation described herein (e.g. in Tables 2-4). The post-transcriptional modification site may be the site of any aberrant post-transcriptional modification described herein (e.g. in Table 6).
The target sequence may comprise one or more mutation and/or one or more post-transcriptional modification. The mutation may be any mutation described herein (e.g. in Tables 2-4). The post-transcriptional modification may be any aberrant post-transcriptional modification described herein (e.g. in Table 6).
Exemplary Target Sequences
The target sequence may consist of a fragment of SEQ ID NOs: 1-24 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to any of SEQ ID NOs: 1-24, optionally wherein the target sequence comprises one or more mutations or post-transcriptional modifications described herein (e.g. in Tables 2-4 or 6).
Suitably, the target sequence consists of a fragment of SEQ ID NO: 23 or SEQ ID NO: 24 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 23 or SEQ ID NO: 24, optionally wherein the target sequence comprises one or more mutations or post-transcriptional modifications described herein (e.g. in Tables 2-4 or 6).
In some embodiments, the target sequence consists of a fragment of SEQ ID NO: 23 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 23, wherein the target sequence comprises aberrant post-transcriptional modification U34 and/or A58.
In some embodiments, the target sequence consists of a fragment of SEQ ID NO: 24 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 24, wherein the target sequence comprises aberrant post-transcriptional modification U34.
In some embodiments, the target sequence consists of a fragment of SEQ ID NO: 25 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 25, wherein the target sequence comprises mutation A3243G.
In some embodiments, the target sequence consists of a fragment of SEQ ID NO: 26 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 26, wherein the target sequence comprises mutation T3271C.
In some embodiments, the target sequence consists of a fragment of SEQ ID NO: 27 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 27, wherein the target sequence comprises mutation A8344G.
Terminal Regions
The terminal regions may be any nucleotide sequence which will allow for specific hybridisation and circularization with the target sequence. The terminal regions may be any suitable length. Suitably, the terminal regions are each 5-30 nucleotides in length, optionally 5-15 nucleotides in length. Suitably, the terminal regions in total are 15-50 nucleotides in length, optionally 20-30 nucleotides in length.
Each terminal region is complementary to part of the target sequence. One terminal region (the first terminal region) is complementary to nucleotides a to n of the target sequence, and the other terminal region (the second terminal region) is complementary to nucleotides n+m to b of the target sequence, wherein:
The terminal regions may be complementary to adjacent nucleotide sequences (i.e. m=1). Adjacent nucleotide sequences may together cover the whole target sequence.
The terminal regions may be complementary to non-adjacent nucleotide sequences separated by a gap sequence (i.e. m>1). Non-adjacent nucleotide sequences and the gap sequence may together cover the whole target sequence. Suitably, the gap sequence may consist of 6 or fewer nucleotides (i.e. m≤7), 5 or fewer nucleotides (i.e. m≤6), 4 or fewer nucleotides (i.e. m≤5), 3 or fewer nucleotides (i.e. m≤4), 2 or fewer nucleotides (i.e. m≤3), or one nucleotide (i.e. m≤2). When the terminal regions are complementary to non-adjacent nucleotide sequences separated by a gap sequence the padlock probe may be referred to as a molecular inversion probe and/or a connector inversion probe (Hardenbol, P., et al., 2003. Nature biotechnology, 21(6), pp. 673-678; and Akhras, M. S., et al., 2007. PloS one, 2(9), p. e915).
Suitably, one or more nucleotides in the target sequence may be a mutation site or post-transcriptional modification site. For example, one or more of positions a to n or n+m to b may correspond to a mutation or a post-transcriptional modification in the target sequence. In particular, the terminal nucleotides of a terminal region may be complementary to a mutation site or post-transcriptional modification site to increase specificity of the padlock probe. For example, the nucleotide at position n or the nucleotide at position n+m may correspond to a mutation or a post-transcriptional modification in the target sequence.
Complementarity is achieved by distinct interactions between nucleobases: adenine (A), thymine (T) (or uracil (U) in RNA), guanine (G) and cytosine (C). In nucleic acids, nucleobases are held together by hydrogen bonding, which normally only works efficiently between A and T (or U) and between G and C. A complementary strand of DNA may therefore be constructed based on nucleobase complementarity, where A is complementary to T (or U) and G is complementary to C.
As used herein, the term “complementary” means completely or partially complementary, i.e. each terminal region may be either completely or partially complementary to part of the target sequence. Suitably, each terminal region is completely complementary to part of the target sequence.
As used herein, “completely complementary” refers to the case where every nucleotide is complementary. In other words the terminal region is a (reverse) complement of part of the target sequence.
As used herein, “partially complementary” refers to the case wherein the majority of nucleotides are complementary, but wherein one or more nucleotides are not complementary. In other words, the terminal region is a (reverse) complement of the target sequence, but wherein one or more nucleotides are not complementary. When the terminal regions are partially complementary, the degree of complementarity is such that circularization of the padlock probe can still occur, i.e. the padlock probe can still hybridise to the RNA and ligation can still occur. Suitably, the degree of complementarity is such that the padlock probe can specifically hybridize and circularize, i.e. the padlock probe does not detect other polynucleotides. A person skilled in the art will be able to design partially complementary padlock probes such that the desired degree of complementarity is achieved.
Suitably, when a terminal region is partially complementary, four or fewer nucleotides, three or fewer nucleotides, two or fewer nucleotides, or one nucleotide in the terminal region is not complementary. Suitably, a terminal region may have at least 80% sequence identity, at least 85% sequence identity, at least 90% sequence identity, or at least 95% sequence identity to a fragment of a (reverse) complement of the target sequence.
Suitably, when a terminal region is partially complementary, two or more terminal nucleotides of the terminal region are completely complementary to the target sequence. For example, positions n-x to n and/or positions n+m to n+m+y of the target sequence may be completely complementary to the terminal regions, wherein x and y are each independently 1 or more. Suitably, three or more (x≥2 and/or y≥2), four or more (x≥3 and/or y≥3), or five or more (x≥4 and/or y≥4) nucleotides of the terminal region are completely complementary to the target sequence. This may increase ligation efficiency.
If the RNA is a tRNA comprising a 3′ ligated oligonucleotide, then a terminal region may be complementary to at least part of the nucleotide sequence of the 3′ ligated oligonucleotide. For example, if the tRNA comprises a sequence comprising or consisting of from 5′ to 3′: SEQ-CCA-(N)x, where x is 1 to 100, 5 to 50, or 10 to 30, then a terminal region may be complementary to at least part of the nucleotide sequence (N)x.
A terminal region may be complementary to a nucleotide sequence associated with one or more mutation and/or one or more post-transcriptional modification. The mutation may be any mutation described herein (e.g. in Tables 2-4). The post-transcriptional modification may be any aberrant post-transcriptional modification described herein (e.g. in Table 6).
A terminal region may be complementary to a nucleotide sequence comprising one or more mutation site and/or one or more post-transcriptional modification site. The mutation may be any mutation described herein. The mutation site may be the site of any mutation described herein (e.g. in Tables 2-4). The post-transcriptional modification site may be the site of any aberrant post-transcriptional modification described herein (e.g. in Table 6).
A terminal region may be complementary to a nucleotide sequence comprising one or more mutation and/or one or more post-transcriptional modification. The mutation may be any mutation described herein (e.g. in Tables 2-4). The post-transcriptional modification may be any aberrant post-transcriptional modification described herein (e.g. in Table 6).
Exemplary Terminal Regions
The terminal regions may each consist of a (reverse) complement of a fragment of any one of SEQ ID NOs: 1-24 or a (reverse) complement of a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to any of SEQ ID NOs: 1-24, optionally wherein a terminal region is complementary to a nucleotide sequence comprising one or more mutation site or post-transcriptional modification site described herein (e.g. in Tables 2-4 and 6).
In some embodiments, the terminal regions each consist of a (reverse) complement of a fragment of SEQ ID NO: 23 or SEQ ID NO: 24 or a (reverse) complement of a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 23 or SEQ ID NO: 24, optionally wherein a terminal region is complementary to a nucleotide sequence comprising one or more mutation site or post-transcriptional modification site described herein (e.g. in Tables 2-4 and 6).
In some embodiments, the terminal regions each consist of a (reverse) complement of a fragment of SEQ ID NO: 23 or a (reverse) complement of a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 23, wherein one terminal region is complementary to a nucleotide sequence comprising aberrant post-transcriptional modification U34 or A58.
In some embodiments, the terminal regions each consist of a (reverse) complement of a fragment of SEQ ID NO: 24 or a (reverse) complement of a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 24, wherein one terminal region is complementary to a nucleotide sequence comprising aberrant post-transcriptional modification U34.
In some embodiments, the terminal regions each consist of a (reverse) complement of a fragment of SEQ ID NO: 25 or a (reverse) complement of a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 25, wherein one terminal region is complementary to a nucleotide sequence comprising mutation A3243G.
In some embodiments, the terminal regions each consist of a (reverse) complement of a fragment of SEQ ID NO: 26 or a (reverse) complement of a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 26, wherein one terminal region is complementary to a nucleotide sequence comprising mutation T3271C.
In some embodiments, the terminal regions each consist of a (reverse) complement of a fragment of SEQ ID NO: 27 or a (reverse) complement of a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 27, wherein one terminal region is complementary to a nucleotide sequence comprising mutation A8344G.
Suitably, the terminal regions consist of fragments which are adjacent fragments. Suitably, the terminal regions consist of fragments which are non-adjacent fragments separated by a gap sequence. Suitably, the gap sequence may consist of 6 or fewer nucleotides, 5 or fewer nucleotides, 4 or fewer nucleotides, 3 or fewer nucleotides, 2 or fewer nucleotides, or one nucleotide.
Exemplary reverse complements of SEQ ID NOs: 1-27 are shown in Table 7 below as SEQ ID NOs: 28-54.
The terminal regions may each consist of a fragment of any one of SEQ ID NOs: 28-51, or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to any one of SEQ ID NOs: 28-51, optionally wherein a terminal region is complementary to a nucleotide sequence comprising one or more mutation site or post-transcriptional modification site described herein (e.g. in Tables 2-4 and 6).
In some embodiments, the terminal regions each consist of a fragment of SEQ ID NO: 50 or SEQ ID NO: 51, or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity, optionally wherein a terminal region is complementary to a nucleotide sequence comprising one or more mutation site or post-transcriptional modification site described herein (e.g. in Tables 2-4 and 6).
In some embodiments, the terminal regions each consist of a fragment of SEQ ID NO: 50 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 50, wherein a terminal region is complementary to a nucleotide sequence comprising aberrant post-transcriptional modification U34 or A58.
In some embodiments, the terminal regions each consist of a fragment of SEQ ID NO: 51 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 51, wherein a terminal region is complementary to a nucleotide sequence comprising aberrant post-transcriptional modification U34.
In some embodiments, the terminal regions each consist of a fragment of SEQ ID NO: 52 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 52, wherein a terminal region is complementary to a nucleotide sequence comprising mutation A3243G.
In some embodiments, the terminal regions each consist of a fragment of SEQ ID NO: 53 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 53, wherein a terminal region is complementary to a nucleotide sequence comprising mutation T3271C.
In some embodiments, the terminal regions each consist of a fragment of SEQ ID NO: 54 or a fragment of a variant with at least 90%, at least 95%, or at least 98% sequence identity to SEQ ID NO: 54, wherein a terminal region is complementary to a nucleotide sequence comprising mutation A8344G.
Suitably, the terminal regions consist of fragments which are adjacent fragments. Suitably, the terminal regions consist of fragments which are non-adjacent fragments separated by a gap sequence. Suitably, the gap sequence may consist of 6 or fewer nucleotides, 5 or fewer nucleotides, 4 or fewer nucleotides, 3 or fewer nucleotides, 2 or fewer nucleotides, or one nucleotide.
Suitably, when a terminal region is complementary to a mutation or post-transcriptional modification, the complementary nucleotide is a terminal nucleotide, i.e., the nucleotide at position n or the nucleotide at position n+m is complementary to the mutation or a post-transcriptional modification. This may increase the specificity of the padlock probe.
In some embodiments the terminal regions may comprise or consist of SEQ ID NOs: 56 or 58 or variants thereof. Suitably, the variants comprise two or fewer, or one nucleotide substitution. Suitably, the variants do not comprise nucleotide substitutions of the two 3′ terminal nucleotides of SEQ ID NOs: 56 and 58. In some embodiments the terminal regions may consist of SEQ ID NOs: 56 or 58.
In some embodiments the terminal regions may comprise or consist of SEQ ID NOs: 55 and 56 or SEQ ID NOs: 57 and 58 or variants thereof. Suitably, the variants comprise two or fewer, or one nucleotide substitution. Suitably, the variants do not comprise nucleotide substitutions of the two 5′ terminal nucleotides of SEQ ID NOs: 55 and 57 and the two 3′ terminal nucleotides of SEQ ID NOs: 56 and 58.
In some embodiments the terminal regions may consist of SEQ ID NOs: 55 and 56 or SEQ ID NOs: 57 and 58.
The nucleotides shown in bold in SEQ ID NOs: 56 and 58 are those nucleotides corresponding to mutation site A3243.
In some embodiments the terminal regions may comprise or consist of SEQ ID NOs: 59 and 60 or SEQ ID NOs: 61 and 62 or variants thereof. Suitably, the variants comprise two or fewer, or one nucleotide substitution. Suitably, the variants do not comprise nucleotide substitutions of the two 5′ terminal nucleotides of SEQ ID NOs: 59 and 61 and the two 3′ terminal nucleotides of SEQ ID NOs: 60 and 62.
In some embodiments the terminal regions may consist of SEQ ID NOs: 59 and 60 or SEQ ID NOs: 61 and 62.
Linker Region
The linker region may be any linker regions which allows for detection and/or quantification of the circularized padlock probe. The linker regions may be any suitable length. Suitably, the linker region may be 10-150 nucleotides in length. Suitably, the linker region may be 10-100 nucleotides in length. Suitably, the linker region may be 10-60 nucleotides in length.
The linker region will typically comprise one or more primer binding sites for detection and/or quantification via amplification of the circularized padlock probe. As used herein, a “primer binding site” is a nucleotide sequence where an RNA or DNA single-stranded primer can bind to start replication or the reverse complement thereof. A primer binding site may consist of any suitable nucleotide sequence for specific amplification. Suitably, a primer binding site is 10-30 nucleotides in length.
The linker region may comprise a forward primer binding site. A “forward primer site” is complementary to a forward primer. The forward primer may bind to the circularized padlock probe at the forward primer site. When the forward primer is extended by a DNA polymerase it may form a complementary strand to the circularized padlock probe. By using a forward primer binding site, the circularized padlock probe may be amplified by rolling circle amplification.
The linker region may further comprise a reverse primer binding site. A “reverse primer site” is identical to a reverse primer, i.e. the complement of the “reverse primer site” is complementary to a reverse primer. The reverse primer may bind to the complementary strand of the circularized padlock probe (e.g. formed by extension of the forward primer) at the site corresponding to the reverse primer site. When the reverse primer is extended by a DNA polymerase it may form a strand identical to the circularized padlock probe. By using a forward primer binding site and a reverse primer binding site, the circularized padlock probe may be amplified by hyper-branched rolling circle amplification.
The linker region may comprise a tag or barcode sequence. A tag or barcode sequence may be used to detect and/or quantify the circularized padlock probe. For example, a tag or barcode can be used to hybridize circularized (and optionally amplified) padlock probes to a microarray. Incorporating a tag or barcode sequence may allow the detection and/or quantification to be multiplexed. A method for tagged hyper-branched rolling circle amplification is described in US2012264630A1. A tag or barcode sequence may consist of any sequence suitable for specific detection and/or quantification. Suitably, a tag or barcode sequence is 10-30 nucleotides in length.
The padlock probe may comprise or consist of from 5′ to 3′: a first terminal region; a first primer binding site; a second primer binding site; and a second terminal region. Suitably, the first primer binding site is a forward primer site and the second primer binding site is a reverse primer binding site, or vice versa. Optionally, the padlock probe further comprises a tag or barcode sequence.
The terminal regions, primer sites, and optionally tag or barcode sequence may overlap as long as the padlock probe is still capable of circularising and being detected. Suitably, the overlap may be five or fewer, four or fewer, three of fewer, two or fewer, or one or fewer nucleotides. Suitably, the terminal regions, primer sites, and optionally tag or barcode sequence do not overlap.
Any suitable nucleotide sequence may be present between the terminal regions, primer sites, and optionally tag or barcode sequence, e.g. a nucleotide sequence with one or more, two or more, three or more, four or more, or five or more nucleotides, as long as the padlock probe is still capable of circularising and being detected. For example, the nucleotide sequence (A)z may be inserted, where z is between 1 and 5.
Exemplary Padlock Probes
In some embodiments, the padlock probe comprises or consists of a nucleotide sequence with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to any one of SEQ ID NOs: 63-66, or a fragment thereof. In some embodiments, the padlock probe comprises or consists of the nucleotide sequence of any one of SEQ ID NOs: 63-66, or a fragment thereof.
In some embodiments, the padlock probe comprises or consists of a nucleotide sequence with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 63 or SEQ ID NO: 64, or a fragment thereof. In some embodiments, the padlock probe comprises or consists of the nucleotide sequence of SEQ ID NO: 63 or SEQ ID NO: 64, or a fragment thereof.
Kit, Composition, and RNA Detection and/or Quantification System
The present invention provides a kit, composition, or RNA detection and/or quantification system comprising one or more padlock probes of the present invention.
As used herein, an “RNA detection and/or quantification system” is a system which comprises all components necessary to detect and/or quantify an RNA using a padlock probe.
Padlock Probe Combinations
In some embodiments, the kit, composition, or RNA detection and/or quantification system of the present invention comprises two or more different padlock probes. A combination of two or more different padlock probes may be useful, for example, to detect and/or quantify both mutant RNA and non-mutant RNA, or to detect and/or quantify an RNA of interest and a reference RNA.
Suitably, when the kit, composition, or RNA detection and/or quantification system of the present invention comprises two or more different padlock probes, each different padlock probe may comprise a unique tag or barcode sequence. The unique tag or barcode sequence may be used for multiplexed detection and/or quantification.
Detection and/or Quantification of Mutant RNA and Non-Mutant RNA
The kit, composition, or RNA detection and/or quantification system may comprise one or more padlock probes for detecting and/or quantifying a mutant RNA and one or more padlock probes for detecting and/or quantifying a non-mutant (e.g. consensus) RNA.
Suitably, the mutant RNA is identical to the non-mutant (e.g. consensus) RNA except for the mutation. The RNA and the mutation may be any RNA and mutation described herein.
In some embodiments, the kit, composition, or RNA detection and/or quantification system comprises:
In some embodiments:
In some embodiments, the first padlock probe comprises terminal regions complementary to a mutant mt-tRNA Leu (UUR) with mutation A3243G, and the second padlock probe comprises terminal regions complementary to a non-mutant (e.g. consensus) mt-tRNA Leu (UUR).
In some embodiments, the first padlock probe comprises terminal regions consisting of SEQ ID NOs: 55 and 56, and the second padlock probe comprises terminal regions consisting of SEQ ID NOs: 57 and 58.
In some embodiments, the first padlock probe comprises or consists of the nucleotide sequence of SEQ ID NO: 63 or a variant with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 63, and the second padlock probe comprises or consists of the nucleotide sequence of SEQ ID NO: 64 or a variant with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 64.
In some embodiments, the first padlock probe consists of the nucleotide sequence of SEQ ID NO: 63, and the second padlock probe consists of the nucleotide sequence of SEQ ID NO: 64.
Detection and/or Quantification of Corresponding Genes
The kit, composition, or RNA detection and/or quantification system may comprise one or more padlock probes for detecting and/or quantifying an RNA of interest and one or more padlock probes for detecting and/or quantifying the gene which encodes the RNA of interest. For example, if the RNA of interest is a heteroplasmic mutant RNA, the RNA mutation load and DNA mutation load can be compared.
The padlock probes for detecting and/or quantifying the gene which encodes the RNA of interest is not a padlock probe according to the present invention. However, such padlock probes can be designed and synthesised by one of skill in the art.
In some embodiments, the kit, composition, or RNA detection and/or quantification system comprises:
In some embodiments, the kit, composition, or RNA detection and/or quantification system comprises:
Suitably, the kit, composition, or RNA detection and/or quantification system comprises the first and third padlock probes and/or the second and fourth padlock probes. Suitably, the kit, composition or RNA detection and/or quantification system comprises the first, second, third, and fourth padlock probes.
In some embodiments:
In some embodiments, the first padlock probe comprises terminal regions complementary to a mutant mt-tRNA Leu (UUR) with mutation A3243G, the second padlock probe comprises terminal regions complementary to a non-mutant (e.g. consensus) mt-tRNA Leu (UUR), the third padlock probe comprises terminal regions complementary to a gene encoding the mutant mt-tRNA Leu (UUR) with mutation A3243G, and the fourth padlock probe comprises terminal regions complementary to a gene encoding the non-mutant (e.g. consensus) mt-tRNA Leu (UUR).
In some embodiments, the first padlock probe comprises terminal regions consisting of SEQ ID NOs: 55 and 56, the second padlock probe comprises terminal regions consisting of SEQ ID NOs: 57 and 58, the third padlock probe comprises terminal regions consisting of SEQ ID NOs: 67 and 68, and the fourth padlock probe comprises terminal regions consisting of SEQ ID NOs: 69 and 70.
In some embodiments, the first padlock probe comprises or consists of the nucleotide sequence of SEQ ID NO: 63 or a variant with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 63, the second padlock probe comprises or consists of the nucleotide sequence of SEQ ID NO: 64 or a variant with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 64, the third padlock probe comprises or consists of the nucleotide sequence of SEQ ID NO: 71 or a variant with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 71, and the fourth padlock probe comprises or consists of the nucleotide sequence of SEQ ID NO: 72 or a variant with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 72.
In some embodiments, the first padlock probe consists of the nucleotide sequence of SEQ ID NO: 63, the second padlock probe consists of the nucleotide sequence of SEQ ID NO: 64, the third padlock probe consists of the nucleotide sequence of SEQ ID NO: 71, and the fourth padlock probe consists of the nucleotide sequence of SEQ ID NO: 72.
Detection and/or Quantification of Reference RNAs
The kit, composition, or RNA detection and/or quantification system may comprise one or more padlock probes for detecting and/or quantifying an RNA of interest, and one or more padlock probes for detecting and/or quantifying a reference RNA. The quantity of a reference tRNA may, for example, act as indicator of the level of transcriptional activity.
In some embodiments, the kit, composition, or RNA detection and/or quantification system comprises:
In some embodiments, the kit, composition, or RNA detection and/or quantification system comprises:
Suitably, the kit, composition, or RNA detection and/or quantification system comprise any combination of the first, second, third, fourth, and fifth padlock probes. Suitably, the kit, composition or RNA detection and/or quantification system comprises the first, second, and fifth padlock probes. Suitably, the kit, composition or RNA detection and/or quantification system comprises the first, second, third, fourth, and fifth padlock probes.
The kit, composition, or RNA detection and/or quantification system may comprise one or more padlock probes for detecting and/or quantifying an RNA of interest, and one or more padlock probes for detecting and/or quantifying a reference nuclear-encoded RNA and/or one or more padlock probes for detecting and/or quantifying a reference mitochondrial-encoded RNA.
In some embodiments, the kit, composition, or RNA detection and/or quantification system comprises:
In some embodiments, the kit, composition, or RNA detection and/or quantification system comprises:
Suitably, the kit, composition, or RNA detection and/or quantification system comprises any combination of the first, second, third, fourth, fifth, and sixth padlock probes. Suitably, the kit, composition or RNA detection and/or quantification system comprises the first, second, and sixth padlock probes. Suitably, the kit, composition or RNA detection and/or quantification system comprises the first, second, third, fourth, fifth, and sixth padlock probes.
In some embodiments, the fifth padlock probe comprises terminal regions complementary to initiator methionine nuclear-encoded tRNA. Suitably, the fifth padlock probe comprises terminal regions consisting of SEQ ID NOs: 59 and 60. Suitably, the fifth padlock probe comprises or consists of the nucleotide sequence of SEQ ID NO: 65 or a variant with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 65.
In some embodiments, the sixth padlock probe comprises terminal regions complementary to methionine mt-tRNA. Suitably, the sixth padlock probe comprises terminal regions consisting of SEQ ID NOs: 61 and 62. Suitably, the sixth padlock probe comprises or consists of the nucleotide sequence of SEQ ID NO: 66 or a variant with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 66.
Multiplexed Detection and/or Quantification
The kit, composition, or RNA detection and/or quantification system may comprise a plurality of padlock probes for detecting and/or quantifying a plurality of RNAs of interest. Suitably, each padlock probe comprises a unique tag or barcode sequence for specific detection and/or quantification. Suitably, each padlock probe is specific for a unique target sequence.
The unique target sequences may be in different RNAs. For example, the kit, composition, or RNA detection and/or quantification system suitably comprises:
The unique target sequences may be in the same RNA. For example, the kit, composition, or RNA detection and/or quantification system suitably comprises:
In some embodiments:
Additional Components
The kit, composition, or RNA detection and/or quantification system may comprise any additional component, for example any additional component necessary to detect and/or quantify an RNA using a padlock probe of the present invention.
Primers
The kit, composition, or RNA detection and/or quantification system may comprise one or more primers. A “primer” is a short oligonucleotide that provides a starting point for DNA synthesis. The primer may be any suitable primer. Suitably, the one or more primers are complementary to the one or more primer binding sites present in the one or more padlock probes of the present invention which are present in the kit, composition, or RNA detection and/or quantification system.
In some embodiments the kit, composition, or RNA detection and/or quantification system of the present invention comprises: (i) one or more padlock probes of the invention comprising or consisting of from 5′ to 3′: a first terminal region; a first primer binding site; a second primer binding site; and a second terminal region; (ii) one or more first primers; and (iii) one or more second primers. Suitably, the first primer is a forward primer and the second primer is a reverse primer, or vice versa.
For example, the kit, composition, or RNA detection and/or quantification system of the present invention may comprise:
Suitably, one or more of the primers are primers according to US2012264630A1. For example, one or more the primers may consist of or comprise the following sequence, from its 5′ end to its 3′ end: 5′-(F1)n1-T1-(E1)m1-A1-3′, in which:
Suitably, one or more of the forward primers consists of or comprises the following sequence, from its 5′ end to its 3′ end: 5′-(F1)n1-T1-(E1)m1-A1-3′, in which:
Suitably, m1+m2 is equal to 1 or 2. Suitably, the spacer E1 and/or E2 is selected from the group constituted by an abasic site and a linear or branched, optionally substituted alkyl, alkenyl or alkynyl group, e.g. polyethylene glycol. Suitably, said tag is selected from the group constituted by a luminescent agent, a radioisotope, an enzyme, biotin, acrylamide, a thiol and a phosphorothioate.
Capture Probes
The kit, composition, or RNA detection and/or quantification system may comprise one or more capture probes. A “capture probe” is an oligonucleotide which can be used to specifically purify a RNA or DNA target molecule. The one or more capture probes may be any suitable capture probes. The one or more capture probes may specifically hybridise to one or more padlock probes of the invention. Suitably, the one or more capture probes are complementary to a nucleotide sequence of 10-50 nucleotides in one or more padlock probes of the invention.
Suitably, the one or more capture probes are modified oligonucleotides. Suitably, the modification can be used to specifically purify the RNA or DNA target molecule. Suitably, the capture probe is a biotinylated oligonucleotide.
The kit, composition, or RNA detection and/or quantification system may comprise one beads specific for the capture probe. For example, if the capture probe is a biotinylated oligonucleotide, then the beads may be coated with streptavidin. Suitably, the beads are magnetic beads.
For example, the kit, composition, or RNA detection and/or quantification system of the present invention may comprise:
DNA Ligase
The kit, composition, or RNA detection and/or quantification system may comprise one or more DNA ligases. A “DNA ligase” is an enzyme that facilitates the joining of DNA strands together by catalyzing the formation of a phosphodiester bond. The DNA ligase may be any suitable DNA ligase. Suitably, the DNA ligase can efficiently ligate the padlock probes of the invention when they are hybridised to the RNA.
A suitable DNA ligase is a Splint R ligase. Splint R Ligase (NCBI Reference Sequence: NP_048900.1) is also known as PBCV-1 DNA Ligase or Chlorella virus DNA and efficiently catalyzes the ligation of adjacent, single-stranded DNA splinted by a complementary RNA strand (Jin, J., et al., 2016. Nucleic acids research, 44(13), pp. e116-e116).
Suitably, a Splint R ligase comprises or consists of the amino acid sequence of SEQ ID NO: 85 or a variant with at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 85.
The kit, composition, or RNA detection and/or quantification system may comprise one or more DNA ligase buffers. A “DNA ligase buffer” may be a concentrated buffer which can be added to a composition to provide suitable conditions for activity of the corresponding DNA ligase. Suitably, a DNA ligase buffer is 5× to 100× concentrated.
Suitably, the buffer is a Splint R ligase buffer. Suitably, a 1× Splint R ligase buffer comprises 1 μM-1.5 mM ATP or 10 μM 1 mM ATP and/or has a pH of 7.5-8.0, e.g. pH 7.5 or pH 7.6 at 25° C. Suitably, a 1× Splint R ligase buffer comprises 66 mM Tris-HCl, 10 mM MgCl2, 1 mM ATP, 1 mM DTT, 7.5% PEG or 50 mM Tris-HCl, 10 mM MgCl2, 1 mM ATP, 10 mM DTT.
Exonucleases
The kit, composition, or RNA detection and/or quantification system may comprise one or more exonucleases. An “exonuclease” is an enzyme which removes successive nucleotides from the end of a polynucleotide molecule. The exonuclease may be any exonuclease suitable for exonuclease digestion of non-circularised padlock probes. Suitable exonucleases will be well known to those of skill in the art. Suitable exonucleases include Exo I, Lambda Exo and T5 Exo.
Amplification, Detection, and Quantification Reagents
The kit, composition, or RNA detection and/or quantification system may comprise one or more DNA polymerases. A “DNA polymerase” is an enzyme that synthesizes DNA molecules from nucleoside triphosphates. During this process, DNA polymerase reads the existing DNA strand and creates a new strand. The DNA polymerase may be any suitable DNA polymerase. Suitable DNA polymerases will be well known to those of skill in the art. A suitable DNA polymerase is a Bst DNA polymerase Warmstart.
The kit, composition, or RNA detection and/or quantification system may comprise one or more amplification buffers. An “amplification buffer” may be a concentrated buffer which can be added to a composition to provide suitable conditions for activity of a DNA polymerase. Suitably, an amplification buffer is 5× to 100× concentrated. Suitable amplification buffers will be well known to those of skill in the art.
The kit, composition, or RNA detection and/or quantification system may comprise one or more deoxynucleoside triphosphate (dNTP) mixes. A “dNTP mix” may be a premixed aqueous solutions of dATP, dCTP, dGTP and dTTP suitable for activity of a DNA polymerase. Suitably, a dNTP mix comprises 10 mM of each of dATP, dCTP, dGTP and dTTP.
The kit, composition, or RNA detection and/or quantification system may comprise one or more DNA dyes. A “DNA dye” may be any dye which binds to double-stranded DNA, which increases the fluorescence quantum yield of the dye. An increase in DNA product during amplification therefore leads to an increase in fluorescence intensity. The DNA dye may be any suitable DNA dye. Suitable DNA dyes will be well known to those of skill in the art. A suitable DNA dye is SYBR Green I.
In some embodiments, the kit, composition, or RNA detection and/or quantification system comprises one or more DNA polymerases, one or more amplification buffers, one or more dNTP mixes, and one or more DNA dyes.
Reference Samples
The kit, composition, or RNA detection and/or quantification system may comprise one or more reference samples. For example, the kit, composition, or RNA detection and/or quantification system may comprise one or more reference samples comprising a known RNA at a known concentration. The reference samples may be suitable for calibration when quantifying an RNA of interest.
Reference samples may be prepared by any suitable method known to those of skill in the art. For example, a reference sample may be prepared by a method comprising: (i) obtaining a reference DNA template encoding an RNA; (ii) transcribing the reference DNA to obtain a transcribed RNA; and (iii) isolating and purifying the transcribed RNA.
Components for Detection and/or Quantification of Corresponding Genes
When the kit, composition, or RNA detection and/or quantification system comprises one or more padlock probes for detecting a gene which encodes an RNA of interest, the kit, composition, or RNA detection and/or quantification system may comprise any additional components necessary to detect and/or quantify the gene which encodes an RNA of interest.
The kit, composition, or RNA detection and/or quantification system may comprise one or more primers specific for one or more primer binding sites on the padlock probes for detecting and/or quantifying a gene which encodes an RNA of interest.
In some embodiments the kit, composition, or RNA detection and/or quantification system of the present invention comprises: (i) one or more padlock probes for detecting and/or quantifying a gene which encodes an RNA of interest comprising or consisting of from 5′ to 3′: a first terminal region; a first primer binding site; a second primer binding site; and a second terminal region; (ii) one or more first primers; and (iii) one or more second primers. Suitably, the first primer is a forward primer and the second primer is a reverse primer, or vice versa.
For example, the kit, composition, or RNA detection and/or quantification system of the present invention may comprise:
The kit, composition, or RNA detection and/or quantification system may comprise one or more capture probes specific for the one or more padlock probes for detecting and/or quantifying a gene which encodes an RNA of interest.
For example, the kit, composition, or RNA detection and/or quantification system of the present invention may comprise:
The kit, composition, or RNA detection and/or quantification system may comprise one or more DNA ligases which can efficiently ligate padlock probes when they are hybridised to DNA. Suitable DNA ligases will be well known to those of skill in the art. A suitable DNA ligase is a Taq DNA ligase.
Methods of Detecting and/or Quantifying RNA
The padlock probes of the present invention may be used to detect RNA, for example RNA comprising the target site. The padlock probes of the present invention may also be used to quantify said RNA.
In one aspect, the present invention provides for use of a padlock probe for detecting and/or quantifying a RNA. The RNA may be any RNA described herein. The padlock probe may be any padlock probe described herein.
In one aspect, the present invention provides a method of detecting and/or quantifying a RNA using one or more padlock probes. The RNA may be any RNA described herein. The padlock probe may be any padlock probe described herein.
The method may comprise:
Step (a): Providing a Sample Comprising One or More RNAs
Methods to obtain a sample comprising one or more RNAs will be well known to those of skill in the art. Suitably, RNA may be extracted from one or more tissues and subsequently purified and/or isolated to provide a purified sample comprising one or more RNAs.
The sample comprising one or more RNAs may be an RNA sample. As used herein, an “RNA sample” is a sample consisting essentially of RNA and an aqueous medium (e.g. water or a buffer). An RNA sample may consist of RNA and an aqueous medium (e.g. water or a buffer). An RNA sample may comprise essentially no DNA, i.e. an RNA sample may contain non-detectable amounts of DNA. Suitably, the sample is a purified and/or isolated RNA sample.
The sample comprising one or more RNAs may be obtained or obtainable from urine, muscle, blood and/or saliva. Suitably the sample is obtained or obtainable from urine. Urine is an attractive tissue for diagnostics since it is obtained in a fully non-invasive manner. Suitably the sample is obtained or obtainable from blood.
The method may further comprise a step of extracting, purifying and/or isolating the sample from tissue. As used herein, “extracting” may be a step of extracting nucleic acids from a tissue to obtain a crude sample comprising one or more RNAs. As used herein, “purifying” or “isolating” may be a step of separating the desired nucleic acids from a crude sample to provide an RNA sample. Any suitable method may be used and commercial kits for extracting, purifying and/or isolating RNA are available. Suitably, PAXgene blood RNA tubes may be used according to the commercial protocol. Suitably, the sample can be transported on dry ice and/or stored at −80° C.
The method may further comprise a step of modifying the RNA.
Suitably, if the RNA is tRNA, an oligonucleotide is ligated to the 3′ end of the tRNA. This ligation is possible only if the tRNA is not AA-charged, since otherwise the 3′ end of the tRNA is not free but bound to the amino acid.
The oligonucleotide is suitably a heteroduplex composed of an RNA oligonucleotide and a DNA oligonucleotide (i.e. an RNA/DNA heteroduplex) or a DNA oligonucleotide duplex. Suitably, the (hetero)duplex is formed from two complementary oligonucleotides and/or comprises a TGG-3′ overhang. Optionally, the oligonucleotide is an RNA/DNA heteroduplex comprising a TGG-3′ overhang of the DNA oligonucleotide. For example, the oligonucleotide may consist of: (i) a DNA oligonucleotide consisting of from 5′ to 3′: (N)x-TGG, where x is 1 to 100, 5 to 50, or 10 to 30; and (ii) a RNA oligonucleotide comprising or consisting of the reverse complement of (N)x. Using such a RNA/DNA heteroduplex is advantageous because it can lead to a ligation product that is entirely composed of RNA.
Conditions suitable for ligation of an oligonucleotide to the 3′ end of the tRNA will be well known to those of skill in the art. Suitably, if the oligonucleotide is a RNA/DNA heteroduplex comprising a TGG-3′ overhang of the DNA oligonucleotide, an RNA polymerase may be used (e.g. T4 RNA ligase 2).
Step (b): Hybridising One or More Padlock Probes to the One or More RNAs to Obtain One or More Hybridised Padlock Probes
Suitable methods for hybridisation will be well known to those of skill in the art. Suitably, the one or more padlock probes may be mixed with the one or more RNAs in a hybridisation buffer and incubated to obtain one or more RNA/padlock probe complexes. The padlock probes may be any padlock probe of the invention.
Any suitable amount of padlock probe and RNA may be used. Suitably, about 0.1 pmol of padlock probe and about 40-60 ng of RNA template can be mixed.
Any suitable hybridisation buffer may be used. The hybridisation buffer may be a 1×DNA ligase buffer, for example a 1× Splint R ligase buffer.
Any suitable incubation may be used. For example, a temperature ramp may be used in which the nucleic acids are heated to a high temperature to denature the nucleic acids, followed by a gradual decrease in temperature to enable specific hybridisation of the padlock probe to the target site. Suitably, a temperature ramp of between about 97° C. and about 37° C., with about a 5° C. decrement every 2 minutes may be used.
Optionally, the method may comprise a step of purifying the one or more RNA/padlock probe complexes prior to ligation of the padlock probe. Suitable methods for purifying the one or more RNA/circularised padlock probe complexes will be well known to those of skill in the art (e.g. the method described in Zhang, D. Y., et al., 1998. Gene, 211(2), pp. 277-285 or Smith, J. H. and Beals, T. P., 2013. PloS one, 8(5), p. e65053). For example, the one or more RNA/padlock probe complexes can be purified by bead separation, optionally magnetic bead separation e.g. by the method described below.
Step (c): Circularising the One or More Hybridised Padlock Probes to Obtain One or More Circularised Padlock Probes
Suitable methods for circularising the padlock probes will be well known to those of skill in the art. Suitably, the one or more hybridised padlock probes (i.e. RNA/padlock probe complexes, optionally purified) are mixed with one or more DNA ligases in a DNA ligase buffer and incubated to obtain one or more circularised padlock probes (i.e. RNA/circularised padlock probe complexes).
Any suitable DNA ligases and corresponding DNA ligase buffers may be used. The DNA ligase may be a Splint R ligase and the DNA ligase buffer may be 1× Splint R ligase buffer. Optionally, the DNA ligase buffer is the same as the hybridisation buffer.
Any suitable incubation may be used. Suitably, the hybridised probes may be incubated with the DNA ligase at about 37° C. for about 45 minutes or less.
Step (d): Purifying the One or More Circularised Padlock Probes
Suitable methods for purifying the one or more circularised padlock probes will be well known to those of skill in the art.
In some embodiments, the one or more circularised padlock probes are purified by bead separation, optionally magnetic bead separation. Suitable methods will be well known to those of skill in the art (e.g. the method described in Zhang, D. Y., et al., 1998. Gene, 211(2), pp. 277-285 or Smith, J. H. and Beals, T. P., 2013. PloS one, 8(5), p. e65053). Suitably the following steps may be used:
In some embodiments, non-circularised padlock probes and other nucleic acids (e.g. the RNA and capture probes) are digested. Suitably, the digestion may occur after bead separation. Suitable methods will be well known to those of skill in the art. Suitably, one or more exonucleases are mixed with the sample containing the circularised padlock probes (e.g. a sample containing eluted RNAs, circularised padlock probes, and capture probes) in a suitable buffer and incubated. Exonucleases will not digest the circularised padlock probes. The one or more exonuclease may be any suitable exonucleases, for example those described herein. Suitably, the buffer is a 1× amplification buffer. Suitably the incubation is for about 90 minutes at about 37° C. The one or more exonucleases should be inactivated prior to step (e), for example by incubation for about 20 minutes at about 80° C.
Step (e): Amplifying the One or More Circularised Padlock Probes to Obtain Amplified Padlock Probes
Suitable methods for amplifying the one or more circularised padlock probes will be well known to those of skill in the art. Suitably, the one or more circularised padlock probes are mixed with one or more primers, one or more DNA polymerases, and one or more dNTP mixes in an amplification buffer, and incubated. Any suitable primers, DNA polymerase, dNTP mixes and amplification buffers may be used, for example those described herein.
Suitably, the one or more padlock probes are amplified by rolling circle amplification (RCA), particularly when the padlock probe comprises one primer binding site. Suitable conditions will be well known to those of skill in the art and are described in Ali, M. M., et al., 2014. Chemical Society Reviews, 43(10), pp. 3324-3341.
The RCA may be exponential RCA. By employing multiple primers that hybridise with the same circle, multiple amplification events can be initiated producing multiple RCA products. Exponential RCA can be used to improve the sensitivity of detection. For example, the exponential RCA may be multi-primed RCA or hyper-branched RCA (HRCA).
Suitably, the one or more padlock probes are amplified by HRCA, particularly when the padlock probe comprises at least one forward primer binding site and at least one reverse primer binding site. In HRCA the RCA product produced using the forward primer is used as the template for further amplification with the reverse primer.
Suitable incubation conditions will be well known to those of skill in the art. For example, if Bst DNA polymerase Warmstart is used, the incubation may be at about 65° C. for about 1 hour.
Step (f): Detecting the Amplified Padlock Probes.
Suitable methods for detecting the amplified circularised padlock probes will be well known to those of skill in the art. Suitably, the amplified circularised padlock probe is mixed with a DNA dye and the presence of the amplified padlock probe is detected by an increase in fluorescence. Any suitable DNA may be used, for example those described herein.
Suitably, the DNA dye is added during amplification, such that the amplified padlock probe (e.g. the increase in fluorescence) can be detected in real-time.
Suitably, the amplified circularised padlock probes are detected using a label-free detection method e.g. using biosensors comprising field-effect transistors. For example, by a method described in US2006246443A1 or US2006011911A1.
Suitably, the amplified circularised padlock probes may be detected using a sensor consisting of a network of field-effect transistors (T1, T2 etc.), each of which has a source region (S), a drain region (D), and a gate region which forms an active zone. The method using said sensor may comprise:
Step (g): Quantifying the One or More RNAs
Suitable methods for quantifying the one or more RNAs will be well known to those of skill in the art. Suitably, the fluorescence intensity of the amplified padlock probes is compared to one or more reference samples containing a known concentration of RNA.
Suitably, the one or more RNAs are quantified by quantitative polymerase chain reaction (qPCR). Real-time detection of fluorescence intensity can be used quantitatively. For example, if the increase in fluorescence is detected in real-time, the fluorescence signal will increase with time, and finally saturate. The increase occurs more rapidly for higher RNA concentrations. The time at which the relative signal passes a predetermined threshold can therefore be used to quantify the RNA concentration. qPCR is described in Taylor, S. C., et al., 2019. Trends in biotechnology, 37(7), pp. 761-774.
Methods of Detecting and/or Quantifying RNA Mutations
The padlock probes of the present invention may be used to detect and/or quantify mutant RNA.
In one aspect, the present invention provides for use of a padlock probe for detecting and/or quantifying a mutant RNA.
In a related aspect, the present invention provides a method of detecting and/or quantifying a mutant RNA using one or more padlock probes.
The mutant RNA may be any mutant RNA described herein. The padlock probe may be any padlock probe described herein. Suitably, the padlock probe hybridises to a target region comprising the mutation. Suitably, one of the terminal regions hybridises to a target region comprising the mutation.
The method may comprise:
Suitably, step (g) may be used to determine the concentration of mutant RNAs in the sample (cmut).
Suitably, the sample further comprises one or more wild-type RNAs (i.e. RNAs corresponding to the one or more mutant RNAs but not comprising the mutation). Suitably, the method may further comprise:
The padlock probe may be any padlock probe described herein. Suitably, the padlock probe hybridises to a target region comprising the mutation site. Suitably, one of the terminal regions hybridises to a target region comprising the mutation site. Suitably, the padlock probe terminal regions are identical to those used in step (b), except the nucleotide corresponding to the mutation site, which is complementary to the wild-type sequence.
Suitably, step (g′) may be used to determine the concentration of wild-type RNAs in the sample (cwt).
Suitably, step (g) and step (g′) may be used to calculate the RNA mutation load, cmut/(cwt+cmut). The RNA mutation load corresponds to the percentage of RNA molecules in the sample which were mutant RNAs, i.e. comprised the mutation. This may also be referred to as the RNA heteroplasmy level, or RNA mutation frequency.
Alternatively, the method may further comprise:
The padlock probe may be any padlock probe described herein. Suitably, the padlock probe hybridises to a target region which does not comprise the mutation site. Suitably, both terminal regions hybridise to target regions which do not comprise the mutation site.
Suitably, step (g′) may be used to determine the concentration of total RNA (i.e. mutant and wild-type) in the sample (ctotal=cwt+cmut).
Suitably, step (g) and step (g′) may be used to calculate the RNA mutation load, cmut/ctotal.
Suitably, the sample further comprises one or more reference RNAs. The reference RNAs may be any suitable reference RNA, for example those described herein. Suitably, the method may further comprise:
The padlock probe may be any padlock probe described herein.
Suitably, step (g″) may be used to determine the concentration of reference RNAs in the sample (cref). Suitably, this may be used to calculate the relative quantity of wild-type RNA molecules, cwt/cref and/or the relative quantity of mutant molecules, cmut/cref.
Suitably, the method may further comprise detecting and/or quantifying the corresponding genes encoding the RNA. Suitably, the method may further comprise:
Methods to obtain a sample comprising one or more polynucleotides encoding the mutant RNA will be well known to those of skill in the art. Suitably, polynucleotides may be extracted from one or more tissues and subsequently purified and/or isolated to provide a purified sample comprising one or more polynucleotides encoding the mutant RNA.
The sample comprising one or more polynucleotides encoding the mutant RNA may be a DNA sample. As used herein, a “DNA sample” is a sample consisting essentially of DNA and an aqueous medium (e.g. water or a buffer). A DNA sample may consist of DNA and an aqueous medium (e.g. water or a buffer). A DNA sample may comprise essentially no RNA, i.e. a DNA sample may contain non-detectable amounts of RNA. Suitably, the sample is a purified and/or isolated DNA sample.
The sample comprising one or more polynucleotides encoding the mutant RNA may be obtained or obtainable from urine, muscle, blood and/or saliva. The method may further comprise a step of extracting, purifying and/or isolating the sample from tissue. Any suitable method may be used and commercial kits for extracting, purifying and/or isolating DNA are available. Suitably, PAXgene blood DNA tubes may be used according to the commercial protocol. Suitably, the sample can be transported on dry ice and/or stored at −80° C.
The padlock probe may be any padlock probe described herein for detecting and/or quantifying the gene which encodes the RNA. Suitably, the padlock probe hybridises to a target region comprising the mutation. Suitably, one of the terminal regions hybridises to a target region comprising the mutation.
Suitably, step (g′″) may be used to determine the concentration of mutant RNA genes in the sample (c′mut).
Suitably, the sample comprises one or more polynucleotides encoding the wild-type RNA.
Suitably, the method may further comprise:
The padlock probe may be any padlock probe described herein for detecting and/or quantifying the gene which encodes the RNA. Suitably, the padlock probe hybridises to a target region comprising the mutation site. Suitably, one of the terminal regions hybridises to a target region comprising the mutation site. Suitably, the padlock probe terminal regions are identical to those used in step (b), except the nucleotide corresponding to the mutation site, which is complementary to the wild-type sequence.
Suitably, step (g″″) may be used to determine the concentration of wild-type RNA genes in the sample (c′wt).
Suitably, step (g′″) and step (g″″) may be used to calculate the DNA mutation load, c′mut/(c′wt+c′mut). The DNA mutation load corresponds to the percentage of DNA molecules in the sample which encoded mutant RNAs. This may also be referred to as the DNA heteroplasmy level, or DNA mutation frequency.
Methods of Detecting Aberrant RNA Post-Transcriptional Modifications
The padlock probes of the present invention may be used to detect and/or quantify aberrantly-modified RNA (i.e. RNA with aberrant post-transcriptional modifications).
In one aspect, the present invention provides for use of a padlock probe for detecting and/or quantifying an aberrantly-modified RNA.
In a related aspect, the present invention provides a method of detecting and/or quantifying an aberrantly-modified RNA using one or more padlock probes.
The aberrantly-modified RNA may be any aberrantly-modified RNA described herein. The padlock probe may be any padlock probe described herein. Suitably, the padlock probe hybridises to a target region comprising the aberrant post-transcriptional modification.
Suitably, one of the terminal regions hybridises to a target region comprising the aberrant post-transcriptional modification.
The method may comprise:
Suitably, step (g) may be used to determine the concentration of aberrantly-modified RNAs in the sample (cmut).
Suitably, the sample further comprises one or more wild-type RNAs (i.e. RNAs corresponding to the one or more aberrantly-modified RNAs but not comprising the aberrantly-modified). Suitably, the method may further comprise:
The padlock probe may be any padlock probe described herein. Suitably, the padlock probe hybridises to a target region which does not comprise the aberrant modification site.
Suitably, both terminal regions hybridise to target regions which do not comprise the aberrant modification site.
Suitably, step (g′) may be used to determine the concentration of total RNA in the sample (ctotal).
Suitably, step (g) and step (g′) may be used to calculate the RNA aberrant-modification load, cmut/ctotal. The RNA aberrant-modification load corresponds to the percentage of RNA molecules in the sample which were aberrantly modified.
Suitably, the sample further comprises one or more reference RNAs. The reference RNAs may be any suitable reference RNA, for example those described herein. Suitably, the method may further comprise:
The padlock probe may be any padlock probe described herein.
Suitably, step (g″) may be used to determine the concentration of reference RNAs in the sample (cref). Suitably, this may be used to calculate the relative quantity of aberrantly-modified molecules, cmut/cref.
Methods of Detecting tRNA Amino Acid-Charging
The padlock probes of the present invention may be used to detect and/or quantify tRNA amino acid-charging.
In one aspect, the present invention provides for use of a padlock probe for detecting and/or quantifying tRNA amino acid-charging.
In a related aspect, the present invention provides a method of detecting and/or quantifying tRNA amino acid-charging using one or more padlock probes.
The tRNA may be any tRNA described herein comprising a 3′ ligated oligonucleotide. To detect and/or quantify tRNAs which are not amino acid-charged, an oligonucleotide can be ligated to the 3′ end of the tRNA.
The oligonucleotide is suitably a heteroduplex composed of an RNA oligonucleotide and a DNA oligonucleotide (i.e. an RNA/DNA heteroduplex) or a DNA oligonucleotide duplex. Suitably, the (hetero)duplex is formed from two complementary oligonucleotides and/or comprises a TGG-3′ overhang. Optionally, the oligonucleotide is an RNA/DNA heteroduplex comprising a TGG-3′ overhang of the DNA oligonucleotide. For example, the oligonucleotide may consist of: (i) a DNA oligonucleotide consisting of from 5′ to 3′: (N)x-TGG, where x is 1 to 100, 5 to 50, or 10 to 30; and (ii) a RNA oligonucleotide comprising or consisting of the reverse complement of (N)x. Using such a RNA/DNA heteroduplex is advantageous because it can lead to a ligation product that is entirely composed of RNA.
The oligonucleotide can be ligated using any suitable method. For example, if the oligonucleotide is a RNA/DNA heteroduplex comprising a TGG-3′ overhang of the DNA oligonucleotide, an RNA polymerase may be used (e.g. T4 RNA ligase 2). The padlock probe hybridises to a target region comprising at least part of the nucleotide sequence of the 3′ ligated oligonucleotide. Suitably, a terminal region is complementary to at least part of the nucleotide sequence of the 3′ ligated oligonucleotide.
The method may comprise:
The method of may further comprise a step of ligating a oligonucleotide the 3′ end of one or more tRNAs. Suitably, step (a) may comprise:
This ligation is possible only if the tRNA is not AA-charged, since otherwise the 3′ end of the tRNA is not free but bound to the amino acid.
Suitably, step (g) may be used to determine the concentration of tRNAs in the sample which are not charged with an amino acid (cmut).
Suitably, the sample in step (a) further comprises one or more tRNAs which do not comprise a 3′ ligated oligonucleotide. For example, the sample in step (a1) further comprises one or more tRNAs which are amino-acid charged and thus the oligonucleotide is not ligated to the 3′ end of the tRNA. Suitably, the method may further comprise:
The padlock probe may be any padlock probe described herein. Suitably, the padlock probe hybridises to a target region which does not comprise the 3′ ligated oligonucleotide. Suitably, both terminal regions hybridise to target regions which do not comprise the 3′ ligated oligonucleotide.
Suitably, step (g′) may be used to determine the concentration of total tRNA in the sample (ctotal).
Suitably, step (g) and step (g′) may be used to calculate the tRNA amino acid charging percentage, cmut/ctotal. The tRNA amino acid charging percentage corresponds to the percentage of tRNA molecules in the sample which were not charged with an amino acids.
Suitably, the sample further comprises one or more reference tRNAs. The reference tRNAs may be any suitable reference tRNA, for example those described herein. Suitably, the method may further comprise:
The padlock probe may be any padlock probe described herein.
Suitably, step (g″) may be used to determine the concentration of reference tRNAs in the sample (cref). Suitably, this may be used to calculate the relative quantity of tRNA molecules in the sample which were not charged with an amino acids, cmut/cref.
Currently, a technique for identifying and quantifying tRNA amino-acid charging does not exist. The padlock probe or method of the invention may be used to determine the tRNAome of an organism or a type of cell (e.g. an expression host). Consequently, tRNAomes can be used to adapt the codons used when expressing an exogenous protein and improve the expression of the exogenous protein. The padlock probe or method of the invention therefore may be used to codon-optimise a nucleotide encoding a protein of interest and/or optimise an in vitro translation system.
Multiplexed Detection
A plurality of padlock probes of the present invention may be used to detect and/or quantify a plurality of RNAs. Suitably, each padlock probe comprises a unique tag or barcode sequence for specific detection and/or quantification. Suitably, each padlock probe is specific for a unique target sequence.
The unique target sequences may be in the same RNA. For example, the method may comprise:
The unique target sequences may be on different RNAs. For example, the method may comprise:
Steps (e)-(g) may be carried out as described in US2012264630A1, e.g. by amplifying the circularised padlock probes using one or more primers according to US2012264630A1 (as described above).
Methods of Diagnosing Disease
The detection and/or quantification of RNA may be used to detect, diagnose and/or assess the clinical severity of a RNA-associated disease in a subject. Suitably, the subject is a human subject.
Accordingly, in one aspect the present invention provides for use of a padlock probe, suitably a padlock probe of the invention, in detecting, diagnosing and/or assessing the clinical severity of a RNA-associated disease.
RNA-Associated Disease
In higher eukaryotes, many different RNAs are encoded in and transcribed from genomic DNAs. Transcribed RNA molecules undergo multiple post-transcriptional processes such as splicing, editing, modification, translation, and degradation. A defect, mis-regulation, or malfunction of these processes often results in diseases in humans, and recently they have been referred to as “RNA diseases” or “RNA-associated diseases” (Kataoka, N., Mayeda, A. and Ohno, K., 2019. Frontiers in molecular biosciences, 6, p. 53).
Pre-mRNA splicing is one of the major processes for post-transcriptional regulatory steps in eukaryotes. Defects in this step result in many diseases, such as neuromuscular diseases and cancers. Long non-coding RNAs (lncRNAs) also play important roles in gene expression in cells. Some of them can be good markers for cancer progression. (Kataoka, N., Mayeda, A. and Ohno, K., 2019. Frontiers in molecular biosciences, 6, p. 53).
The padlock probe of the present invention can be used to detect and/or quantify mutant or aberrantly-modified RNA and therefore have utility in detecting, diagnosing and/or assessing the clinical severity of RNA-associated diseases.
A summary of representative disease-causing RNA mutations is provided by Cooper, T. A., Wan, L. and Dreyfuss, G., 2009. Cell, 136(4), pp. 777-793. Diseases associated with RNA mutations are shown in Table 8 below. Suitably, the RNA-associated disease may be selected from one or more of the diseases listed in Table 8.
tRNA-Associated Disease
The detection and/or quantification of RNA may be used to detect, diagnose and/or assess the clinical severity of a tRNA-associated disease in a subject. Suitably, the subject is a human subject.
Accordingly, in one aspect the present invention provides for use of a padlock probe, suitably a padlock probe of the invention, in detecting, diagnosing and/or assessing the clinical severity of a tRNA-associated disease.
Pathological mutations in tRNA genes and tRNA processing enzymes are numerous and result in very complicated clinical phenotypes. Multiple mutations in individual tRNA genes have been associated with multiple diseases, and individual diseases have been found to be caused by mutations in one of several tRNAs (Abbott, J. A., Francklyn, C. S. and Robey-Bond, S. M., 2014. Frontiers in genetics, 5, p. 158).
Suitably, a tRNA-associated disease is caused by a mutation in a gene encoding a tRNA and/or a mutation in a gene encoding a tRNA processing or modifying enzyme. Genetic disorders in which tRNA alterations are thought to play a direct part can be classified into two groups of pathogenic mutations: within the tRNAs or in the tRNA processing and modifying enzymes. Additionally, many diseases that do not have a direct mutational link to tRNAs and their associated enzymes nevertheless display alterations in tRNA pools, albeit usually as a secondary effect of the altered disease biology (S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015)).
Suitably, a tRNA-associated disease is caused by a mutation in a gene encoding a tRNA. For all the identified cases in which human disease are directly linked to mutations in tRNAs, these mutations occur in mt-tRNAs (S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015)).
The padlock probe of the invention can be used to assess the heteroplasmy level in one or more different tissue. Suitably, the tRNA-associated disease is heteroplasmic. The disease phenotypes of many of tRNA-based pathologies are potentiated in a tissue-specific manner (S. Kirchner, Z. Ignatova, Nat. Rev. Genet. 16, 98-112 (2015)). The mutation load (also called heteroplasmy level, or mutation frequency) can be measured on different tissues, the most studied ones being urine, muscle and blood. Mutation loads observed in the different tissues are correlated, but not identical (A. Lombes et al, Biochemie 100, 171-176 (2014); J. P. Grady et al, EMBO Mol Med. 10, e8262 (2018)).
The data on the relation between the mutation load measured on mtDNA and the clinical severity are under debate (P. F. Chinnery et al, Brain 120, 1713-1721 (1997) and S. J. Pickett et al, Ann Clin Transl Neurol. 5, 333-45 (2018)). The heteroplasmy level can vary considerably from one individual to another. There is a relation between mutation load and clinical severity, which seems to be stronger if the mutation load is measured in blood, as compared to urine or muscle tissues.
Suitably, the tRNA-associated disease is a mt-tRNA-associated disease. Diseases associated with mt-tRNA mutations are catalogued in variety of databases, such as MITOMAP and Online Mendelian Inheritance in Man (OMIM).
Suitably, the mt-tRNA disease is selected from one or more of: a mitochondrial encephalomyopathy, a mitochondrial cardiomyopathy, mitochondrial myopathy deafness/sensorineural hearing loss, or diabetes mellitus. Suitably, the mt-tRNA disease is a mitochondrial encephalomyopathy.
Mitochondrial encephalomyopathies are a group of clinically, genetically, and biochemically heterogeneous dis-orders caused by a defect in the oxidative phosphorylation pathway of the respiratory chain. Examples include MELAS syndrome and MERRF syndrome (DiMauro, S. and Hirano, M., 2005. Neuromuscular disorders, 15(4), pp. 276-286.).
Mitochondrial cardiomyopathies are myocardial disorders characterized by abnormal myocardial structure and/or function secondary to genetic defects resulting in the impairment of the mitochondrial respiratory chain, in the absence of concomitant coronary artery disease, hypertension, valvular disease, and congenital heart disease (El-Hattab, A. W. and Scaglia, F., 2016. Frontiers in cardiovascular Medicine, 3, p. 25).
Mitochondrial myopathies are clinically heterogeneous disorders that can affect multiple systems besides skeletal muscle and are usually defined by morphological abnormalities of muscle mitochondria (DiMauro, S., et al., 1985. Annals of Neurology: Official Journal of the American Neurological Association and the Child Neurology Society, 17(6), pp. 521-538).
Deafness is a degree of hearing loss such that a person is unable to understand speech, even in the presence of amplification. Sensorineural hearing loss is a type of hearing loss in which the root cause lies in the inner ear or sensory organ (cochlea and associated structures) or the vestibulocochlear nerve (cranial nerve VIII).
Diabetes mellitus (DM), commonly known as diabetes, is a group of metabolic disorders characterized by a high blood sugar level over a prolonged period of time.
Suitably, the mt-tRNA disease is selected from one or more of: Alzheimer's Disease and Parkinson Disease (ADPD); Ataxia, Myopathy, and Deafness (AMDF); Chronic Progressive Ophthalmoplegia (CPEO); Deafness/Sensorineural Hearing Loss (DEAF/SNHL); Dementia and Chorea (DEMCHO); Diabetes Mellitus (DM); Diabetes Mellitus and Deafness (DMDF); exercise intolerance (EXIT); Focal Segmental Glomerulosclerosis (FSGS); Gastrointestinal Reflux (GER); Leber Hereditary Optic Neuropathy (LHON); Leigh Syndrome (LS); Myoclonic Epilepsy and Ragged Red Fiber disease (MERRF); Mitochondrial Encephalomyopathy, Lactic acidosis and Stroke-like episodes (MELAS); Maternally Inherited Cardiomyopathy (MICM); Maternally Inherited Diabetes and Deafness (MIDD); Maternally Inherited Leigh Syndrome (MILS); mitochondrial myopathy (MM); Mitochondrial Myopathy and Cardiomyopathy (MMC); Mitochondrial NeuroGastrolntestinal Encephalopathy (MNGIE); Progressive Encephalomyopathy (PEM); Retinitis Pigmentosa (RP); Sudden Infant Death Syndrome (SIDS); Sensorineural Hearing Loss (SNHL).
Suitably, the mt-tRNA disease is MERRF or MELAS.
MELAS Syndrome
The detection and/or quantification of RNA may be used to detect, diagnose and/or assess the clinical severity of MELAS syndrome in a subject. Suitably, the subject is a human subject.
Accordingly, in one aspect the present invention provides for use of a padlock probe, suitably a padlock probe of the invention, in detecting, diagnosing and/or assessing the clinical severity of MELAS syndrome.
MELAS syndrome (Mitochondrial Encephalopathy, Lactic Acidosis and Stroke-like episodes) is a mitochondrial cytopathy associated with mutations of mitochondrial DNA.
MELAS syndrome affects many of the body's systems, particularly the brain and nervous system (encephalo-) and muscles (myopathy). The signs and symptoms of this disorder most often appear in childhood following a period of normal development, although they can begin at any age. Early symptoms may include muscle weakness and pain, recurrent headaches, loss of appetite, vomiting, and seizures. Most affected individuals experience stroke-like episodes beginning before age 40. These episodes often involve temporary muscle weakness on one side of the body (hemiparesis), altered consciousness, vision abnormalities, seizures, and severe headaches resembling migraines. Repeated stroke-like episodes can progressively damage the brain, leading to vision loss, problems with movement, and a loss of intellectual function (dementia).
Most people with MELAS have a buildup of lactic acid in their bodies, a condition called lactic acidosis. Increased acidity in the blood can lead to vomiting, abdominal pain, extreme tiredness (fatigue), muscle weakness, and difficulty breathing. Less commonly, people with MELAS may experience involuntary muscle spasms (myoclonus), impaired muscle coordination (ataxia), hearing loss, heart and kidney problems, diabetes, and hormonal imbalances.
MERRF Syndrome
The detection and/or quantification of RNA may be used to detect, diagnose and/or assess the clinical severity of MERRF syndrome in a subject. Suitably, the subject is a human subject.
Accordingly, in one aspect the present invention provides for use of a padlock probe, suitably a padlock probe of the invention, in detecting, diagnosing and/or assessing the clinical severity of MERRF syndrome.
MERRF syndrome (Myoclonic Epilepsy with Ragged Red Fibers) is a mitochondrial encephalomyopathy characterized by myoclonic seizures. The clinical diagnosis of MERRF is based on the following four “canonic” features: myoclonus, generalized epilepsy, ataxia, and ragged red fibers (RRF) in the muscle biopsy.
MERRF is characterized by muscle twitches (myoclonus), weakness (myopathy), and progressive stiffness (spasticity). When the muscle cells of affected individuals are stained and viewed under a microscope, these cells usually appear abnormal. These abnormal muscle cells are called ragged-red fibers. Other features of MERRF include recurrent seizures (epilepsy), difficulty coordinating movements (ataxia), a loss of sensation in the extremities (peripheral neuropathy), and slow deterioration of intellectual function (dementia). People with this condition may also develop hearing loss or optic atrophy, which is the degeneration (atrophy) of nerve cells that carry visual information from the eyes to the brain. Affected individuals sometimes have short stature and a form of heart disease known as cardiomyopathy. Less commonly, people with MERRF develop fatty tumors, called lipomas, just under the surface of the skin.
Due to the multiple symptoms presented by the individual, the severity of MERRF syndrome is currently very difficult to evaluate.
Methods of Detecting, Diagnosing and/or Assessing the Clinical Severity of a RNA-Associated Disease
A method of detecting and/or quantifying a RNA, as described herein may be used to detect, diagnose and/or assess the clinical severity of a RNA-associated disease. For example, the methods of detecting and/or quantifying a RNA may comprise a further step (h), of detecting, diagnosing and/or assessing the clinical severity of a RNA-associated disease based on the detection and or quantification of RNA in the sample.
A method of detecting, diagnosing and/or assessing the clinical severity of a RNA-associated disease may comprise detecting and/or quantifying mutant or aberrantly-modified RNAs in a sample. The RNA may be any RNA described herein, particularly a tRNA or a mt-tRNA. The sample may be any sample described herein, particularly a sample obtained or obtainable from urine, muscle, blood and/or saliva.
Suitably, the method of detecting, diagnosing and/or assessing the clinical severity of a RNA-associated disease may comprise:
Suitably, cwt and cmut are determined by a method described herein.
The RNA mutation load or aberrant-modification load may be used to detect, diagnose and/or assess the clinical severity of a RNA-associated disease. A higher RNA mutation load mutation load may be associated with increased clinical severity of a RNA-associated disease.
In some embodiments, the method of detecting, diagnosing and/or assessing the clinical severity of a RNA-associated disease may further comprise:
Suitably, the reference RNA is a nuclear tRNA, optionally a nuclear methionine tRNA. Suitably, the reference RNA is a mt-tRNA, optionally a methionine mt-tRNA. Suitably, cref is determined by a method described herein. Suitably, cwt, cmut, and cref are determined by a method described herein.
The relative quantity of wild-type RNA molecules and/or relative quantity of mutant or aberrantly-modified RNA molecules may be used to detect, diagnose and/or assess the clinical severity of a RNA-associated disease. A lower quantity of wild-type RNA molecules and/or a higher quantity of mutant or aberrantly-modified RNA molecules may be associated with increased clinical severity of a RNA-associated disease.
In some embodiments, the method of detecting, diagnosing and/or assessing the clinical severity of a RNA-associated disease may further comprise (in addition to or in the absence of steps (d) and (e)):
Suitably, c′wt and c′mut are determined by a method described herein. Suitably, cwt, cmut, c′wt and c′mut are determined by a method described herein.
The percentage DNA mutation load may be used in combination with the percentage RNA mutation load to detect, diagnose and/or assess the clinical severity of a RNA-associated disease. A higher RNA mutation load and a higher DNA mutation load may be associated with increased clinical severity of a RNA-associated disease.
In some embodiments, the method of detecting, diagnosing and/or assessing the clinical severity of a RNA-associated disease may further comprise (in addition to or in the absence of steps (d) and (e) and/or steps (f) to (h)):
The DNA copy number may be determined by any suitable method, for example the method described in Grady, J. P., et al., 2014. PLoS One, 9(12), p. e114462. Suitably, the DNA copy number may be the total DNA copy number or the wild-type DNA copy number.
The DNA copy number may be used in combination with any other measure described herein to detect, diagnose and/or assess the clinical severity of a RNA-associated disease.
A lower DNA copy number may be associated with increased clinical severity of a RNA-associated disease.
Any further measure (e.g. the age of the subject) may be used to detect, diagnose and/or assess the clinical severity of a RNA-associated disease. For example, it has been shown that the age of the subject is associated with the clinical severity of MELAS (J. P. Grady et al, EMBO Mol Med. 10, e8262 (2018)).
Exemplary Method of Detecting, Diagnosing and/or Assessing the Clinical Severity of MELAS
A method of detecting, diagnosing and/or assessing the clinical severity of MELAS may comprise:
The sample may be obtained or obtainable from urine, muscle, blood and/or saliva. Suitably the sample is obtained or obtainable from urine. Urine is an attractive tissue for diagnostics since it is obtained in a fully non-invasive manner. Suitably the sample is obtained or obtainable from blood. Suitably, the method is used for more than one sample, optionally from more than one tissue.
The concentration of wild-type leucine(UUR) mt-tRNAs and/or the concentration of mutant leucine(UUR) mt-tRNAs may be determined by using a padlock probe, kit and/or RNA quantification system of the present invention (e.g. using a method according to the present invention).
The concentration of wild-type leucine(UUR) mt-tRNA genes and/or the concentration of mutant leucine(UUR) mt-tRNA genes may also be determined using a padlock probe (e.g. using a method described herein).
A higher RNA mutation load (and optionally one or more of: a lower quantity of wild-type leucine(UUR) mt-tRNAs and/or a higher quantity of mutant leucine(UUR) mt-tRNAs with mutation A3243G, a higher DNA mutation load, a lower mtDNA copy number, and or increased age of the subject) may be associated with increased clinical severity of MELAS.
Overview
The measurement of mutations in RNA (and optionally DNA) involves four main steps.
These steps are described in more detail in the protocols section below.
Results
We studied 8 samples obtained from culture of cybrid cells, 2 samples obtained from culture of skin fibroblasts and 2 samples extracted from patient blood. Two of them are controls, C1 and C8, for whom we expected 0% mutation load in both DNA and RNA. Each value of columns 3 and 4 is an average of at least two separate measurements, each measurement containing three technical replicas.
Discussion
In the method described herein, specific hybridisation and ligation of a circularisable DNA probe is performed on the target RNA and subsequently the resulting circles are amplified by hyper-branched rolling circle amplification (HRCA) (P. M. Lizardi et al, Nature Genetics 19, 225-232 (1998)). The RNA sequence required to form the DNA circle can be as short as twenty nucleotides, or less if required, which provides versatility to specifically target even short and highly structured RNA molecules.
We use our approach to quantify the well-known MELAS mutation m.3243A>G in both mitochondrial tRNA and mitochondrial DNA (mtDNA). The transfer RNA (tRNA) is short, strongly structured and exhibits some chemical modifications. We propose that measuring the mutation load of the tRNA has clinical utility. It is expected that the mutation load of the tRNA is more closely related to the cell phenotype than the mutation load of the corresponding DNA, because the tRNA is involved in protein synthesis, while mitochondrial DNA is just an intermediate. We measured a number of different samples and observed that the mutation load of the tRNA differs significantly from the DNA one. The two quantities are not just proportional, the RNA mutation load provides additional information. We propose that these measurements correlate with the clinical evolution of a patient, and propose using them as a prognostic tool and/or as an indicator for the appropriate medical treatment.
The method allows quantifying mutation rates in RNA and DNA with good precision, sensitivity and specificity, even though initially HRCA was not considered being useful for quantitative analyses (Dahl, F., et al., 2004. Proceedings of the National Academy of Sciences, 101(13), pp. 4548-4553). Our circle purification uses a biotinylated capture probe oligonucleotide that binds the RNA target and subsequent purification with streptavidin-coated paramagnetic beads. This purification of circularised probes against non-circularised probes was previously published (J H Smith et al, BioMed Res. Int. 2014, 641090).
The m.3243A>G mutation has a broad clinical expression, ranging from absence of symptoms (asymptomatic carrier) to a multi-systemic disease leading to very premature death (P. Kaufmann et al, Neurology 77, 1965-71 (2011); E. Malfati et al, Neurology 80, 100-105 (2013)). The differences relate to age of onset of the pathology, clinical severity and the specifically affected organs: skeletal muscles, eye muscles, endocrine pancreas, brain, heart (A. W. El-Hattab et al, Mol. Genet. Metab. 116, 4-12 (2015)). It is important to note that this extreme variability in the progression occurs within a same family of patients, showing the same mutation. In the most severe cases, the premature death may be caused by convulsive seizures, heart failure, renal failure or multiple organ dysfunction with lactic acidosis. The possibility of a 3243A>G mutation should also be kept in mind for diabetes, sensorineural hearing impairment, short stature or delayed maturation, exercise intolerance, and milder neurological manifestations such as migraine and learning difficulties (J. Uusimaa et al, Ann. Neurol. 62, 278-87 (2007)).
The heteroplasmy level can vary considerably from one individual to another. There is a relation between mutation load and clinical severity, which seems to be stronger if the mutation load is measured in blood, as compared to urine or muscle tissues (P. F. Chinnery et al, Brain 120, 1713-1721 (1997); S. J. Pickett et al, Ann Clin Transl Neurol. 5, 333-45 (2018)). However, sizeable inter individual variations are observed, even if complementary data is used, like copy-number of mtDNA or mutation load measured on muscle tissue. It is thus very difficult to base a reliable prognostic on heteroplasmy values obtained from mtDNA target molecules. The m.8344A>G mutation, associated with the inherited mitochondrial disease MERRF (myoclonic epilepsy with ragged-red fibres), is a further interesting case for which a correlation between the frequency of the more common clinical features and the level of mutant mtDNA was reported.
The methods herein could improve the prognostics of disease caused by m.3243A>G by measuring the mutation load on the tRNA. The measurement could be used for reliable prognostics of disease progression, which would allow adapting the treatment to the individual patient. The methods herein may also be used for determining the favourable or unfavourable effect of the transcriptional response, unveiled by the relation between the heteroplasmy of the DNA and the tRNA, as well as by the levels of the mutated and non-mutated tRNAs. These measurements could orient therapeutic research toward modulation of the transcriptional response, toward its activation or inhibition.
We performed a study of 10 blood and cell model samples carrying the m.3243A>G mutation. In all cases, the mutation load measured on RNA was lower than the one measured on the DNA, but by varying proportion (see Table 1). This RNA/DNA difference probably reflects a reduced stability of the mutated tRNA, perhaps partially compensated by an increase in mitochondrial transcription. It is possible that this reduction of mutation load is a favourable factor, reducing for instance amino acid mis-incorporation. On the other hand, it could also be unfavourable if compensatory increase of mitochondrial transcription does not occur. In the latter case the number of leucine tRNA would decrease and in turn lead to a global reduction of the mitochondrial translation.
We hypothesise that the mutation load measured on mt-tRNA, exhibits a significant, positive or negative, correlation with the severity of clinical impairment of MELAS patients. In order to evaluate the possible compensation of transcription, we propose measuring the quantity of leucine-1 tRNA in comparison to the cytosolic methionine tRNA. Met-tRNA here acts as a level indicator of the global transcriptional activity of the cell.
The methods described herein gives access to new values and in turn new combinations of values. To improve molecular diagnostics and disease prognosis for mitochondrial disorders related to a mutation in a tRNA coding gene, we propose measuring the quantities of the mutated and non-mutated forms of the mt-tRNA, the quantities of the mutated and non-mutated forms of the mt-DNA and the quantity of a reference RNA acting as indicator of the level of transcriptional activity. Our technique allows measuring these values by the same experimental approach and within the same measurement run, which provide good relative precision and facilitates quantitative comparison between values. The resulting set of data can be used in a manner that can be optimised and adjusted, leveraging results of future clinical studies.
At this stage and regarding the MELAS mutation, we can exploit (i) the heteroplasmy level of the tRNA itself, (ii) the heteroplasmy level of the tRNA in comparison with the heteroplasmy level of the DNA, (iii) the quantity of the mutated and non-mutated tRNA in comparison with the quantity of control tRNAs that are quantitative indicators of global or mitochondrial transcription activity, (iv) a combined use of (i)-(iii), possibly with a set of weighing factors to calculate a score that optimises the value of the molecular diagnostics. To achieve most reliable prognostics of disease progression and best adaption of treatment to the individual patient, it is useful to combine molecular diagnostics with other clinical diagnostic criteria. Clinical diagnostic criteria for MELAS are reported in the literature (Hirano et al, Neuromusc. Disord. 2, 125-135 (1992); Yatsuga et al, Biochim. Biophys. Acta 1820, 619-624 (2012)).
Besides providing new access to mutation frequencies of tRNA and other RNA targets that are difficult to analyse, our invention also has advantages as compared to existing technology for measuring mutation load in DNA.
First, it provides a unified platform to quantitatively address RNA and DNA, which has advantages when combining RNA and DNA results and also saves costs and space, because operational know-how, equipment and most of the consumables can be used for both RNA and DNA. Our technique also has advantages if it is compared to existing techniques to measure DNA mutation load, independent of its RNA capability. PCR Restriction Fragment Length Polymorphism (RFLP) is a traditional approach for measuring mutation load in mitochondrial DNA and is still widely used today (L. M. Scholle et al, Genes 11, 212 (2020)). There, a fragment carrying the mutation site is amplified from a DNA sample by PCR. Subsequently, a restriction endonuclease that specifically cleaves either the mutated or the non-mutated DNA is used. The number of resulting fragments, resolved by electrophoresis for instance, depends on presence/absence of the mutation.
Regarding quantitative analysis of mutation load in a mixed population of mutated and non-mutated templates, however, RFLP has a serious, well-recognized problem. During PCR heteroduplexes are formed between strands arising from the mutated and the non-mutated species. These heteroduplexes are not cleaved and thus cause bias. For RFLP analysis of the MELAS mutation with Apal, for instance, the mutation load of the m.3243A>G mutation is systematically underestimated (see Table 1). Moreover, the precise value depends on the PCR conditions, for instance the number of cycles used (also Table 1). This makes it very difficult to obtain results that can be compared from one laboratory to another; different PCR protocols and PCR machines exhibiting different temperature gradients are used. There is a possibility to circumvent the heteroduplex problem by “last-cycle hot” PCR, but this approach requires using radioactivity (Yatsuga et al, Biochim. Biophys. Acta 1820, 619-624 (2012)).
In well-equipped laboratories, DNA mutations are often analysed by next generation sequencing (NGS). The technique is powerful and can provide heteroplasmy values. However, NGS is an expensive and complex approach, the workflow before sequencing and the subsequent data analysis must be done with know-how and care, and it is difficult to give a reliable estimate for the precision of a heteroplasmy value obtained by NGS. Laboratories in rural regions or low- and middle-income countries often are not even equipped for NGS. In addition, the technique requires more input DNA than our approach, which can be a serious disadvantage when tissues with low DNA content are used, like small volume muscle biopsies, urine or buccal epithelial cell. The latter tissues are interesting, since they allow for less invasive molecular diagnostics.
If a molecular tag was used on the amplification primers, different DNA and RNA species (e.g. in the exemplified MELAS case wtDNA, mutDNA, wt-tRNA, mut-tRNA, mitochondrial methionine tRNA reference, nuclear ini methionine tRNA reference) could be amplified and detected in multiplex. Several mutations in parallel can also be quantified.
Protocols
1. Extraction
We studied 8 samples obtained from culture of cybrid cells, 2 samples obtained from culture of skin fibroblasts and 2 samples extracted from patient blood. Two of them are controls, C1 and C8, for whom we expected 0% mutation load in both DNA and RNA.
For cybrid and fibroblast samples, we used commercial extraction kits according to the supplier's protocol (Qiagen RNeasy for RNA and DNeasy for DNA). For DNA and RNA extraction from blood we used PAXgene Blood DNA tubes and PAXgene blood RNA tubes according to the commercial protocol, respectively. Other extraction procedures can be employed. Extracted nucleic acids can be transported on dry ice and stored at −80° C.
2. Circularization of DNA Oligonucleotide Probes and Ligation
a) DNA Targets
In a 5 μL reaction volume containing Hifi Taq ligase buffer (1×), we added two restriction enzymes (BsII (2.5 Units) and MboII (1.25 Units) to the purified DNA targets (typically 1 μL of DNA at a concentration of 40 ng/μL). The reaction was incubated at 37° C. for 15 minutes, at 55° C. for 15 minutes followed by inactivation for 20 minutes at 80° C. Then, we added the ligation mix (5 μL), which contains 0.1 pmol of wt (or mut) circularizable probes, 0.1 pmol of capture probes and 0.25 μL of Hifi Taq DNA ligase. After a hybridization step (ramp between 97° C. and 37° C., 5° C. decrement every 2 minutes), we incubated the tubes for 30 minutes at 56.6° C. We transferred the sample to a DNA Lobind plate (Eppendorf) and performed magnetic bead-based purification, as described below.
b) RNA Targets
In a 5 μL reaction volume containing Splint R ligase buffer (1×), we added 0.1 pmol of wt or mut circularizable probes, 0.1 pmol of capture probes and 1 μL of RNA template (typically 40-60 ng). After a hybridization step (ramp between 97° C. and 37° C., 5° C. decrement every 2 minutes), we added 0.25 μL of SplintR ligase (6.25 units) then we incubated the tubes for 45 minutes (or less) at 37° C. We transferred the sample to a DNA Lobind plate (Eppendorf) and performed magnetic bead-based purification, as described below.
3. Purification after Circle Ligation
a) Purification Using Magnetic Beads
b) Removal of Non-Circularised Probes by Specific Exonuclease Digestion
We directly added to each tube 2 μL of Isothermal Amplification buffer 10×, water (to get a total 20 μL volume) and Exonuclease cocktail (Exo I (10U), Lambda Exo (1.25 U), and T5 Exo (2.5 U)). Each tube was incubated for 90 min at 37° C. followed by 20 min at 80° C. Finally, we added the HRCA mastermix to the product of exonuclease restriction and performed HRCA in triplicates as described elsewhere.
4. Amplification and Real-Time Detection
Next, we added 34 μL of HRCA mix (1× Isothermal amplification buffer supplemented with 18 nmol dNTP, 18 pmoles forward primer, 18 pmoles reverse primer, 3.6 μL DMSO, 1.8 μL SybrGREEN I 10×, and 0.9 μL Bst DNA polymerase Warmstart) to tubes.
The tubes are distributed in triplicates on a white Twintec plate (Eppendorf). After sealing the plate and a short centrifugation, we incubated at 65° C. for 1 hour in a commercial real-time PCR machine (Eppendorf Mastercycler). SybrGREEN fluorescence is recorded during this incubation.
5. Quantification
On DNA, we typically measure two concentrations.
On RNA, we typically measure three concentrations.
We quantify the number of target molecules in each sample by comparing the amplification curve of the sample to reference curves obtained with a series of standards submitted to the same ligation-purification-HRCA run (
a) For DNA
For one of the wt DNA measurements of sample C3, we measured a characteristic time ct of 22.65 min. Comparison of this value to
For the corresponding mut DNA measurement of sample C3, we measured a characteristic time ct of 13.21 min. Comparison of this value to
Restricting, for the purpose of illustration, the calculation to this particular pair of measurements, the DNA mutation load of this sample would be cmut/(cwt+cmut)=0.96, i.e. 96%.
b) For RNA
For one of the wt RNA measurements of sample C3, we measured a characteristic time ct of 23.1 min. Comparison of this value to
For the corresponding mut RNA measurement of sample C3, we measured a characteristic time ct of 18.75 min. Comparison of this value to
Restricting, for the purpose of illustration, the calculation to this particular pair of measurements, the RNA mutation load of this sample would be cmut/(cwt+cmut)=0.51, i.e. 51%.
6. Preparation of Reference Targets
We prepared in vitro transcribed tRNA (wt and mut) to serve as template in the standard curves. In vitro transcribed tRNA is convenient and appropriate for the purpose of this invention.
We designed two forward oligonucleotides (one for wt variant of Leu and one for mutant). The forward oligonucleotides are composed of the promoter sequence of T7 RNA polymerase and the 40 first nucleotides of the Leu UUR tRNA. A reverse oligonucleotide is designed to overlap the forward primer so that its 3′ end (last 21 bases) can hybridize the 3′end of the forward oligonucleotide.
We synthetized DNA molecules (wt and mut) by using a MuIV-RT (Revertaid) polymerase to transform this template into a full double stranded molecule. First, we prepared a 40 μL hybridization mix as follows: 960 pmoles of Forward oligo added to 960 pmoles of Reverse oligo and 0.8 μL of Tris Buffer 1M (pH 7.5). We incubated the mix at 95° C. for 2 min and at RT for 3 min.
Then, we prepared a 160 μL polymerase mix as follows: we mixed 40 μL of 5×MuIV-RT Buffer, 10 mM dNTP, 4 μL of MuIV-RT polymerase and 108 μL of dH20. Finally, we added the Mastermix to the oligos and incubated the mix at 37° C. for 40 minutes. The resulting product is purified by using a phenol chloroform extraction.
This synthesized DNA that we sometimes call tDNA is used as reference in the DNA measurements. It can serve as DNA reference template provided there are no aborted products (this can be controlled, for instance by classical gel electrophoresis or a commercial, microfluidics-based electrophoresis instrument). Optionally one can perform a PCR of this DNA to assure that there is a single species.
To prepare the reference RNA, we used T7 polymerase to transcribe the reference DNA. First, we prepared a 250 μL transcription Mastermix as follows: we mixed 4×21 μL of 100 mM NTPs, 52.5 μL of 100 mM GMP, 210 μL of 5×T7 Transcription buffer, 26.25 μL of Ribolock, 63 μL of T7 polymerase and dH2O. Then, we prepared a 250 μL template mix containing 20 μg of tDNA prepared at the previous step. Finally, we added the transcription Mastermix to the template mix and incubated overnight at 37° C.
We then proceeded to the isolation and purification of the in vitro transcribed tRNA. We prepared 20% denaturing polyacrylamide gels (8M urea). We loaded the transcription products and after appropriate migration time, we either used UV shadowing or Sybr Gold staining to visualize the band of interest. The bands corresponding to the tRNA are cut using a scalpel and placed in a 15 mL tube. We delicately crushed the gel slices using a pipet tip and added crush and soak buffer until the gel pieces are covered by the buffer. Then, the tubes are placed on a rotating shaker overnight at 4° C.
Then, we spin down the tubes with 7000 RPM for 10 minutes at 4° C. The solutions are then filtered through a syringe and its 0.45 m filter. We finally performed ethanol precipitation and eluted the purified tRNA in dH20 and thus obtained the desired in vitro transcribed tRNAs. The expected size of 78 nt was controlled as well as the fact that it was indeed RNA and not DNA (by digestion with Rnase and Exo I) (
7. Sequences
In all target sequences: (i) the site complementary to the first terminal region of the corresponding circularizable probe is shown underlined; (ii) the site complementary to the second terminal region of the corresponding circularizable probe is shown in italics and underlined; (iii) the mutation site (if present) is shown in bold.
In all circularizable probes sequences: (i) the first terminal region is shown underlined; (ii) the first primer binding site is underlined in bold; (iii) the second primer binding site is shown in italics and underlined in bold; (iv) the second terminal region is shown in italics and underlined; (v) the mutation site (if present) is shown in bold.
In all forward oligos for preparing reference samples, the region complementary to the corresponding tRNA is underlined. In all reverse oligos for preparing reference samples, the overlap with forward oligo is underlined.
a) WT Leu(UUR) Mt-DNA Sequences
GCTCTGCCATCTTAACAAACCCTGTTCTTGGGTGG
GCCCGGTAATCGCATAAAACTTAAAACTAAA
AAATTAAGATGGCAGA
b) MUT Leu(UUR) Mt-DNA Sequences
GCCCTGCCATCTTAA
CAAACCCTGTTCTTGGGTGG
GCCCGGTAATCGCATAAAACTTAAAACTAAA
AAAATTAAGATGGCAG
c) WT Leu(UUR) Mt-RNA Sequences
GUUAAGAUGGCAGAGCCCGGUAA
UCGCAUAAAACUUAAAACUUUACAGUCAG
CTGCCATCTTAAC
TTACCGGGCT
AAAAC
TTA
d) MUT Leu(UUR) Mt-RNA Sequences
GUUAAGAUGGCAGGGCCCGGUAA
UCGCAUAAAACUUAAAACUUUACAGUCAG
CTGCCATCTTAAC
TTACCGGGCC
AAAAC
TTA
e) Nuclear Ini Methionine tRNA Sequences
ATGGGCCCAGCACGCTTC
CATCGACCTCTGGGTT
AACCCAG
GCTTC
f) Mitochondrial Methionine tRNA Sequences
AGUAAGGUCAGCUAAAUAAGCUAUCGGGCCCAUACCCCGAAAAUGUUGGUUA
AGCTTATTTAGCTGACCTTACT
ATTTTCGGGGTATGGGCCCGAT
CCGAAA
All publications mentioned in the above specification are herein incorporated by reference. Various modifications and variations of the disclosed methods, cells, compositions and uses of the invention will be apparent to the skilled person without departing from the scope and spirit of the invention. Although the invention has been disclosed in connection with specific preferred embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the disclosed modes for carrying out the invention, which are obvious to the skilled person are intended to be within the scope of the following claims.
Number | Date | Country | Kind |
---|---|---|---|
2017861.2 | Nov 2020 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/081578 | 11/12/2021 | WO |