This application claims the benefit of priority of Singapore provisional application No. 10201510448Q, filed 18 Dec. 2015, the contents of it being hereby incorporated by reference in its entirety for all purposes.
The present invention relates to molecular biology in particular biomarkers. In particular, the present invention relates to the detection and quantification of biomarkers associated with microorganisms such as bacteria, viruses, fungi and parasites, and methods of determining the likelihood that a patient suffers or is likely to suffer from diseases associated with microorganisms, and to predict the treatment outcome of a patient having diseases associated with microorganisms, by detecting and quantifying the biomarkers associated with the microorganisms.
Infections with microorganisms, including viruses, bacteria, fungi and parasites, have been recognized as risk factors for a wide range of diseases in humans, such as cancers, autoimmune diseases and cardiovascular diseases, which are associated with high mortality rates. Early detection of the causative microorganisms is crucial for the prevention, detection, diagnosis and treatment of these diseases associated with infections with microorganisms.
Examples of conventional methods used for the detection of microorganisms include culture-based procedures, serologic tests and microscopy. Each of these methods is associated with its own limitations. For example, culture-based procedures, which are dependent on the growth of the microorganisms, often takes days for results to become available, and such procedures often reveal false negative results due to the administration of an empiric antibiotic therapy. Serologic tests often face high false negative rate due to the absence or weakness of antibody production in the patient's body, or high false positive rate due to the presence of cross-reacting antibodies. Microscopy techniques may be limited by sample staining and fixing and the use of strong illumination, which may destroy or distort cellular features of the microorganisms to be detected.
Molecular diagnostic procedures such as PCR-based techniques have been introduced for the detection of microorganism infections in recent years. However, the currently available PCR-based techniques have some limitations in the quantitative measurement of the bacterial/viral load in patients' samples, as well as problems of false positive results. Thus, what is needed is an improved method of detecting and/or quantifying of microorganisms with better sensitivity and specificity.
In one aspect, there is provided a method for detecting and/or quantifying the presence of a target nucleic acid sequence of a microorganism in a sample obtained from a subject, comprising amplifying the target sequence in a CpG island of the nucleic acid of the microorganism, irrespective of the methylation status of the CpG island.
In another aspect, there is provided a method for detecting and/or quantifying the presence of a target nucleic acid sequence of Epstein-Barr virus (EBV) in a sample obtained from a subject, comprising amplifying a target sequence in the BamHI-W region of EBV, wherein the target sequence comprises the sequence of
AGATCTAAGGCCGGGAGAGGCAGCCCCAAAGCGGGTGCAGTAACAGGTAATC TCTGGTAGTGATTTGGACCCGAAATCTGACACTTTAGAGCTCTGGAGGACTTTA AAACTCTAAAAATCAAAACTTTAGAGGCGAATGGGCG (SEQ ID NO: 1), wherein amplifying the target sequence comprises the use of a pair of oligonucleotide primers and a probe, wherein the first oligonucleotide primer comprises the sequence of 5′-AGATCTAAGGCCGGGAGAGG-3′ (SEQ ID NO:2), and the second oligonucleotide primer comprises the sequence of 5′-CGCCCATTCGCCTCTAAAGT-3′ (SEQ ID NO: 3), and wherein the probe comprises the sequence of 5′-(6-FAM)CTCTGGTAGTGATTTGGACCCGAAATCTG(TAMRA)-3′ (SEQ ID NO: 4), and wherein the method is a quantitative polymerase chain reaction (qPCR).
In yet another aspect, there is provided a method of detecting a disease associated with microorganism infection, or risk of developing a disease associated with microorganism infection in a subject, comprising detecting and/or quantifying the presence of a nucleic acid sequence of the microorganism using the method of the present invention in a sample obtained from the subject, wherein the presence of the nucleic acid sequence of the microorganism in the sample indicates that the subject has a disease associated with microorganism infection or is at risk of developing a disease associated with microorganism infection.
In another aspect, there is provided a method of detecting and treating a disease associated with microorganism infection, comprising: (i) detecting and/or quantifying the presence of a nucleic acid sequence of the microorganism using the method of the present invention in a sample obtained from the subject, wherein the presence of the nucleic acid sequence of the microorganism in the sample indicates that the subject has a disease associated with microorganism; and (ii) administering to the subject a medicament suitable for the treatment of the disease associated with the microorganism.
In yet another aspect, there is provided a method of predicting the treatment outcome of a disease associated with microorganism infection in a patient, comprising:(i) quantifying the nucleic acid sequence of the microorganism in a sample collected from the patient before treatment or before a treatment step, and quantifying the nucleic acid sequence of the microorganism in a sample collected from the same patient after treatment or after a treatment step; (ii) comparing the amount of the nucleic acid sequence of the microorganism in the sample before and after treatment or a treatment step, wherein a decrease in the amount of the nucleic acid sequence of the microorganism in the sample after treatment or a treatment step indicates that treatment outcome of the disease associated with microorganism infection in the patient is positive, wherein the quantifying of the nucleic acid sequence of the microorganism in the sample is performed according to the method of the present invention.
In a further aspect, there is provide a kit for detecting and/or quantifying the nucleic acid sequence of a microorganism in a sample obtained from a subject, comprising a pair of oligonucleotide primers specific for the amplification of a target sequence in a CpG island of the nucleic acid of the microorganism.
The invention will be better understood with reference to the detailed description when considered in conjunction with the non-limiting examples and the accompanying drawings, in which:
In light of the issues discussed in the background of the invention, the present disclosure provides a method of detecting and/or quantifying a microorganism that has better sensitivity and/or specificity compared to the currently available methods.
In one aspect, there is provided a method for detecting and/or quantifying the presence of a target nucleic acid sequence of a microorganism in a sample obtained from a subject, comprising amplifying the target sequence in a CpG island of the nucleic acid of the microorganism, irrespective of the methylation status of the CpG island.
The term “amplifying” or “amplification” as used herein refers to the production of additional copies of the target sequence.
The term “CpG island” as used herein refers to nucleic acid regions with a high frequency of CpG sites. A CpG site (or CG site) is a region of DNA where a cytosine nucleotide is followed by a guanine nucleotide in the linear sequence of nucleotides along its 5′ to 3′ direction. CpG is shorthand for 5′-C-phosphate-G-3′. A CpG island can have at least about 100 nucleotides, or at least about 500 nucleotides, or at least about 1000 nucleotides, or at least about 2000 nucleotides, or at least about 3000 nucleotides, or at least about 4000 nucleotides, or at least about 5000 nucleotides, or at least about 6000 nucleotides, or at least about 7000 nucleotides, or at least about 8000 nucleotides, or at least about 9000 nucleotides, or at least about 10000 nucleotides, or at least about 15000 nucleotides, or at least about 20000 nucleotides, or at least about 25000 nucleotides, or at least about 30000 nucleotides, or at least about 35000 nucleotides, or at least about 40000 nucleotides, or at least about 45000 nucleotides, or at least about 50000 nucleotides, or at least about 60000 nucleotides, or at least about 70000 nucleotides, or at least about 80000 nucleotides, or at least about 90000 nucleotides, or at least about 100000 nucleotides, or between about 100 nucleotides to about 5000 nucleotides, or between about 200 nucleotides to about 4900 nucleotides, or between about 300 nucleotides to about 4800 nucleotides, or between about 400 nucleotides to about 4700 nucleotides, or between about 500 nucleotides to about 4600 nucleotides, or between about 600 nucleotides to about 4500 nucleotides, or between about 700 nucleotides to about 4400 nucleotides, or between about 800 nucleotides to about 4300 nucleotides, or between about 900 nucleotides to about 4200 nucleotides, or between about 1000 nucleotides to about 4100 nucleotides, or between about 1100 nucleotides to about 4000 nucleotides, or between about 1200 nucleotides to about 3900 nucleotides, or between about 1300 nucleotides to about 3800 nucleotides, or between about 1400 nucleotides to about 3700 nucleotides, or between about 1500 nucleotides to about 3600 nucleotides, or between about 1600 nucleotides to about 3500 nucleotides, or between about 1700 nucleotides to about 3400 nucleotides, or between about 1800 nucleotides to about 3300 nucleotides, or between about 1900 nucleotides to about 3200 nucleotides, or between about 2000 nucleotides to about 3100 nucleotides, or between about 2100 nucleotides to about 3000 nucleotides, or between about 2200 nucleotides to about 2900 nucleotides, or between about 2300 nucleotides to about 2800 nucleotides, or between about 2400 nucleotides to about 2700 nucleotides, or between about 2500 nucleotides to about 2600 nucleotides, or about 150 nucleotides, or about 250 nucleotides, or about 350 nucleotides, or about 450 nucleotides, or about 550 nucleotides, or about 650 nucleotides, or about 750 nucleotides, or about 850 nucleotides, or about 950 nucleotides, or about 1050 nucleotides, or about 1150 nucleotides, or about 1250 nucleotides, or about 1350 nucleotides, or about 1450 nucleotides, or about 1550 nucleotides, or about 1650 nucleotides, or about 1750 nucleotides, or about 1850 nucleotides, or about 1950 nucleotides, or about 2050 nucleotides, or about 2150 nucleotides, or about 2250 nucleotides, or about 2350 nucleotides, or about 2450 nucleotides, or about 2550 nucleotides, or about 2650 nucleotides, or about 2750 nucleotides, or about 2850 nucleotides, or about 2950 nucleotides, or about 3050 nucleotides, or about 3150 nucleotides, or about 3250 nucleotides, or about 3350 nucleotides, or about 3450 nucleotides, or about 3550 nucleotides, or about 3650 nucleotides, or about 3750 nucleotides, or about 3850 nucleotides, or about 3950 nucleotides, or about 4050 nucleotides, or about 4150 nucleotides, or about 4250 nucleotides, or about 4350 nucleotides, or about 4450 nucleotides, or about 4550 nucleotides, or about 4650 nucleotides, or about 4750 nucleotides, or about 4850 nucleotides, or about 4950 nucleotides. A CpG island usually has a C, G percentage of greater than about 50%, or greater than about 51%, or greater than about 52%, or greater than about 53%, or greater than about 54%, or greater than about 55%, or greater than about 56%, or greater than about 57%, or greater than about 58%, or greater than about 59%, or greater than about 60%, or greater than about 61%, or greater than about 62%, or greater than about 63%, or greater than about 64%, or greater than about 65%, or greater than about 66%, or greater than about 67%, or greater than about 68%, or greater than about 69%, or greater than about 70%, or greater than about 71%, or greater than about 72%, or greater than about 73%, or greater than about 74%, or greater than about 75%, or greater than about 76%, or greater than about 77%, or greater than about 78%, or greater than about 79%, or greater than about 80%. The observed-to-expected CpG ratio in CpG island is usually greater than about 55%, or greater than about 56%, or greater than about 57%, or greater than about 58%, or greater than about 59%, or greater than about 60%, or greater than about 61%, or greater than about 62%, or greater than about 63%, or greater than about 64%, or greater than about 65%, or greater than about 66%, or greater than about 67%, or greater than about 68%, or greater than about 69%, or greater than about 70%, or greater than about 71%, or greater than about 72%, or greater than about 73%, or greater than about 74%, or greater than about 75%, or greater than about 76%, or greater than about 77%, or greater than about 78%, or greater than about 79%, or greater than about 80%, or greater than about 81%, or greater than about 82%, or greater than about 83%, or greater than about 84%, or greater than about 85%, or greater than about 86%, or greater than about 87%, or greater than about 88%, or greater than about 89%, or greater than about 90%, or greater than about 91%, or greater than about 92%, or greater than about 93%, or greater than about 94%, or greater than about 95%, or greater than about 96%, or greater than about 97%, or greater than about 98%, or greater than about 99%. The “observed-to-expected CpG ratio” can be derived where the “observed” is the actual number of CpGs in the sequence, and where the “expected” is calculated as:
A number of softwares or analytical tools can be used for the prediction of CpG island in a nucleic acid sequence. Examples of analytical tools available online include but are not limited to: Sequence Manipulation Suite available at http://www.bioinformatics.org/sms2/cpg_islands.html; and Emboss Cpgplot available at http://www.ebi.ac.uk/Tools/segstats/emboss_cpgplot/.
The term “target sequence” or “target nucleic acid sequence” or their grammatical variants as used herein refer to a region of the nucleic acid sequence of the microorganism of interest to be amplified. The presence of the target sequence in a sample obtained from a subject indicates the presence of the nucleic acid sequence associated with the microorganism of interest, and/or the presence of the microorganism in the sample or in the subject.
In some examples, it is preferred that the target sequence to be amplified for the detection and/or quantification of the microorganism is located within the 5′end of the CpG island. The 5′ end location would allow for preferential preservation during exonuclease III degradation of the DNA. Thus, in some examples, the target sequence is located within the first 50%, or the first 49%, or the first 48%, or the first 47%, or the first 46%, or the first 45%, or the first 44%, or the first 43%, or the first 42%, or the first 41%, or the first 40%, or the first 39%, or the first 38%, or the first 37%, or the first 36%, or the first 35%, or the first 34%, or the first 33%, or the first 32%, or the first 31%, or the first 30%, or the first 29%, or the first 28%, or the first 27%, or the first 26%, or the first 25%, or the first 24%, or the first 23%, or the first 22%, or the first 21%, or the first 20%, or the first 19%, or the first 18%, or the first 17%, or the first 16%, or the first 15%, or the first 14%, or the first 13%, or the first 12%, or the first 11%, or the first 10%, or the first 9%, or the first 8%, or the first 7%, or the first 6%, or the first 5%, or the first 4%, or the first 3%, or the first 2%, or the first 1% nucleotides of the CpG island of the nucleic acid of the microorganism, counted from the 5′ end.
The term “methylation” as used herein refers to DNA methylation which typically occurs at a CpG site. Such methylation results in the conversion of the cytosine to 5-methylcytosine, and can by catalysed by the enzyme DNA methyltransferase. A CpG cite can be either methylated or unmethylated.
The method of for detecting and/or quantifying the presence of a target nucleic acid sequence of a microorganism as provided in the present disclosure comprises amplifying the target sequence in a CpG island of the nucleic acid of the microorganism, irrespective of the methylation status of the CpG island. This means that the methylation status of the CpG island for which the target sequence lies within is not important. One of the advantages provided by such a method is that no methylation-status-specific sequencing will be required to select the target sequence to be amplified.
The term “microorganism” as used herein refers to a microscopic living organism, which may be single-celled or multicellular. The microorganism contains DNAs, thus microorganisms that do not contain DNAs (e.g. RNA viruses) are excluded. Examples of microorganisms include but are not limited to bacteria, DNA viruses, fungi and parasites. In some specific examples, the microorganisms are bacteria and DNA viruses. DNA viruses include but are not limited to DNA viruses with double stranded DNAs and DNA viruses with single stranded DNAs. In some examples, the microorganisms are pathogenic.
The term “pathogenic”, “pathogen” and other grammatical variants as used herein refer to the ability of the microorganisms to cause diseases. Examples of such diseases include but are not limited to acinetobacter infections, Actinomycosis, African sleeping sickness (African trypanosomiasis), AIDS (Acquired immunodeficiency syndrome), Amebiasis, Anaplasmosis, Angiostrongyliasis, Anisakiasis, Anthrax, Arcanobacterium haemolyticum infection, Argentine hemorrhagic fever, Ascariasis, Aspergillosis, Astrovirus infection, Alzheimer's disease, Amyotrophic lateral sclerosis, Anorexia nervosa, Anxiety disorder, Asthma, Atherosclerosis, Autoimmune diseases, Babesiosis, Bacillus cereus infection, Bacterial pneumonia, Bacterial vaginosis, Bacteroides infection, Balantidiasis, Bartonellosis, Baylisascaris infection, BK virus infection, Black piedra, Blastocystosis, Blastomycosis, Bolivian hemorrhagic fever, Botulism (and infant botulism), Brazilian hemorrhagic fever, Brucellosis, Bubonic plague, Burkholderia infection, Buruli ulcer, Calicivirus infection (norovirus and sapovirus), Campylobacteriosis, Candidiasis (moniliasis; thrush), Capillariasis, Carrion's disease, Cellulitis, Chagas disease, Chancroid, Chickenpox, Chikungunya, Chlamydia, Chlamydophila pneumoniae infection, Cholera, Chromoblastomycosis, Chytridiomycosis, Clonorchiasis, Clostridium difficile colitis, Coccidioidomycosis, Colorado tick fever, Common cold (acute viral rhinopharyngitis; acute coryza), Creutzfeldt-Jakob disease, Crimean-Congo hemorrhagic fever, Cryptococcosis, Cryptosporidiosis, Cutaneous larva migrans, Cyclosporiasis, Cysticercosis, Cytomegalovirus infection, Cancers, Chronic obstructive pulmonary disease, Crohn's disease, Coronary heart disease, Dengue fever, Desmodesmus infection, Dientamoebiasis, Diphtheria, Diphyllobothriasis, Dracunculiasis, Dementia, Diabetes mellitus type 1, Diabetes mellitus type 2, Dilated cardiomyopathy, Ebola hemorrhagic fever, Echinococcosis, Ehrlichiosis, Enterobiasis, Enterococcus infection, Enterovirus infection, Epidemic typhus, Erythema infectiosum, Exanthem subitum, Epilepsy, Epstein-Barr virus infectious mononucleosis, Fasciolasis, Fasciolopsiasis, Fatal familial insomnia, Filariasis, Food poisoning by clostridium perfringens, Free-living amebic infection, Fusobacterium infection, Gas gangrene (Clostridial myonecrosis), Geotrichosis, Gerstmann-Sträussler-Scheinker syndrome, Giardiasis, Glanders, Gnathostomiasis, Gonorrhoea, Granuloma inguinale, Group A Streptococcal infection, Group B Streptococcal infection, Guillain-Barré syndrome, Haemophilus influenzae infection, Hand, foot and mouth disease, Hantavirus pulmonary syndrome, Heartland virus disease, Helicobacter pylori infection, Hemolytic-uremic syndrome, Hemorrhagic fever with renal syndrome, Hepatitis A, Hepatitis B, Hepatitis C, Hepatitis D, Hepatitis E, Herpes simplex, Histoplasmosis, Hookworm infection, Human bocavirus infection, Human ewingii ehrlichiosis, Human granulocytic anaplasmosis, Human metapneumovirus infection, Human monocytic ehrlichiosis, Human papillomavirus infection, Human parainfluenza virus infection, Hymenolepiasis, Influenza, Isosporiasis, Irritable bowel syndrome, Kawasaki disease, Keratitis, Kingella kingae infection, Kuru, Lassa fever, Legionellosis, Legionellosis, Leishmaniasis, Leprosy, Leptospirosis, Listeriosis, Lyme disease, Lymphatic filariasis, Lymphocytic choriomeningitis, Lupus, Malaria, Marburg hemorrhagic fever, Measles, Middle East respiratory syndrome, Melioidosis, Meningitis, Meningococcal disease, Metagonimiasis, Microsporidiosis, Molluscum contagiosum, Monkeypox, mumps, Murine typhus, Mycoplasma pneumonia, Mycetoma, Myiasis, Multiple sclerosis, Myocardial infarction, Neonatal conjunctivitis, Nocardiosis, Onchocerciasis, Opisthorchiasis, Paracoccidioidomycosis, Paragonimiasis, Pasteurellosis, Pediculosis capitis, Pediculosis corporis, Pediculosis pubis, Pelvic inflammatory disease, Pertussis, Plague, Pneumococcal infection, Pneumocystis pneumonia, Panencephalitis, Pneumonia, Poliomyelitis, Prevotella infection, Primary amoebic meningoencephalitis, Progressive multifocal leukoencephalopathy, Psittacosis, Parkinson's disease, Psoriasis, Rabies, Relapsing fever, Respiratory syncytial virus infection, Rhinosporidiosis, Rhinovirus infection, Rickettsial infection, Rickettsialpox, Rift valley fever, Rocky mountain spotted fever, Rotavirus infection, Rubella, Rheumatoid arthritis, Salmonellosis, SARS, Scabies, Schistosomiasis, Sepsis, Shigellosis, Shingles, Smallpox (variola), Sporotrichosis, Staphylococcal food poisoning, Staphylococcal infection, Strongyloidiasis, Subacute sclerosing, Sarcoidosis, Schizophrenia, Stroke, Syphilis, Taeniasis, Tetanus, Tinea barbae, Tinea capitis, Tinea corporis, Tinea cruris, Tinea manum, Tinea nigra, Tinea pedis, Tinea unguium, Tinea versicolor, Toxocariasis, Toxocariasis, Trachoma, Toxoplasmosis, Trichinosis, Trichomoniasis, Trichuriasis, Tuberculosis, Tularemia, Typhoid fever, Typhus fever, Thromboangiitis obliterans, Tourette syndrome, Tuberculosis, Ureaplasma urealyticum infection, Vasculitis, Variant Creutzfeldt-Jakob disease, Valley fever, Venezuelan equine encephalitis, Venezuelan hemorrhagic fever, Vibrio vulnificus infection, Vibrio parahaemolyticus enteritis, Viral pneumonia, West nile fever, White piedra, Yersinia pseudotuberculosis infection, Yersiniosis, Yellow fever and Zygomycosis. In one specific example, the disease is cancer. In another specific example, the disease is tuberculosis.
One specific example of a DNA virus is Epstein-Barr virus (EBV), also called human herpesvirus 4 (HHV-4). EBV is one of the most common viruses in humans, and is commonly transmitted by saliva and established latent infection in B lymphocytes where it persists for the lifetime of the host. EBV infection is associated with particular forms of cancer, such as nasopharyngeal carcinoma, gastric cancer, Hodgkin's lymphoma and Burkitt's lymphoma.
In an example of the present disclosure, the target sequence to be amplified for the detection of EBV comprises a sequence within a CpG island of the BamHI-W region of EBV.
The term “BamHI-W region” as used herein refers to a repeating BamHI-W restriction fragment of the EBV genome. BamHI-W is a 3-kb long sequence and the genome of an EBV typically contains six to twenty copies of the BamHI-W sequence. The BamHI-W has the following sequence:
There has been evidence showing EBV has evolved to utilize DNA methylation to maximize persistence and to protect itself from immune detection. Moreover, methylation of EBV BamHI-W fragment is important to its expression as a W promoter that drives expression of the EB viral nuclear antigens (EBNAs) at the initiation of virus-induced B-cell transformation. The role of methylation in EBV activity has also been recognized in human tissue. Pharmacologic reversal of dense CpG methylation in tumor tissue can be achieved in patients undergoing treatment with DNA methyltransferase inhibitor. On the other hand, the CpG island of EBV BamHI-W has been shown to be unmethylated in the latent cycle of EBV. Despite the variance in the methylation status of the target nucleic acid sequence within the CpG island of EBV, the method as disclosed herein can be used for the detection and/or quantification of such a target sequence.
In some examples, the CpG island of the BamHI-W region of EBV is selected from the group consisting of:
In some examples, the target sequence within the CpG island comprises the sequence selected from the group consisting of:
The term “fragment” as used herein refers to a nucleic acid sequence that is a constituent of the reference, or a constituent of the complementary sequence of the reference sequence. In some examples, a fragment comprises about 9 to about 300 nucleotides, or about 10 to about 290 nucleotides, or about 20 to about 280 nucleotides, or about 30 to about 270 nucleotides, or about 40 to about 260 nucleotides, or about 50 to about 250 nucleotides, or about 60 to about 240 nucleotides, or about 70 to about 230 nucleotides, or about 80 to about 220 nucleotides, or about 90 to about 210 nucleotides, or about 100 to about 200 nucleotides, or about 110 to about 190 nucleotides, or about 120 to about 180 nucleotides, or about 130 to about 170 nucleotides, or about 140 to about 160 nucleotides, or 10 nucleotides, 15 nucleotides, 25 nucleotides, or 35 nucleotides, or 45 nucleotides, or 55 nucleotides, or 65 nucleotides, or 75 nucleotides, or 85 nucleotides, or 95 nucleotides, or 105 nucleotides, or 115 nucleotides, or 125 nucleotides, or 135 nucleotides, or 145 nucleotides, or 155 nucleotides, or 165 nucleotides, or 175 nucleotides, or 185 nucleotides, or 195 nucleotides, or 205 nucleotides, or 215 nucleotides, or 225 nucleotides, or 235 nucleotides, or 245 nucleotides, or 255 nucleotides, or 265 nucleotides, or 275 nucleotides, or 285 nucleotides, or 295 nucleotides of the reference sequence or the complementary sequence of the reference sequence. A fragment can also comprise about 5% to about 95%, about 10% to about 90%, about 15% to about 85%, about 20% to about 80%, about 25% to about 75%, about 30% to about 70%, about 35% to about 65%, about 40% to about 60%, about 45% to about 55%, or about 50% of the reference sequence or the complementary sequence of the reference sequence.
The term “variant” as used herein refers to sequences that are substantially similar to the reference sequence. These nucleotide sequence variants may have at least 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to the “non-variant” reference sequence. Variants may be a result of substitution, deletion or additional of any number of nucleotides in the reference sequence as a result of the mutation of the wild type DNA.
In one example, the amplification of the target sequence comprises the use of at least one oligonucleotide primer capable of binding to the target sequence. The at least one oligonucleotide primer binds within a region from about 9 to about 300 nucleotides, or about 10 to about 290 nucleotides, or about 20 to about 280 nucleotides, or about 30 to about 270 nucleotides, or about 40 to about 260 nucleotides, or about 50 to about 250 nucleotides, or about 60 to about 240 nucleotides, or about 70 to about 230 nucleotides, or about 80 to about 220 nucleotides, or about 90 to about 210 nucleotides, or about 100 to about 200 nucleotides, or about 110 to about 190 nucleotides, or about 120 to about 180 nucleotides, or about 130 to about 170 nucleotides, or about 140 to about 160 nucleotides, from the first nucleotide of the target sequence. The length of the at least one primer is between 5 to 40 nucleotides, or between 10 to 35 nucleotides, or between 15 to 30 nucleotides, or between 20 to 25 nucleotides, or 8 nucleotides, or 9 nucleotides, or 10 nucleotides, or 12 nucleotides, or 14 nucleotides, or 16 nucleotides, or 18 nucleotides, or 20 nucleotides, or 22 nucleotides, or 24 nucleotides, or 26 nucleotides, or 28 nucleotides, or 30 nucleotides, or 32 nucleotides, or 34 nucleotides, or 36 nucleotides, or 38 nucleotides, or 40 nucleotides. In one specific example, the length of the at least one primer is 20 nucleotides.
In one example, the at least one primer has a sequence identity of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% to at least one sequence selected from the group consisting of AGATCTAAGGCCGGGAGAGG (SEQ ID NO.: 2), GGAATAAGCCCCCAGACAGG (SEQ ID NO.: 12), AGGAAGCGGGTCTATGGTTG (SEQ ID NO.: 13), CGCCCATTCGCCTCTAAAGT (SEQ ID NO.: 3), TTACGTAAACGCGCTGGACT (SEQ ID NO.: 14), and GACTGAGAAGGTGGCCTAGC (SEQ ID NO.: 15), or a complementary sequence thereof. In one example, the primer comprises one sequence selected from the group consisting of AGATCTAAGGCCGGGAGAGG (SEQ ID NO.: 2), GGAATAAGCCCCCAGACAGG (SEQ ID NO.: 12), AGGAAGCGGGTCTATGGTTG (SEQ ID NO.: 13), CGCCCATTCGCCTCTAAAGT (SEQ ID NO.: 3), TTACGTAAACGCGCTGGACT (SEQ ID NO.: 14), and GACTGAGAAGGTGGCCTAGC (SEQ ID NO.: 15), or a complementary sequence thereof.
In another example, the amplification of the target sequence comprises the use of at least one pair of oligonucleotide primers. The at least one pair of oligonucleotide primers can comprise one forward primer and one reverse primer. In some examples, the at least one pair of oligonucleotide primers can be selected from the group consisting of: forward primer AGATCTAAGGCCGGGAGAGG (SEQ ID NO.: 2) and reverse primer CGCCCATTCGCCTCTAAAGT (SEQ ID NO.: 3); forward primer GGAATAAGCCCCCAGACAGG (SEQ ID NO.: 12) and reverse primer TTACGTAAACGCGCTGGACT (SEQ ID NO.: 14); forward primer AGGAAGCGGGTCTATGGTTG (SEQ ID NO.: 13) and reverse primer GACTGAGAAGGTGGCCTAGC (SEQ ID NO.: 15). In one specific example, the at least one pair of oligonucleotide primers are forward primer AGATCTAAGGCCGGGAGAGG (SEQ ID NO.: 2) and reverse primer CGCCCATTCGCCTCTAAAGT (SEQ ID NO.: 3).
In one example, the amplification of the target sequence comprises the use of a probe capable of binding to the target sequence. The probe binds within a region from between 9 to 500 nucleotides, or from between 10 to 490 nucleotides, or from between 20 to 480 nucleotides, or from between 30 to 470 nucleotides, or from between 40 to 460 nucleotides, or from between 50 to 450 nucleotides, or from between 60 to 440 nucleotides, or from between 70 to 430 nucleotides, or from between 80 to 420 nucleotides, or from between 90 to 410 nucleotides, or from between 100 to 400 nucleotides, or from between 110 to 390 nucleotides, or from between 120 to 380 nucleotides, or from between 130 to 370 nucleotides, or from between 140 to 360 nucleotides, or from between 150 to 350 nucleotides, or from between 160 to 340 nucleotides, or from between 170 to 330 nucleotides, or from between 180 to 320 nucleotides, or from between 190 to 310 nucleotides, or from between 200 to 300 nucleotides, or from between 210 to 290 nucleotides, or from between 220 to 280 nucleotides, or from between 230 to 270 nucleotides, or from between 240 to 260 nucleotides, or 8 nucleotides, 15 nucleotides, 25 nucleotides, or 35 nucleotides, or 45 nucleotides, or 55 nucleotides, or 65 nucleotides, or 80 nucleotides, or 90 nucleotides, or 100 nucleotides, or 110 nucleotides, or 120 nucleotides, or 130 nucleotides, or 140 nucleotides, or 150 nucleotides, or 160 nucleotides, or 170 nucleotides, or 180 nucleotides, or 190 nucleotides, or 200 nucleotides, or 210 nucleotides, or 220 nucleotides, or 230 nucleotides, or 240 nucleotides, or 250 nucleotides, or 260 nucleotides, or 270 nucleotides, or 280 nucleotides, or 290 nucleotides, or 300 nucleotides, or 310 nucleotides, or 320 nucleotides, or 330 nucleotides, or 340 nucleotides, or 350 nucleotides, or 360 nucleotides, or 370 nucleotides, or 380 nucleotides, or 390 nucleotides, or 400 nucleotides, or 410 nucleotides, or 420 nucleotides, or 430 nucleotides, or 440 nucleotides, or 450 nucleotides, or 460 nucleotides, or 470 nucleotides, or 480 nucleotides, or 490 nucleotides, or 500 nucleotides from a specific nucleotide of the target sequence. The specific nucleotide can be nucleotide at position number 1, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490 or 500 of the target sequence. The length of a probe can be between 5 to 40 nucleotides, or between 10 to 35 nucleotides, or between 15 to 30 nucleotides, or between 20 to 25 nucleotides, or 8 nucleotides, or 9 nucleotides, or 10 nucleotides, or 12 nucleotides, or 14 nucleotides, or 16 nucleotides, or 18 nucleotides, or 20 nucleotides, or 22 nucleotides, or 24 nucleotides, or 26 nucleotides, or 28 nucleotides, or 30 nucleotides, or 32 nucleotides, or 34 nucleotides, or 36 nucleotides, or 38 nucleotides, or 40 nucleotides.
In one example, the probe has a sequence identity of at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% to a sequence selected from the group consisting of CTCTGGTAGTGATTTGGACCCGAAATCTG (SEQ ID NO.: 16), CCACCTTCTCAGTCCAGCGCGTTT (SEQ ID NO.: 17), GTGACTTCACCAAAGGTCAGGGCCC (SEQ ID NO.: 18), GGTGGTAAGCGGTTCACCTTCAGGG (SEQ ID NO.: 19), and complementary sequences thereof. In one example, the probe comprises the sequence selected from the group consisting of CTCTGGTAGTGATTTGGACCCGAAATCTG (SEQ ID NO.: 16), CCACCTTCTCAGTCCAGCGCGTTT (SEQ ID NO.: 17), GTGACTTCACCAAAGGTCAGGGCCC (SEQ ID NO.: 18), GGTGGTAAGCGGTTCACCTTCAGGG (SEQ ID NO.: 19) and complementary sequences thereof. In one specific example, the probe comprises the sequence of CTCTGGTAGTGATTTGGACCCGAAATCTG (SEQ ID NO.: 16).
In one example, the amplification of the target sequence comprises the use of at least one oligonucleotide primer and one probe as defined above. In another example, the amplification of the target sequence comprises the use of at least one pair of oligonucleotide primers and one probe as defined above.
In one aspect, there is provided a method for detecting and/or quantifying the presence of a target nucleic acid sequence of Epstein-Barr virus (EBV) in a sample obtained from a subject, comprising amplifying a target sequence in the BamHI-W region of EBV, wherein the target sequence comprises the sequence of AGATCTAAGGCCGGGAGAGGCAGCCCCAAAGCGGGTGCAGTAACAGGTAATC TCTGGTAGTGATTTGGACCCGAAATCTGACACTTTAGAGCTCTGGAGGACTTTA AAACTCTAAAAATCAAAACTTTAGAGGCGAATGGGCG (SEQ ID NO.: 1), wherein amplifying the target sequence comprises the use of a pair of oligonucleotide primers and a probe, wherein the first oligonucleotide primer comprises the sequence of 5′-AGATCTAAGGCCGGGAGAGG-3′ (SEQ ID NO.: 2), and the second oligonucleotide primer comprises the sequence of 5′-CGCCCATTCGCCTCTAAAGT-3′ (SEQ ID NO.: 3), and wherein the probe comprises the sequence of 5′-(6-FAM)CTCTGGTAGTGATTTGGACCCGAAATCTG(TAMRA)-3′ (SEQ ID NO.: 4), and wherein the method is a quantitative polymerase chain reaction (qPCR).
In some examples, the method of detecting and/or quantifying the presence of a target nucleic acid sequence of Epstein-Barr virus (EBV) in a sample obtained from a subject further comprises amplifying a control.
The term “control” or “internal control” as used herein refers to a reference sequence which can be used to indicate whether the amplification system is functioning.
When a control is included in the amplification system but is not successfully amplified, it indicates that the amplification result is false-negative. When there is no control spiked in samples, it is not possible to differentiate true negative and false negative. When a control is spiked and successfully amplified but no target signal in the tested samples, the results are true negative. When internal control is spiked and no signals from both internal control and tested samples, the results are false negative. Thus using a control allows one to distinguish false negative results which are common in clinical samples in which many PCR inhibitors are present. False negative results have great impacts on patients' lives, such as failure of disease diagnosis which will delay treatment and might result in treatment failure.
In some examples, the control includes a target sequence of
a complementary sequence, a fragment or a variant thereof.
In some examples, amplifying the control includes the use of a pair of oligonucleotide primers. Examples of a forward primer sequence is 5′-CGCTCTCGGTGTCCTCATTC-3′ (SEQ ID NO.: 81), a complementary sequence, a fragment or a variant thereof. Examples of a reverse primer sequence is 5′-GGCTGACGCTTGCATAAGGT-3′ (SEQ ID NO.: 82), a complementary sequence, a fragment or a variant thereof.
In some other examples, amplifying the control further includes the use of a probe capable of binding to the target sequence of the control. In one example, the probe comprises the sequence of CACAACGCTATGCTGTAACTCGACCTGAC (SEQ ID NO.: 89), a complementary sequence, a fragment or a variant thereof. One specific example of a probe is 5′-VIC-CACAACGCTATGCTGTAACTCGACCTGAC-TAMRA-3′ (SEQ ID NO.: 83).
In one example, the microorganism of which a target nucleic acid sequence is to be detected and/or quantified using the method as disclosed herein is a bacterium. One specific example of such a bacterium is Mycobacterium tuberculosis.
The following are some non-exclusive examples of CpG islands in the genomic nucleic acid sequences of Mycobacterium tuberculosis as predicted using CpG island analytical software:
The above identified 17 CpG islands accounts for 99.9% of the genomic nucleic acid sequence of Mycobacterium tuberculosis.
Some non-limiting examples of the target sequences to be used for the detection and/or quantification of Mycobacterium tuberculosis are:
The above identified target sequences in the genomic nucleic acid sequence of Mycobacterium tuberculosis are derived from whole genome sequencing from the plasma sample of a patient who has pulmonary tuberculosis. It is worth noting that it is generally unexpected to detect tuberculosis DNA signal from circulating plasma through whole genome sequencing. It is also to be noted that all of these target sequences fall within the above 17 CpG islands identified for Mycobacterium tuberculosis.
Examples of primers that can be used for the amplification of a target sequence in the CpG island of the genomic nucleic acid sequence of Mycobacterium tuberculosis include but are not limited to: forward primer GGCTGTGGGTAGCAGACC (SEQ ID NO.: 74) and ACCTGAAAGACGTTATCCACCAT (SEQ ID NO.: 75), reverse primer CGGGTCCAGATGGCTTGC (SEQ ID NO.: 76) and CGGCTAGTGCATTGTCATAGGA (SEQ ID NO.: 77).
The amplification of a target sequence in the CpG island of the genomic nucleic acid sequence of Mycobacterium tuberculosis may also comprise the use of a probe. In some examples, the probe used comprises the sequence of TGTCGACCTGGGCAGGGTTCG (SEQ ID NO.: 78) and TCCGACCGCGCTCCGACCGACG (SEQ ID NO.: 79).
In one example, the probe used comprises a component comprises at least one detectable label. In one example, the detectable label is capable of producing an optical signal. In one example, the detectable label comprises a fluorophore. Examples of fluorophores include but are not limited to fluorescent proteins, for example GFP (green fluorescent protein), YFP (yellow fluorescent protein), RFP (red fluorescent protein); non-protein fluorophores selected from the group consisting of xanthene derivatives (for example, fluorescein, rhodamine, Oregon green, eosin, 6-carboxyfluorescein and Texas red); cyanine derivatives (for example, cyanine, indocarbocyanine, oxacarbocyanine, thiacarbocyanine, and merocyanine), squaraine derivatives and ring-substituted squaraines, including Seta, SeTau, and Square dyes, naphthalene derivatives (dansyl and prodan derivatives), coumarin derivatives, oxadiazole derivatives (for example pyridyloxazole, nitrobenzoxadiazole and benzoxadiazole), anthracene derivatives (for example anthraquinones, DRAQ5, DRAQ7 and CyTRAK Orange), pyrene derivatives(for example cascade blue), oxazine derivatives (for example, Nile red, Nile blue, cresyl violet, oxazine 170), acridine derivatives (for example proflavin, acridine orange, acridine yellow), arylmethine derivatives (for example auramine, crystal violet, malachite green), tetrapyrrole derivatives (for example porphin, phthalocyanine, bilirubin) and derivatives thereof. In some specific examples, the fluorophore is selected from the group consisting of FAM (carboxyfluorescein), TET (carboxy-2′,4,7,7′-tetrachlorofluorescein succinimidyl ester), HEX (carboxy-2,4,4,5,7,7-hexachlorofluorescein succinimidyl ester), ROX (carboxy-X-rhodamine) and NED. In one specific example, the fluorophore is FAM (carboxyfluorescein), or 6-FAM (6-carboxyfluorescein).
In one example, the at least one detectable label is capable of producing a changeable signal. The changeable signal may be produced upon the hybridization of the probe to the target sequence. For example, the signal may be detectable before the probe binds to the target sequence, and upon the hybridization of the probe to the target sequence, the signal is reduced in strength or becomes completely undetectable. In another example, the detectable signal may be produced only upon the hybridization of the probe to the target sequence, or the strength of the detectable signal may be increased upon the hybridization of the probe to the target sequence.
In one example, the component comprises two detectable labels. In one example, the two detectable labels function independently, while in another example, the two detectable labels are an interactive pair of labels. The interactive pair of labels are capable of generating a changeable signal. For example, the signal may be detectable before the probe binds to the target sequence, and upon the hybridization of the probe to the target sequence, the signal is reduced in strength or becomes completely undetectable. In another example, the detectable signal may be produced only upon the hybridization of the probe to the target sequence, or the strength of the detectable signal may be increased upon the hybridization of the probe to the target sequence. In one specific example, the detectable signal is not generated when both detectable labels are linked together by the probe sequence. Once at least one detectable label is cleaved from the probe, the detectable signal is generated.
In some examples, the interactive pair of labels may comprise a fluorophore and a quencher pair. In one specific example, the fluorophore is located at the 5′ end of the probe, and the quencher is located at the 3′end of the probe. Examples of quenchers include but are not limited to TAMRA (tetramethylrhodamine), TaqMan® MGB (minor groove binder) and BHQ™ (Black Hole Quencher™). In one specific example, the fluorophore is FAM (carboxyfluorescein), more particularly 6-FAM (6-carboxyfluorescein), and the quencher is TAMRA (tetramethylrhodamine).
The amplification of the target sequence in the above methods may be carried out via a polymerase chain reaction (PCR). Examples of PCRs include but are not limited to real-time polymerase chain reaction, digital polymerase chain reaction, quantitative polymerase chain reaction, qualitative polymerase chain reaction, quantitative real-time polymerase chain reaction, or quantitative reverse transcription polymerase chain reaction.
Real-time PCR monitors the amplification of a targeted DNA molecule during the PCR, i.e. in real-time, and not at its end, as in conventional PCR. Real-time PCR can be used quantitatively (Quantitative real-time PCR), and semi-quantitatively, i.e. above/below a certain amount of DNA molecules (Semi quantitative real-time PCR). Quantitative Real-Time PCR (qrt-PCR) methods use fluorescent dyes or fluorophore-containing DNA probes to measure the amount of amplified product as the amplification progresses.
Digital PCR (dPCR) simultaneously amplifies thousands of samples, each in a separate droplet within an emulsion.
Quantitative PCR (qPCR) is used to measure the specific amount of target DNA (or RNA) in a sample. By measuring amplification only within the phase of true exponential increase, the amount of measured product more accurately reflects the initial amount of target. Special thermal cyclers are used that monitor the amount of product during the amplification.
Qualitative PCR refers to a PCR method used to detect the present or absence of target DNA (RNA) in a sample without quantifying the amount present.
Reverse Transcription PCR is used to reverse-transcribe and amplify RNA to cDNA. PCR is preceded by a reaction using reverse transcriptase, an enzyme that converts RNA into cDNA. The two reactions may be combined in a tube, with the initial heating step of PCR being used to inactivate the transcriptase. RT-PCR is widely used in expression profiling, which detects the expression of a gene. It can also be used to obtain sequence of an RNA transcript, which may aid the determination of the transcription start and termination sites and facilitate mapping of the location of exons and introns in a gene sequence.
The amplified product obtained using the methods described above can be purified and the resulting purified product can be quantified using conventional nucleic acid purification methods and quantification methods. Examples of nucleic acid purification methods include but are not limited to gel electrophoresis followed by gel extraction, and silica based membrane technologies. Examples of methods to quantify the purified nucleic acid include but are not limited to spectrophotometric analysis and analysis using fluorescent dye tagging. Various kits and systems are commercially available for the purification and quantification of amplified. nucleic acid products.
The copy number of BamHI-W in the amplified product can be calculated using the following formula:
The methods described herein have better sensitivity and specificity compared to the currently known methods of detecting microorganisms. Specifically, for the detection of EBV, in terms of sensitivity, the lowest concentration of EBVs in a sample that can be detected using the methods described herein is 100 International Unity (IU)/ml or sample, or 90 IU/ml of sample, or 80 IU/ml of sample, or 70 IU/ml of sample, or 60 IU/ml of sample, or 50 IU/ml of sample, or 40 IU/ml of sample, or 30 IU/ml of sample, or 20 IU/ml of sample, or 10 IU/ml of sample, or 5 IU/ml of sample, or 1 IU/ml of sample.
The term “International Unit” or “IU” as used herein refers to the first WHO International Standard for Epstein-Barr Virus for Nucleic Acid Amplification Techniques defined by the National Institute for Biological Standards and Control (NIBSC) (NIBSC Code No. 09/260).
As described above, the genome of an EBV typically contains six to twenty copies of the BamHI-W sequence. Therefore higher sensitivity can be achieved when the detection of EBV is based on the detection of the BamHI-W region. However, the variability of BamHI-W copy numbers in different EBV strains has been considered as a challenged in assay comparison and standardization between laboratories. Therefore, a new method of standardizing the number of copies of BamHI-W to the amount of EBV in a sample has been developed in the present disclosure to solve this problem. The sequence of BamHI-W standard was incorporated as an insert in plasmid which was propagated in competent bacteria cells, such as E. coli. Single colony was picked and grown for scale-up production of the plasmids. Bacteria were harvested and plasmids were extracted and quantified.
The BamHI-W standard plasmid obtained carries only BamHI-W sequence of EBV whereas NIBSC standard is the whole genome sequence of EBV. The presence of other genes in EBV genome might interfere with the amplification of BamHI-W or generate higher background signals as compared to the BamHI-W standard plasmid. In addition, the BamHI-W standard plasmid obtained carries one BamHI-W copy, allowing absolute quantification of BamHI-W. This is not feasible in the case of NIBSC standards because number of BamHI-W copies is unknown in EBV genome.
Using the constructed standard plasmid of BamHI-W, it has been derived that 1 IU (International Unit) of EBV as defined by the NIBSC standard equals to about 1.38 copies of BamHI-W. This conversion allows standardization between BamHI-W assay of the present disclosure and other assays, allowing comparison of test results across various laboratories that use different types of assay for EBV quantification.
The method of detecting and/or quantifying a target nucleic acid sequence of the EBV can be used alone or in combination with other available methods of detecting and/or quantifying EBV.
In another aspect, there is provided a method of detecting a disease associated with microorganism infection, or risk of developing a disease associated with microorganism infection in a subject, comprising detecting and/or quantifying the presence of a nucleic acid sequence of the microorganism using the method of the present invention in a sample obtained from the subject, wherein the presence of the nucleic acid sequence of the microorganism in the sample indicates that the subject has a disease associated with microorganism infection or is at risk of developing a disease associated with microorganism infection.
In a further aspect, there is provided a method of detecting and treating a disease associated with microorganism infection, comprising: (i) detecting and/or quantifying the presence of a nucleic acid sequence of the microorganism using the method of the present invention in a sample obtained from the subject, wherein the presence of the nucleic acid sequence of the microorganism in the sample indicates that the subject has a disease associated with microorganism; (ii) administering to the subject a medicament suitable for the treatment of the disease associated with the microorganism.
In yet a further aspect, there is provided a method of predicting the treatment outcome of a disease associated with microorganism infection in a patient, comprising: (i) quantifying the nucleic acid sequence of the microorganism in a sample collected from the patient before treatment or before a treatment step, and quantifying the nucleic acid sequence of the microorganism in a sample collected from the same patient after treatment or after a treatment step; (ii) comparing the amount of the nucleic acid sequence of the microorganism in the sample before and after treatment or a treatment step, wherein a decrease in the amount of the nucleic acid sequence of the microorganism in the sample after treatment or a treatment step indicates that treatment outcome of the disease associated with microorganism infection in the patient is positive, wherein the quantifying of the nucleic acid sequence of the microorganism in the sample is performed according to the method of the present invention.
The term “disease associated with microorganism infection” as used herein refers to any disease that can be caused by a microorganism, in particular a pathogenic microorganism as described herein.
In one specific example, the disease is EBV-associated disease, in particular EBV-associated cancers. Examples of EBV-associated cancers include but are not limited to nasopharyngeal carcinoma (NPC), gastric cancer, Hodgkin's lymphoma and Burkitt's lymphoma. In one specific example, the EBV-associated cancer is NPC.
The term “sample” used herein refers to a biological sample, or a sample that comprises at least some biological materials such as nucleic acid molecules, more particularly cell free DNAs (cfDNAs). The biological samples may include liquid samples, such as whole blood, blood serum, blood plasma, buffy coat, peripheral blood mononuclear cells (PBMCs), cerebrospinal fluid, central spinal fluid, lymph fluid, cystic fluid, sputum, stool, pleural effusion, mucus, pleural fluid, ascitic fluid, amniotic fluid, peritoneal fluid, saliva, bronchial washes and urine. In specific examples, the biological sample is a blood sample or blood plasma sample. In some other specific examples, the biological sample is not a urine sample. Nucleic acids can be extracted from a biological sample using any method known to those of skill in the art.
In one specific example, the subject from which the sample is obtained is of Asian ethnicity.
The term cell free DNA (cfDNA) is used herein to refer to DNA that is found in the circulating system of a subject. cfDNAs can be microorganism cfDNAs, or cfDNAs directly released from mammalian cells, in particular abnormal cells such as cancer cells. Microorganism cfDNAs may be released from the circulating cell free microorganisms, or from microorganisms present in mammalian cells. cfDNAs directly released from mammalian cells could have been incorporated into the DNAs of the mammalian cells as a result of mammalian cells infection caused by the microorganisms.
cfDNAs which are residing within the CpG island is more stable and less susceptible to degradation and hence more likely to be detected. Therefore when a sample contains both microorganism cfDNAs and human cfDNAs, targeting microorganism cfDNAs residing within the CpG island makes the microorganism cfDNAs more likely to be detected amongst the presence of human cfDNAs.
In some examples, it is preferred to select target sequences within a nucleic acid sequence that occurs in multiple repeats in a microorganism, in order to make detection and/or quantification more feasible, especially in sample with low content of the microorganism.
The presence of the DNA sequence of a microorganism in a sample from a subject can also be used as an indication of the stage of the disease associated with the microorganism. In one specific example, the presence of an EBV DNA sequence in a sample from a subject can be used as an indication of the stage of the EBV- associated cancers.
The term “treatment” as used herein refers to any methods or substances or combination thereof, which remedy a disease state or symptoms, prevent the establishment of disease, or otherwise prevent, hinder, retard, or reverse the progression of disease or other undesirable symptoms. In particular, the disease is an EBV-associated disease, such as EBV-associated cancers, which include but are not limited to nasopharyngeal carcinoma, gastric cancer, Hodgkin's lymphoma and Burkitt's lymphoma. Types of cancer treatment generally include chemotherapy, radiation therapy, immunotherapy, and targeted therapy.
The term “decrease” or “reduce” and their grammatically variance refers to a decrease in the level of EBV in a sample collected from the patient after treatment or a treatment step as compared to the level of EBV in a sample collected from the same patient before treatment or before a treatment step. In some examples, the level of EBVs in the sample collected after treatment or a treatment step is reduced by at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90% or at least 95% as compared to the level of EBVs in a sample collected from the same patient before treatment or before a treatment step.
The present disclosure also provides a kit for detecting and/or quantifying the presence of a target nucleic acid of a microorganism in a sample, where the kit can be used according to the methods of the present invention.
In one aspect, there is provided a kit for detecting and/or quantifying the nucleic acid sequence of a microorganism in a sample obtained from a subject, comprising a pair of oligonucleotide primers specific for the amplification of a target sequence in a CpG island of the nucleic acid of the microorganism.
In one example, the kit further comprises a probe capable of binding to the target sequence. In some examples, the probe is any probe as described above.
In yet a further example, there is provided one or more oligonucleotide primers for the amplification of a target sequence in the CpG island of the nucleic acid of a microorganism.
The sequences as described herein are from the 5′ to the 3′ direction, unless specified otherwise.
The invention illustratively described herein may suitably be practiced in the absence of any element or elements, limitation or limitations, not specifically disclosed herein. Thus, for example, the terms “comprising”, “including”, “containing”, etc. shall be read expansively and without limitation. Additionally, the terms and expressions employed herein have been used as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention claimed. Thus, it should be understood that although the present invention has been specifically disclosed by preferred embodiments and optional features, modification and variation of the inventions embodied therein herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention.
The invention has been described broadly and generically herein. Each of the narrower species and subgeneric groupings falling within the generic disclosure also form part of the invention. This includes the generic description of the invention with a proviso or negative limitation removing any subject matter from the genus, regardless of whether or not the excised material is specifically recited herein.
Other embodiments are within the following claims and non-limiting examples. In addition, where features or aspects of the invention are described in terms of Markush groups, those skilled in the art will recognize that the invention is also thereby described in terms of any individual member or subgroup of members of the Markush group.
Benchmarking of the EBV cfDNA was conducted using comparison against results from a College of American Pathologists (CAP)-accredited laboratory as well as WHO-approved international EBV standards.
The clinical sensitivity and specificity of the three EBV cfDNA assays was benchmarked against an in-house EBV cfDNA assay targeting EBNA1 in a College of American Pathologists (CAP)-accredited clinical-grade laboratory at the Singapore General Hospital (SGH), with known analytical performance reported as a sensitivity of 79% and specificity of 100%. Out of 46 NPC patients (Table 1), 31 (69%) were reported to be EBV-positive, and 14 (31%) were reported to be EBV-negative (1 case was not reported due to logistic reasons). Of 31 EBV-positive patients on the clinical-grade assay, both BamHI-W qPCR and EBNA1-dPCR assays showed 100% matching positivity, whereas the EBNA1-qPCR assay showed 80% match. Of the 14 EBV-negative patients, the BamHI-W qPCR, EBNA1-dPCR and EBNA1-qPCR assay reported 9, 7, and 5 positive cases. Overall, all three EBV cfDNA assays demonstrate high clinical sensitivity and specificity, with particularly high sensitivity shown at baseline for the BamHI-W qPCR assay.
The only available WHO-approved international EBV standard was used to benchmark the sensitivity and specificity of the three EBV cfDNA assays. The BamHI-W qPCR assay demonstrated the highest reproducible sensitivity. The lowest EBV concentration detected in triplicates was 100 IU/mL for BamHI-W qPCR assay and 1,000 IU/mL for both EBNA1 assays (Table 2). The BamHI-W qPCR assay was also able to detect positive signal in one replicate of the standard containing 1 IU/mL, whereas EBNA1 assays were not able to. In addition, all assays produced no false-positive detection in five EBV-free standards, indicating their high specificity against EBV cfDNA.
The IU of NIBSC standards is derived from a mean value of highly variable EBV copy number measured by various qPCR assays of 28 laboratories in the world. These assays employ different DNA extraction methods, and target a wide range of genes, including a single-copy gene, EBNA1, and a multiple-repeat gene, BamHI-W. However, since dPCR was not included in the evaluation, the relationship between EBV copy number as obtained by dPCR and IU is less clear. Moreover, since the number of BamHI-W fragments varies in different EBV isolates, a fixed conversion ratio of BamHI-W copies to IU will not be always accurate in different patients' sample. Therefore, the NIBSC standards were only used in this study for comparison of sensitivity and specificity between EBV cfDNA assays. The subsequent data were to be reported in copy number of respective EBV targets.
Among EBV cfDNA quantitation approaches, BamHI-W qPCR assay yielded the highest concentration of EBV cfDNA levels: 2.4 to 37.7-fold higher than EBNA1-qPCR assay and 2.2 to 25.5-fold higher than EBNA1-dPCR assay (Table 3).
All samples detected EBV-positive by both EBNA1 assays were also detected positive for EBV by BamHI-W assay. The detection rates of canonical CTCs and potential CTCs are 76% and 94% in pre-treatment samples respectively. Overall, potential CTC count was higher and weakly correlated to canonical CTC count (r2=0.21, P-value=<0.01). No correlation was observed between each type of CTC count and EBV cfDNA levels quantified by different assays. In contrast, among the EBV cfDNA assays, strong correlation was observed between BamHI-W qPCR and EBNA1-dPCR assays (r2=0.99, P-value<0.0001), but not between BamHI-W and EBNA1-qPCR assays (r2=0.03, P-value=0.29) nor between EBNA1-qPCR and -dPCR assays (r2=0.06, P-value=0.11). This result corresponded with the similar detection rate of BamHI-W qPCR (89%) and EBNA1-dPCR (85%) assays, with the detection rate of EBNA1-qPCR assay being 67%.
aPCR inhibition
The clinical stages were re-classified to three groups; stage I, stage II-III, and stage IV (Table 4). The combination of stage-II and -III NPC patients was in the light of long-term 5-year follow-up data from Singapore showing similar survival outcomes using modern treatment approaches13. The EBV cfDNA levels in three assays strongly correlated with clinical stages. In contrast, there was no statistically significant relationship between CTCs and clinical stages. These results indicated a strong association between NPC clinical stage and EBV cfDNA, but not CTCs.
aLikelihood ratio Chi-square and P-values were determined using logistic ordinal regression for the prediction of NPC clinical stage, given the levels of NPC circulation biomarkers
bP-values <0.05 were considered statistically significant.
Decreased EBV cfDNA levels were observed in all EBV-positive patients following treatment, strongly correlating with the local radiological response (Table 5). To evaluate the predictive value of NPC circulating biomarkers for short-term radiological response, we determined that EBV cfDNA levels were significantly reduced after treatment (Wilcoxon's signed rank testing p-value<0.001 for all three techniques BamHI-W qPCR, EBNA1-dPCR and EBNA1-qPCR assay). In contrast, for both canonical and potential CTCs, decrease was not significant (p=0.07 and 0.54 respectively). The stratified analysis performed on patients undergoing radiotherapy and chemo-radiotherapy showed the magnitude of decrease of canonical CTCs pre- and post-treatment in each group remains insignificant (Table 6). Overall, our results show that EBV cfDNA level correlation with short-term radiological response was much stronger than that of potential or canonical CTC counts.
aP-Values were calculated using the Wilcoxon's signed rank testing and values <0.05 were considered statistically significant.
Survival analysis demonstrated that there was a stronger correlation between EBV cfDNA and overall survival, as compared to that between CTC counts and overall survival. All three EBV cfDNA techniques showed prognostic value on survival analysis: BamHI-W qPCR, EBNA1-dPCR and EBNA1-qPCR assays yielded corresponding p-values of 0.03, 0.02 and 0.0002 by log-rank testing respectively, whereas canonical CTC and potential CTC counts were not associated with overall survival (p=0.66 and 0.13 respectively). Kaplan-Meier plots are also shown for dichotomized biomarker variables (
1FAM signal
2VIC signal
The nucleic acid sequence of the Internal Control (IC) selected is
The sequences of the primers and probes used to amplify the IC are: forward Primer:
Concentration of IC primers and probe is 400 nM and 100 nM respectively (same as BamHI-W primers and probe). As shown in Table 8, Master mix 1 contains IC and all the components of the duplex assay. Therefore, both FAM and VIC signals should be seen in C666-1, an EBV-positive cell line. RKO, buffy coat of healthy donor and no-template control are EBV-negative, thus, will only emit VIC signal. Master mix 2, 3, 4 contain EBV primers and probe without IC Primers and probes or IC sequence or both. So VIC signal should not be present in all samples. This is to test if any component of the IC assay interfere the EBV assay. The last master mix only contains the IC sequence, primers and probe. VIC signal should be positive whereas FAM signal should be negative for all samples. And the results are as expected. The mean Ct values of both FAM and VIC signals in different setups are about the same. The duplex assay is then applied on a serial dilution of EBV sample with or without IC. As shown in
The duplex assay is tested in clinical samples, including 5 EBV-positive NPC samples, and 5 healthy donor's samples. As illustrated in
1FAM signal
2VIC signal
The IC plasmids were spiked to plasma samples and underwent DNA extraction whereas EBV standard plasmids were added directly to the PCR reactions. As shown in
Despite being a powerful tool in NPC prognosis, the quantification of EBV cfDNA faces challenges of standardization. The NIBSC standards, which are derived from whole EBV produced by B95-8 cells provide a consensus estimate of EBV IU, but are not ideal for standardization of BamHI-W copy number. In addition, the NIBSC spike-in standards do not truly represent the NPC plasma samples. Naturally occurring cfDNA has a size of less than 181 bp in NPC plasma whereas DNA obtained from NIBSC was genomic DNA with a size of 170 kb. The differences in DNA size influence the choice of DNA extraction kit, which in turn has meaningful impact on DNA recovery, and subsequently DNA quantification.
To solve these problems, a new standard for the quantification of EBV cfDNA is developed in the present study.
The sequence of BamHI-W standard was incorporated as an insert in plasmid which was propagated in competent bacteria cells (such as E. coli). Single colony was picked and further grown in Lysogeny Broth (LB) for scale-up production of the plasmids. Bacteria were harvested and plasmids were extracted using the QIAprep Spin Miniprep Kit (Qiagen) and quantified using Quantus Fluorometer (Promega). Sequence of BamHI-W in the newly produced plasmids was confirmed by Sanger Sequencing.
BamHI-W standard plasmids were prepared with known copy number (preferably 10-time serial dilution) then added directly to the PCR well. Standard curve was plotted based on the Ct values obtained from the BamHI-W standard plasmids, which were used to calculate sample's BamHI-W copy number. In the conversion factor experiment, NIBSC standards were set at standard curve to which BamHI-W standard plasmids were calibrated.
Two targets sequences were tested for the amplification of nucleic acid sequence of Mycobacterium tuberculosis in samples collected from a patient known to have tuberculosis and a healthy subject as control. The target sequences are: (1) nucleotide sequences from nucleotide position 1542511 to nucleotide position 1542349; and (2) nucleotide sequences from nucleotide position 1542328 to nucleotide position 1542215 of the genomic sequence of Mycobacteriaum tuberculosis. The primers and probes used for the amplification of target sequence (1) are: forward primer 5′-GGCTGTGGGTAGCAGACC-3′ (SEQ ID NO.: 74), reverse primer 5′-CGGGTCCAGATGGCTTGC-3′ (SEQ ID NO.: 76) and probe 5′-FAM-TGTCGACCTGGGCAGGGTTCG-TAMRA-3′ (SEQ ID NO.: 84). The primers and probes used for the amplification of target sequence (2) are: forward primer 5′-ACCTGAAAGACGTTATCCACCAT-3′(SEQ ID NO.: 75), reverse primer 5′-CGGCTAGTGCATTGTCATAGGA-3′ (SEQ ID NO.: 77), and probe 5′-FAM-TCCGACCGCGCTCCGACCGACG-TAMRA-3′ (SEQ ID NO.: 85). From the PCR amplification results shown in
Discussion
Non-invasive approaches of NPC diagnosis have been available for the past decade via the detection of immunoglobulin A antibody against EBV antigens in patients' serum. However, these techniques are inefficient in NPC prognosis and relapse prediction. There is considerable ongoing research into EBV cfDNA in NPC patients for prediction of post-treatment outcomes, and its role in selecting patients for additional adjuvant treatment following definitive therapy.
In the present study, good correlation between EBV cfDNA and clinicopathologic outcomes was consistently demonstrated regardless of approach undertaken: BamHI-W qPCR, EBNA1-qPCR or EBNA1-dPCR assays. Decreased EBV cfDNA levels are commonly observed in almost all patients undergoing treatment, corresponding generally to the short-term post-treatment radiological response, which is commonly a complete or near-complete response. Overall, the results demonstrated that EBV cfDNA yielded better results in comparison with circulating tumor cell (CTC) count as a circulating biomarker for NPC. Regardless of approach, cfDNA showed far stronger correlation with tumor stage, short-term radiological response as well as overall survival, in comparison with CTC counts.
The detection rate of the BamHI-W qPCR assay in the present study was 89%. In comparison with clinically validated assays, the BamHI-W qPCR assay demonstrated better performance. The detection rate of the CE-IVD EBNA1-qPCR assay reported in this study was 67%, despite its claimed clinical sensitivity of 100%, based on 80 EBV-positive samples. Moreover, EBV positive cases reported by the BamHI-W qPCR assay were matched with the ones reported by the SGH assay, which had clinical sensitivity of 79%.
In the present study, by targeting the multiple-repeat BamHI-W fragments, the BamHI-W qPCR assay yielded the highest detection rate in NPC pre-treatment samples. It also yielded the highest sensitivity in measurement of NIBSC spike-in standards despite the possible DNA losses due to the DNA extraction method potentially not optimized to genomic DNA. On the other hand, regardless of being different in fundamental techniques of quantification and EBV targets, BamHI-W qPCR and EBNA1-dPCR assays were strongly correlated in the measurement of EBV levels in pre-treatment samples. This correlation could possibly be aided by the same extraction process from which the cfDNA used in BamHI-W qPCR and EBNA1-dPCR assays was extracted. Altogether, in our interpretation, the BamHI-W qPCR and EBNA1-dPCR assays are more likely to quantify the true values of EBV cfDNA level in pre-treatment samples of NPC patients.
The evidence of EBV cfDNA existing in the form of short and freely-floating fragments in the plasma had led to a conclusion that they were released from apoptotic NPC cells. In other words, the NPC cells releasing EBV cfDNA lysed before they had the chance to enter the bloodstream. This phenomenon could explain the non-correlation between NPC CTC counts and EBV cfDNA levels measured by various assays.
The results of the present study demonstrated that by targeting the multiple-repeat BamHI-W, higher detection rate and sensitivity were achieved.
Further, plasma sample of NPC patients contains both human cfDNA and EBV cfDNA. The two major challenges in detection of EBV cfDNA, and in general, microorganism cfDNA in clinical samples are the degradation of EBV cfDNA and the abundant presence of human genomic DNA which hinders the signals from EBV cfDNA. In order to overcome these challenges, a target region (BamHI-W region) in EBV that is preserved and present in high copy number was selected, in particular the region within the CpG island and near to the 5′ end of CpG island was selected for the following reasons: 1. BamHI-W region occurs in multiple repeats per EBV genome, making detection and quantification more feasible, especially in sample with low EBV copy number and/or limited input volume; 2. The region is near to the 5′ location which would allow for preferential preservation during exonuclease III degradation of the EBV dsDNA; and 3. cfDNA residing within CpG islands is more stable and thus less susceptible to degradation and more likely to be detected amongst the presence of human cfDNA.
In the other example, Mycobacterium tuberculosis, plasma sample was selected from a tuberculosis patient because there were bacterium and human cfDNA present in the plasma and the genome of Mycobacterium is rich in GC content. Our analysis showed CpG islands cover 99.9% of the Mycobacterium tuberculosis genome. cfDNA was extracted from the tuberculosis plasma sample and undergone whole genome sequencing (WGS). All reads were mapped to Mycobacterium tuberculosis (Reference genome M.TB H37Rv) and aligned to the CpG islands data mentioned earlier. The results showed all reads belong to CpG islands on the Mycobacterium tuberculosis genome. In order words, highly fragmented and rare Mycobacterium tuberculosis cfDNA within CpG islands can be sequenced despite the abundant presence of human cfDNA in plasma sample. These results imply the advantage of designing cfDNA assay targeting CpG islands in microorganisms.
Materials and Methods
Clinical Samples
The study was approved by the Centralised Institutional Review Board, SingHealth (Reference number: 2013/354/B) and all methods were carried out in accordance with the approved guidelines. A total of 46 NPC patients, all of Asian ethnicity, who provided informed written consent, were recruited into the study between June 2013 and October 2014 (Table 1). 20 mL of blood was collected in EDTA tube (BD Biosciences) at baseline and one month after treatment. All stage-I and most of stage-II patients received only radiotherapy whereas most patients from stage III and IV received combined chemo-radiotherapy. Only 3 patients received adjuvant chemotherapy. A total of 28 matched serial samples, pre- and post-treatment, were collected. The post-treatment radiological response of all patients was based on their first magnetic resonance imaging/computed tomography scan after treatment (Table 5). The median follow-up was 18.7 months.
Participating Laboratories and Clinic
Institute of Bioengineering and Nanotechnology (IBN) served as the centralised laboratory of the study. Blood samples were collected from consenting NPC patients at National Cancer Centre Singapore, and sent to IBN within the same day of their visits within 4 hours. For each sample, whole blood was used for immediate CTC enumeration, and plasma was obtained, assigned blinded IDs and stored at −80° C. until further use. Each plasma assay had its individually optimized volumes. 250 μL of frozen plasma was distributed to Singapore General Hospital (SGH) where cfDNA extraction and quantification was performed using the Sentosa® SA EBV Quantitative PCR Test (Vela Diagnostics) following manufacturer's requirements. At IBN, 1 mL of thawed plasma was used for cfDNA extraction of which half was quantified by the in-house BamHI-W assay. The other half of the extracted cfDNA was sent to JN Medsys where cfDNA quantification was conducted using the Clarity™ Digital PCR System (JN Medsys).
BamHI-W qPCR Assay
50 μL of cfDNA was extracted from 1 mL of thawed plasma using the QlAamp Circulating Nucleic Acid Kit (Qiagen). The BamHI-W7 primers (Sigma Aldrich) and dual-labelled BamHI-W7 hydrolysis probe (Life Technologies) were designed for the amplification of a 143-bp region of BamHI-W. Each 20-μL reaction consisted of 1× Taqman® Fast Advanced Master Mix (Life Technologies), 400 nM BamHI-W7 primers (sense 5′- AGATCTAAGGCCGGGAGAGG-3′ (SEQ ID NO.: 2) and antisense 5′-CGCCCATTCGCCTCTAAAGT-3′) (SEQ ID NO.: 3), 100 nM BamHI-W7 probe (5′-(6-FAM)CTCTGGTAGTGATTTGGACCCGAAATCTG(TAMRA)-3′) (SEQ ID NO.:4) and 2 μL of DNA template, which was equivalent to 40 μl of plasma. Standard calibrators for BamHI-W were generated with 8 dilutions of DNA derived from EBV-immortalised cell lines ranging from 1 to 107 BamHI-W copies per reaction. qPCR was performed using the ViiA™7 Real-time PCR System (Life Technologies). Each run included patients' cfDNA, standard calibrators, EBV-positive, -negative and no-template controls (NTCs). The reactions were run at 50° C. for 2 min, followed by 95° C. for 20 sec to activate Uracil N-Glycosylase (UNG) and AmpliTaq® Fast DNA Polymerase, respectively. Subsequently, the reactions underwent 40 two-step cycles of denaturation and annealing at 95° C. for 1 sec, and 60° C. for 20 sec, respectively. The BamHI-W copy number was automatically calculated from ViiA™7 software based on the BamHI-W standard calibrator of each run, with R2=0.99, qPCR efficiency=98-100%, m=(−3.315)−(−3.368). Initial optimization of the BamHI-W assay was conducted by conventional PCR using EBV-positive C666-1 DNA (see
EBNA1-qPCR Assay
The Sentosa® SA EBV Quantitative PCR Test (Vela Diagnostics) was applied for quantification of EBV cfDNA with the aid of the integrated Sentosa® SX101 (Vela Diagnostics) and Rotor-Gene® Q MDx 5-plex HRM (Qiagen) instruments. 60 μL of DNA was automatically extracted from 200 μL of plasma using the Sentosa® SX Virus Total Nucleic Acid Kit v2.0 (Vela Diagnostics). 10 μL of purified DNA, equivalent to 33 μL of plasma was used for each reaction. The PCR master mix contained reagents and enzymes for the amplification of a 79-bp fragment of EBNA1, as well as a second set of primers/probes designed to detect EC3, a control for PCR inhibition and cfDNA extraction. The concentration of EBNA1 was automatically calculated based on the imported standard curve, with R2=0.99, qPCR efficiency=98%, m=(−3.367). The clinical sensitivity and specificity of the assay was reported as 100% and 98.8% respectively.
EBNA1-d PCR Assay
The Clarity™ Digital PCR System (JN Medsys) was used. The assay was designed to amplify a 118-bp fragment of EBNA1. Each 15-μL reaction consisted of 1× FastStart Essential DNA Probes Master (Roche), 200 nM EBNA1 primers (sense 5′-TCATCATCATCCGGGTCTCC-3′ (SEQ ID NO.: 86) and antisense 5′-GCTCACCATCTGGGCCAC-3′) (SEQ ID NO.: 87), 200 nM probe (5′-(6-FAM)CCTCCAGGTAGAAGGCCATTTTTCCACCCTGTAG(IABKFQ)-3′) (SEQ ID NO.: 88) (Integrated DNA Technologies), 1× Clarity™ JN Solution (JN Medsys), 0.15 U UNG (Roche) and 3 μL of plasma DNA or controls. The equivalent plasma volume per reaction was 60 μL. Each reaction mix was incubated at 40° C. for 10 min to allow UNG to degrade carry-over PCR products, followed by 95° C. for 10 min for UNG inactivation. The reaction mix was partitioned into approximately 10,000 individual reactions in the Clarity™ Digital PCR tube-strip (JN Medsys). Thereafter, the tube-strips were stabilised for 2 min, sealed with 230 μL sealing fluid and subjected to thermal cycling using the following parameters: 1 cycle at 95° C. for 5 min, 40 cycles at 95° C. for 50 sec and 58° C. for 1.5 min. Afterward, the tube-strips were transferred to the Clarity™ Reader (JN Medsys), which detected and quantified fluorescence signals from all partitions. Absolute copy number of EBNA1 in each reaction was determined by the Clarity™ Software (JN Medsys) after analysis of the ratio of positive partitions (i.e. those that contained amplified products) over the total number of partitions, using Poisson statistics.
Determination of Sensitivity and Specificity of EBV cfDNA Assays
All three EBV cfDNA assays were benchmarked against the EBV qPCR assay routinely performed by the College of American Pathologists (CAP)-certified laboratory in SGH. The clinical sensitivity and clinical specificity of the SGH assay was reported as 79% and 100% respectively, based on 66 untreated nasopharyngeal carcinoma patients and 30 normal volunteers. In addition, sensitivity and specificity of EBV cfDNA assays were benchmarked against the 1st World Health Organization (WHO) International Standards for EBV, code 09/260; from National Institute for Biological Standards and Control (NIBSC). The NIBSC standards and nuclease-free water were spiked into EBV-free plasma to obtain 18 standards of 6 known EBV concentrations, ranging from 0 to 1,000,000 IU/mL. In addition, two aliquots of EBV-free plasma served as blank standards. The protocol of DNA extraction, sample distribution and EBV cfDNA assays of spike-in standards was identical to the one for clinical plasma samples.
Enumeration of NPC CTCs
CTCs from 1 mL of whole blood were captured using the microsieve technology and enumerated with the aid of biomarker characterization. The microsieve technology is a size-based method capable of isolating both epithelial and mesenchymal CTCs, unlike the affinity system, which only captures EpCAM-expressed CTCs. Cell counting, and image analysis were performed subject to sample availability, using the MetaMorph software (Molecular Devices) and manually verified by trained laboratory technicians. Cytokeratin-positive and CD45-negative nucleated cells were classified as canonical CTCs. Other nucleated cells that were negative for both cytokeratin and CD45 biomarkers were defined as potential CTCs. All nucleated cells with CD45-positive were classified as white blood cells.
Statistical Analysis
Correlation study was carried out to correlate EBV levels amongst the NPC circulating biomarkers assays. Logistic ordinal regression modelling was used to evaluate pre-treatment circulating biomarker quantitation relative to the dependent variable of clinical stage. Wilcoxon's signed-rank test with continuity correction (R.3.0.0) was conducted to compare paired pre and post-treatment levels of NPC circulating biomarkers. Correlation was performed using Microsoft Excel and the logistic ordinal regression model was performed using the “orm {rms}” library package in R. Alpha was set to 0.05 throughout. Survival analysis was performed using R 3.0.0 survival package to study survival distributions of continuous pre-treatment levels of NPC circulating biomarkers and overall survival (Table 3), using log-rank testing to determine significance at a threshold of 0.05. 1 patient (Patient-025) was omitted from survival analysis, as the patient sought follow-up elsewhere.
Number | Date | Country | Kind |
---|---|---|---|
10201510448Q | Dec 2015 | SG | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/SG2016/050611 | 12/19/2016 | WO | 00 |