This application includes a sequence listing in computer readable form (a “xml” file) that is submitted herewith on an ST.26 XML file named Sequence_Listing.xml, created on Oct. 13, 2022 and 66,410 bytes in size. This sequence listing is incorporated by reference herein.
The present invention relates to an engineered DNA manufacturing technology, and, in particular, to a system and process for the production of hairpin loop ended self-complementary double-stranded covalently closed linear DNA vectors, the resulting molecules and their uses.
Gene therapy is defined as the introduction of a foreign genetic material into a host cell in vivo (e.g., inside the host organism) or ex vivo (e.g., a cell that was transiently removed from the host organism) to prevent, treat, or ultimately cure, diseases by altering the expression of a gene. Some naturally occurring gene mutations result in proteins not being expressed correctly or not being expressed at all which can lead to genetically borne disorders. Gene therapy works by repairing, turning-off, or replacing dysfunctional genes that cause disease with the aim of (re)establishing normal function. Also, other pathological conditions (which may or may not include genetic disorders), such as cancer or heart disease, among others, can be treated by supplying genetic sequences that express biologically active factors (RNA and/or proteins) that promote cell death, or that promote or inhibit certain metabolic pathways, in order to overcome or collaborate in the treatment of that disease. Nowadays protein-coding nucleic acids are also used for the in vivo production of vaccine antigens, expanding the productive front for the development of gene therapies aimed at infectious diseases.
During the past years, there has been great interest in the field of gene and cell therapy and the first approved therapies have seen light following decades of efforts. Approved gene therapies include the first oligonucleotide-based therapies such as Spinraza, Exondys 51, Vyondys 53, three ex vivo cell therapies such as Kymriah, Yescarta and Tescartus, and two in vivo gene therapies such as Luxtuma and Zolgensma, with more following close behind. These therapies have been designed to treat diverse diseases, including cancer, neuro-muscular disease, and inherited blindness. These approved therapies are life-changing for the affected patients and they harbor an enormous potential for extrapolation to many other pathological conditions. The success of in vivo delivered Adeno-associated virus (AAV)-based gene transfer into the human retina by Luxtuma, and into the central nervous system by Zolgensma has facilitated the development of other AAV-based gene therapies for gene delivery into the liver to treat Hemophilia, and into skeletal muscle cells to treat Duchenne muscular dystrophy. As well, there have been developments using ex vivo lentiviral and retroviral gene transfer to T cells that led to adoptive cell immunotherapy that expanded to modification of hematopoietic stem cells, enabling therapies for inherited disorders, such as sickle cell disease and beta thalassemia recently approved in the European Union and currently under review in the United States.
In addition, it is important to mention that gene therapies with viral vectors can present immune-related problems in patients, and there are also obstacles in the manufacturing processes, since expensive and not entirely efficient eukaryotic cell systems are required to produce multiple doses. Usually, it is necessary to administer a high quantity of recombinant virions for each patient to achieve an adequate level of therapeutic success.
While of course there is an exciting potential for these early gene therapy achievements to be applied to other medical disorders, the use of viral-based technologies is generating increasing concern in treating human diseases. The major primary obstacle to a broader application continues to be the immune response to viral vectors and potentially harmful side-effects derived from foreign transgenes and foreign gene-controlling promoters. Consequently, the most important near-future work must be focused on the control of the immune system. For example, despite the remarkable success of several AAV-based gene therapies, 50% of patients are currently excluded from treatment due to pre-existing natural immunity to the adenoviral capsids. Recent advances and efforts in clinical trials, such as engineering the AAV capsids, have led to technological advances that avoid this immune obstacle, but there are still concerns when redosing is mandatory to generate a significant therapeutic effect.
Thus, while viral vectors offer great potential to deliver a genetic payload with high efficiency into target cells, they inevitably pose several safety issues, including genotoxic events such as the upregulation of proto-oncogenes, or the undesired activation of nearby genes. All of these vector-derived toxicities depend both on the virus type used, as well as on other non-viral factors, including patient age, overall health status, dose, route of administration and other variables that can also influence the genotoxic potential. Also, viral vectors can cause a regeneration of virulent particles adding yet another layer of safety concerns to take into consideration.
In contrast to viral vectors, non-viral strategies based on naked, lipoplexed, or polyplexed genetic material can offer safer and effective alternatives. DNA vectors are easy to generate, purify and store, while offering extraordinary engineering capacity. However, several barriers must be overcome first in order to deliver safe and effective genetic therapies in vivo.
The most used and researched genetic vectors are the circular covalently closed double-stranded plasmid DNA molecules, which have been used for several decades for gene transfer into eukaryotic cells. Despite their great ability to express coding sequences in host cells, these circular supercoiled molecules have a number of disadvantages that make them unsafe for use in medical applications. Such disadvantages include the presence of bacteria-derived sequences and genes, such as prokaryotic origins of replication and antibiotic resistance genes that could be transferred to the patient’s microbiota leading to yet unknown disorders. To overcome said limitations, several vectors have been designed, such as DNA ‘minicircles’, lacking an origin of replication, antibiotic resistance, or other bacteria-derived sequences. Unfortunately, these molecules are still not effective enough due to their supercoiling structure that decreases the effectiveness with which they gain access to the nucleus.
There are other barriers that the DNA molecules have to overcome to get into the cell with their genetic cargo intact so as to be expressed. It is well known in the state of the art that nucleases present in patients' sera are able to digest unprotected DNA. This is why nucleic acid-based vectors must rapidly overpass the cell plasma membrane. This DNA vector transmembrane transport is complicated because cell membranes represent a lipoprotein barrier that hampers efficient nucleic acid translocation per se. To overcome this limitation, several strategies have been and are being developed, such as nucleic acid complexing with synthetic lipid nanoparticles to form a structure similar to the plasma membrane, or receptor-targeting nanoparticles for receptor-mediated endocytosis, among others. Although some of these strategies allow successful transmembrane transport of nucleic acids, once inside the cell these still show low efficiency in gaining access to the nuclear compartment, a site where they must enter to achieve gene expression.
The successful expression of a genetic sequence inside a host target cell also depends on a number of intracellular events. Immediately after the entry into the cytoplasm, naked DNA - either delivered through physical techniques such as electroporation, or after disassembly of DNA-carrier complexes - should associate with a large number of cellular proteins that mediate subsequent interactions with the microtubule network and move towards the microtubule organizing center and the nuclear envelope. DNA molecules then should enter the nucleus either after the mitotic nuclear envelope reorganization or through nuclear pore complexes in the absence of cell division, using a different set of proteins.
Although nuclear import of DNA molecules is not a normal event in the cell, several mechanisms exist for their transportation. Some of these nuclear transportation mechanisms have evolved over billions of years by viruses and other pathogens, allowing them to successfully infect their hosts, while others appear to be fortuitous “hijackings”, as is the case of the SV40 virus genome which spontaneously associates to proteins while on their way to the nucleus. Regardless, the mechanism is the same: a DNA molecule containing Nuclear Targeting Sequences (NTS), which are short consensus motifs recognized by specific transcription factors (TFs) carrying nuclear localization signals (NLS) (either provided by the host or the pathogen), is directed to the nucleus by interacting with such proteins. Thus, these NTS sequences bind to TFs and target the molecules into the nucleus by an importing-mediated process.
Several nuclear importation techniques for nucleic acids are currently being developed. Some of these techniques include the modification of DNA vectors with a covalent linkage of proteins bearing NLS, and attachment of import receptors such as Karyopherins. Engineering of DNA with NTS seems to result in highly efficient translocation of linear DNA molecules but not supercoiled circular closed ones. This difference provides an interesting approach for in vivo uses of linear DNA vectors for medical applications.
Nowadays, cutting-edge technology is focusing on a group of novel DNA molecules: hairpin loop ended self-complementary double-stranded covalently closed linear DNA vectors. These offer a different approach compared to conventional circular covalently closed plasmid DNA molecules, posing great promises regarding efficacy and safety. Besides topological differences with circular covalently closed plasmid DNA molecules, hairpin loop ended linear covalently closed DNA molecules are torsion free, as they are not subjected to gyrase-mediated supercoiling. Moreover, as well as classic DNA plasmids, they are also not subjected to exonuclease activity in prokaryotes nor eukaryotes. They are also protected from mammalian nucleases present in sera due to their hairpin loop covalently closed ends, thus preventing degradation. Therefore, this type of DNA molecules has clear advantages over minicircles, and can also compete with viral vector systems in terms of safety, design and redesign versatility, and scale production.
There are several in vitro strategies for the generation of linear covalently closed DNA molecules. These include capping of PCR amplification products with self-complementary oligonucleotides, enzymatic digestion of previously amplified and purified backbones followed by ligation with hairpin sequences for telomeric end-closing (doggyBones), and generation of minimalistic immunologically defined gene expression (MIDGE) vectors using Phi26 and Telomerase enzymes. These technologies have shown promising results in several applications including the development of vaccines and therapeutic molecules. Also, these vectors have demonstrated up to 25-fold improved transgene expression profile when compared to classical circular counterparts. Thus, hairpin loop ended linear covalently closed DNA molecules outperform circular covalently closed DNA molecules in expression efficiency, and bio-/immune-compatibility. However, large scale production of linear covalently closed DNA molecules using the aforementioned in vitro technologies is expensive and time-consuming because several recombinant enzymes have to be produced and purified prior to manufacturing the linear DNA molecules. Considering these concerns, improvements in production processes are necessary.
As mentioned above, biological mechanisms associated to the production of linear covalently closed double-stranded DNA molecules occur in nature. Linear DNA molecules, as plasmids and chromosomes, were initially unknown in prokaryotes but have now been found in spirochaetes, Gram-positive, and Gram-negative bacteria. At least, two structural types of bacterial linear DNA have been characterized: a) linear plasmids of the spirochaete Borrelia with a covalently closed hairpin loop at each end; and b) the linear plasmids of the Gram-positive filamentous Streptomyces with a covalently attached protein at each end. Replicons with similar structures are more frequent in eukaryotic cells than in prokaryotes. Also, several bacteriophages have linear genomes with covalently closed telomeres, such as Bacteriophages N15, PY54, VP882, ΦHAP-1, φKO2, Bam35, VP58.5, vB_VpaM_MAR, VHML, amongst others.
All these bacterial and phage linear hairpin-ended DNA genomes are maintained in a linear fashion by enzymes called recombinases that are able to perform double-stranded DNA cuts and rearrangements, generating hairpin ends in the linear molecules.
Lyme disease-causing bacteria Borrelia burgdorferi has about a dozen of linear DNA molecules that carry covalently closed hairpin ends generated by specialized mechanisms for dealing with the end-replication problem. The hairpin telomeres are generated from replicative intermediates through a unique reaction: breakage of two phosphodiester bonds in a single DNA duplex (one on each strand) and joining of each end with the opposite DNA strand to form a covalently closed hairpin end. This process occurs through a two-step transesterification reaction by ResT, involving the formation of a covalent protein-DNA intermediate at a position three nucleotides from the axis of symmetry in each strand of the substrate.
As well, the N15 phage shows a similar DNA end pattern. N15 phage is a lambdoid virus of Escherichia coli whose prophage is not integrative but is a linear DNA with covalently closed ends. The virus DNA has single-stranded cohesive ends, named cosL and cosR, that allow the formation of a circular double-stranded DNA within the cell, which in turn generates an inverted repeat (telRL). A virus-encoded protelomerase then recognizes that sequence and joins the phosphodiester bonds, producing covalently closed ends (telL and telR) and, in consequence, a linear hairpin-ended DNA genome.
There have been previous efforts to generate hairpin ended linear covalently closed double-stranded DNA molecules in vivo, but there are still several concerns in terms of yield and scalability to address application requirements. Some in vivo methods for producing linear DNA molecules rely on the amplification of conventional circular covalently closed DNA plasmid vectors with relaxed origins of replication that generate between 200 and 700 copies per cell and cutting of such plasmid vectors with a protelomerase to generate linear DNA molecules. Said linear DNA molecules are co-purified with their parental DNA backbone, thus needing several additional purification steps to get rid of the parental DNA sequence that contains the antibiotic resistance cassette and the bacterial origin of replication.
None of the existing solutions is capable of providing an efficient and safe hairpin loop ended self-complementary double-stranded covalently closed linear DNA vector manufacturing system and process that allows a large-scale production without the resource consumption caused by the systems and processes known so far. Therefore, there is a need for a hairpin loop ended self-complementary double-stranded covalently closed linear DNA vector manufacturing system and process that is safer, simpler, more efficient and that results in a higher productivity compared to the systems and processes known in the state of the art.
The present disclosure provides, in a first aspect, an engineered parental circular covalently closed synthetic plasmid DNA comprising:
In a second aspect of the present invention, a helper plasmid comprising a first and a second synthetic genetic construction is disclosed, wherein the first synthetic genetic construction comprises at least a first repressor protein under at least a first constitutive promoter, at least a second repressor coding sequence under at least a second constitutive promoter, at least a third repressor coding sequence under at least a third constitutive promoter, at least one endonuclease recognition sequence, and wherein the second synthetic genetic construction comprises at least one DNA replicating protein under the control of a first inducible promoter, at least one recombinase protein coding sequence under the control of a second inducible promoter, and at least one endonuclease coding sequence under the control of a third inducible promoter.
In a third aspect, a recombinant cell comprising the helper plasmid is provided.
According to a fourth aspect of the invention, a hairpin loop ended self-complementary double-stranded linear covalently closed DNA vector production system is provided, wherein such system comprises:
In a fifth aspect of this invention, a hairpin loop ended self-complementary double-stranded linear covalently closed DNA vector production process is provided, starting from the aforementioned system, wherein said process comprises the stages of:
In a sixth aspect of this invention, a hairpin loop ended self-complementary double-stranded linear covalently closed DNA vector is provided, wherein such vector is produced by the aforementioned process.
The invention is depicted in the Detailed Description with reference to the accompanying drawings.
Although the exemplary embodiments described herein are susceptible to various modifications and alternative forms, specific embodiments have been shown by way of example in the drawings and will be described in detail below. However, the exemplary embodiments described herein are not intended to be limited to the particular forms disclosed.
The general operating characteristics and advantages of the present invention will now be described in greater detail in this section in connection with the preferred embodiments, which should be considered as exemplary only and not limited to the present invention.
According to a first aspect of the present invention an engineered parental circular covalently closed synthetic plasmid DNA is provided (“Parental Plasmid”, shown in
In accordance with some embodiments of the present invention, the linear DNA module scaffold additionally comprises eukaryotic origins of replication such as, but not limited to, SV40 Origin of Replication.
In a preferred embodiment, the recombinase recognition sequence is selected from the group consisting of PhiC31 attP recombinase recognition sequence, phiC31 attP recombinase recognition site, CRE recombinase recognition sequence (LoxP), Flp recombinase recognition sequence, Lambda recombinase recognition sequence, ResT recombinase recognition sequence, φpK02 telRL site, the FRT site, TelN recombinase recognition sequence, PY54 Tel recombinase recognition sequence, VP882 TelK recombinase recognition sequence, ΦHAP-1 Tel recombinase recognition sequence, B. burgdorferi Tel recombinase recognition sequence, B. hermsii Tel recombinase recognition sequence, B. parkeri Tel recombinase recognition sequence, B. recurrentis Tel recombinase recognition sequence, B. turicatae Tel recombinase recognition sequence, B. anserine Tel recombinase recognition sequence, A. tumefaciens C58 ResT recombinase recognition sequence, and combinations thereof.
In a preferred embodiment, the endonuclease recognition sequence is selected from the group consisting of I- SceI endonuclease recognition sequence, F-SceI endonuclease recognition sequence, F-SceII endonuclease recognition sequence, I-AniI endonuclease recognition sequence, I-CeuI endonuclease recognition sequence, I-ChuI endonuclease recognition sequence, I-CpaI endonuclease recognition sequence, I-CpaII endonuclease recognition sequence, I-CreI endonuclease recognition sequence, I-CsmI endonuclease recognition sequence, I-DmoI endonuclease recognition sequence, I-PorI endonuclease recognition sequence, I-ScaI endonuclease recognition sequence, I-SceII endonuclease recognition sequence, I-SceIII endonuclease recognition sequence, I-SceIV endonuclease recognition sequence, PI-SceI endonuclease recognition sequence, PI-PspI endonuclease recognition sequence, PI-TliI endonuclease recognition sequence, F-SuvI endonuclease recognition sequence, F-TevI endonuclease recognition sequence, F-TevII endonuclease recognition sequence, I-DirI, I-HmuI endonuclease recognition sequence, I-HmuII endonuclease recognition sequence, I-PpoI endonuclease recognition sequence, I- TevIII recognition sequence, and combinations thereof.
In a preferred embodiment, the DNA replication enzyme recognition sequence is selected from the group consisting of Phi29 DNA replication enzyme recognition sequence and P2-A replication enzyme recognition sequence.
In a preferred embodiment, the bacterial selection system is selected from the group consisting of antibiotics resistance selection systems such as, but not limited to Ampicillin, Kanamycin, Gentamicin and Chloramphenicol, dapD complementation selection system, infA complementation selection system, amino acid auxotrophy complementation selection system, RNA-OUT selection systems, and combinations thereof.
In a preferred embodiment, wherein the bacterial replication origin is selected from the group consisting of pUC19-derived replication origin, pBR322-derived replication origin, p15A-derived replication origin, pSC101-derived replication origin, Pir-R6K replication origin, trfA-oriV replication origin, M13 Phage replication origin, f1 Phage replication origin, PhiX174/ProteinA Ori replication origin, and pT181 (RepC) replication origin.
In a preferred embodiment, the DNA nuclear targeting sequence is selected from the group consisting of SV40 enhancer nuclear targeting DNA sequence, NF-kB nuclear targeting DNA sequence, GRE nuclear targeting DNA sequence, smooth muscle-specific nuclear targeting DNA sequence, smooth muscle γ-actin (SMGA) promoter nuclear targeting DNA sequence, Sox2 regulatory region 2 (SRR2) nuclear targeting DNA sequence, Karyopherin (importin) β1, isoform 1 (Kpnβ1) nuclear targeting DNA sequence, RanBP7/importin 7 (Ipo7) nuclear targeting DNA sequence, RAN nuclear targeting DNA sequence, Ran binding protein 1 (RanBP1) nuclear targeting DNA sequence, Ran binding protein 3 (RanBP3) nuclear targeting DNA sequence, SYTE-DNTS1 nuclear targeting DNA sequence, and combinations thereof.
In a preferred embodiment, the nuclear matrix-association sequence is selected from the group consisting of human b-interferon gene-derived S/MAR, human immunoglobulin heavy chain-derived S/MAR, human apolipoprotein B-derived S/MAR, chicken lysozyme-derived S/MAR, other vertebrate-derived S/MAR, SV40T mutated sequence, β-Globin Replicator sequence, Epstein-Barr virus (EBV) elements oriP, and EBNA1.
In a preferred embodiment, the Linear DNA Module Scaffold further comprises one or more eukaryotic origins of replication.
In further embodiments, the eukaryotic origin of replication is selected from the group consisting of Homo sapiens LOC107198088 ORI, Homo sapiens LOC107133510 ORI, SV40 Origin of replication, Epstein Barr Origin of replication, Bovine Papilloma Virus Origin of replication, and combinations thereof.
In a preferred embodiment, the promoter sequence is selected from the group consisting of RNA polymerase II constitutive mammalian promoters, such as the human cytomegalovirus (CMV) immediate-early enhancer and promoter (pCMV), the early promoter of the simian virus 40 (pSV40), the polyubiquitin gene promoter (pUBC), the hybrid construct of the cytomegalovirus (CMV) enhancer fused to the chicken beta-actin promoter (pCAGG), the human elongation factor-1 alpha promoter (pEF1A), the human phosphoglycerate kinase 1 promoter (pGK), inducible promoters constituted by the minimal CMV promoter and different response elements such as bacterial operators (e.g., lac repressor, tetracycline repressor) and other DNA binding sequences where artificial transcription factors can be associated, and RNA polymerase III promoters such as the H1 RNA promoter (pH1), the U6 promoter or the 7SK promoter.
In embodiments of the present invention, the transcription terminator sequence is selected from the group consisting of human α-globin polyadenylation signal sequence, human β-globin polyadenylation signal sequence, rabbit α-globin polyadenylation signal sequence, rabbit β-globin polyadenylation signal sequence, mouse k-light chain polyadenylation signal sequence, chicken ovalbumin polyadenylation signal sequence and small polyT stretch (5-6) for RNA pol III promoters.
In a preferred embodiment, the synthetic transcriptional unit can comprise at least one of the following genetic elements:
Internal Ribosome Entry Sites (IRES) are natural or artificial RNA sequences typically present in the 5' untranslated region of mRNAs that undergo intracatenary folding to render a three-dimensional structure that resembles that of eukaryotic factors involved in the initiation of translation. Hence, IRES make certain posttranscriptional modifications of the mRNA (such as capping) and said initiation factors superfluous, allowing the mRNA to enter directly into a translationally active state. When placed between different open reading frames in an mRNA, IRES can be used to achieve polycistronic translation in eukaryotes to render several proteins from a single mRNA molecule. Some IRES can even enhance translation of a downstream coding sequence.
In some embodiments, the IRES element is derived from a DNA sequence from an organism including, but not limited to, an animal, a plant, a fungus, a bacterium, a virus, an alga or a protozoan.
In some embodiments, viral IRES are sequences at least partially derived from a virus including, but not limited to, encephalomyocarditis virus (ECMV), Coxackievirus B3 (CVB3), Taura Syndrome Virus (TSV), Triatoma Virus (TV), Human mastadenovirus C (HAdV-C), Human adenovirus 5 (HAdV5), Human adenovirus 7 (HAdV7), Human mastadenovirus B (HAdV-B), Hepatitis GB virus B (HGBV-B), HPV31, Human Poliovirus 3 (HPV-3), Human papillomavirus type 11 (HPV11), Human immunodeficiency virus 1 (HIV-1), Human immunodeficiency virus 2 (HIV-2), Simian T-lymphotropic virus 1 (STLVs-1), Human Parvovirus B19 (HPB19), Human betaherpesvirus 6B (HHV-6B), Human alphaherpesvirus 3 (HHV-3), Human papillomavirus type 41 (HPV41), Human papillomavirus type 6b (HPV6b), Alphapapillomavirus 7, Friend murine leukemia virus, Heilovirus (ThV), Rous sarcoma virus (RSV), Human mastadenovirus F (HAdv-F), Human papillomavirus type 4 (HPV4), Human papillomavirus type 63 (PHV63), Human mastadenovirus A, Feline immunodeficiency virus (FIV), Human T-lymphotropic virus 2 (HTLV-2), Jaagsiekte sheep retrovirus (JSRV), Spleen focus-forming virus (SFFV), Mouse mammary tumor virus (MMTV), Murine osteosarcoma virus (MOV), Ovine lentivirus (OLV/OvLV), Squirrel monkey retrovirus (SMRV), Human papillomavirus type 16 (HPV16), Human papillomavirus type 5 (HPV5), Human polyomavirus 1, Rabies lyssavirus, Simian immunodeficiency virus (SIV), Human papillomavirus type 10 (HPV10), Human papillomavirus type 26 (HPV26), Human papillomavirus type 32 (HPV32), Human papillomavirus type 34 (HPV34), Marburg margurgvirus, Enterovirus A, Rhinovirus A, Human betaherpesvirus 6A (HHV-6A), Macaca mulatta polyomavirus 1, Snakehead retrovirus (SnRV), Bovine foamy virus (BFV), Drosophila C virus (DCV), Hendra henipavirus (HeV), Aichi virus 1 (AiV-1), Influenza B virus (IBV), Zaire ebolavirus (ZEBOV), Human coronavirus 229E (HCoV-229E), Nipah henipavirus (NiV), Human respirovirus 1 (HPIV-1), Modoc virus (MODV), Sudan ebolavirus, Human parvovirus 4 G1, Rotavirus C, Human gammaherpesvirus 4 (Epstein-Barr virus).
In some embodiments, eukaryotic IRES are sequences at least partially derived from mRNAs from different species including, but not limited to, Homo sapiens IRES, Aplysia californica IRES, Canis lupus familiaris IRES, Drosophila melanogaster IRES, Gallus gallus IRES, Mus pahari IRES, Mus musculus IRES, Saccharomyces cerevisiae IRES, Zea mays IRES, Rattus norvegicus IRES, Ovis aries IRES.
In some embodiments, the IRES element comprises a novel synthetic proprietary IRES, for instance: SYTE-IRES1, SYTE-IRES2, SYTE-IRES3, SYTE-IRES4, amongst others.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide includes at least one IRES flanking at least one or more open reading frames. In some embodiments, the IRES flanks both sides of one or more open reading frames. In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide includes at least one IRES sequence on one or both sides of each open reading frame.
trM6-methyladenosine (m6A) is a reversible epitranscriptomic modification found in several eukaryotic mRNAs, and when present in the 5’ untranslated region (5'UTR), even a single m6A, promotes cap-independent translation. These sites are called m6A induced ribosome engagement sites (MIRES).
MIRES stimulate selective mRNA translation in stress conditions by a mechanism that involves the direct binding of the initiation factor eIF3. MIRES-dependent translation initiation should contain an RRACH (R = Guanine or Adenine, H = Adenine or Cytosine or Thymine) consensus motif of m6A modification close to the start codon.
The m6A machinery allows the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide to be marked as self or endogenous. When using chemical or enzymatic circularization of RNA molecules, it is beneficial to use m6A dependent translation to avoid unwanted immune responses. When m6A RRACH motifs are incorporated in a circular polyribonucleotide, RIG-1 mediated immunogenicity is decreased by 10,000-fold, compared to mutated m6A RRACH motifs.
When placed between different open reading frames in an mRNA, MIRES can be used to achieve polycistronic translation in eukaryotes to render several proteins from a single mRNA molecule. Some MIRES can even enhance translation of a downstream coding sequence.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may be translated by another cap-independent mechanism. In some embodiments the cap-independent mechanism is mediated by the methylation of the nitrogen at position 6 in the adenosine base, N6-methyladenosine (m6A), within the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may have one or more m6A sites close to the start codon.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide can comprise untranslated regions (UTRs), that are genetic sequences that can be transcribed but are not translated.
In some embodiments, one or more UTR sequences may be included upstream of the translation initiation sequence of an open reading frame (5'UTRs). In some embodiments one or more UTR sequences may be included downstream of an expression sequence (3'UTRs).
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide comprises at least one UTR sequence with embedded tranches of Adenosines and Thymidine within. In some embodiments, these A-U rich elements (AREs) may increase the expression rates of the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide.
In some embodiments, introduction, removal or modification of UTR AREs may be useful to modulate the stability, immunogenicity and translation rate of the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may contain one or more copies of AREs that may modulate translation of the coding sequence.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may be engineered to remove AREs to modulate the intracellular stability and thus the translation of the coding sequence.
It should be understood that any UTR from any gene and any organism may be incorporated into the respective flanking regions of the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide. Furthermore, multiple wild-type UTRs of any known gene may be utilized. It is also within the scope of the present invention to provide novel synthetic proprietary UTRs which are not variants of wild type genes.
In some embodiments, the UTRs or portions thereof, may be placed in the same orientation or altered orientation or location as from where they were selected.
In some embodiments, a 5' or 3' UTR may be inverted, shortened, lengthened and/or chimerized with one or more other 5' or 3' UTRs.
As used in this patent, the term “altered” as it relates to a UTR sequence, means that the UTR sequence has been changed in some way in relation to a reference sequence.
In some embodiments, more than one 5’ and/or 3' UTR may be used. For example, a “double” UTR is one in which two copies of the same UTR are encoded in series or substantially in series.
In some embodiments, the UTRs or portions thereof may comprise a group I self-splicing intron system known as PIE (Permuted Intron-Exon) that allows the formation of circular RNA molecules. There are several group I intron precursor RNAs derived from organisms such as Tetrahymena spp or Anabaena spp, and viruses such as T4 phage, which contain end-to-end fused exons that interrupt half intron sequences that in turn self-splice to generate a circular RNA exon. Exon sequences can be replaced by foreign sequences from a group I intron self-splicing system to make desired circular sequences, and that is possible because exon sequences are not involved in the self-splicing reaction. The Permuted Intron Exon method includes two transesterifications at defined splice sites; and, after the splicing, the ends are ligated forming circular RNA molecules. More specifically, the first transesterification releases the 3'-end sequence (5’-half intron) of the sequence. After, the free 3’OH group of the 3’-half exon attacks the 3’-splice site during the second transesterification, generating circRNA molecules. This method has been used to generate RNA aptamers, hammerhead ribozymes and hepatitis delta ribozymes.
In some embodiments, the UTRs or portions thereof may comprise a group II introns that can be modified for inverted splicing and thus generate circular RNA molecules. Yeast self-splicing group II intron can catalyze the formation of circular RNA molecules. This method requires the rearrangement of the exons in a consecutive way, where the branch point has to be positioned upstream and the intronic sequences flanking the exons. This method allows the circularization of the molecules by two transesterifications. This group II method is way more efficient than the group I intron method, however this approach carries 2’-5’ phosphodiester bond at the ligation site.
In some embodiments, the UTRs or portions thereof may comprise a Direct back- splicing event wherein the event involves joining the 3’-tail of a downstream exon to the 5’-head of an exon normally located upstream. The downstream splice donor pairs with an unspliced upstream splice acceptor and as a result, the exon becomes circularized.
In some embodiments, the UTRs or portions thereof may comprise a lariat intermediate that contains an exon produced from exon skipping, that subsequently undergoes internal splicing, thereby removing the intron and generating a circularized RNA molecule.
The sequence of interest may be a coding, translation competent genetic sequence or a non-coding, translation incompetent genetic sequence.
In some embodiments, the sequence of interest might be transcribed into a linear mRNA molecule.
In some embodiments, the sequence of interest might be transcribed into a circular RNA molecule.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide can encode a polypeptide and may comprise a translation initiation sequence such as, but not limited to, start codon. In some embodiments the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide includes at least one translation initiation sequence adjacent to an open reading frame. In some embodiments the start codon is a non-coding start codon. In some embodiments, the translation initiation genetic sequence can include a Shine-Dalgarno or Kozak sequence. In some embodiments the Shine-Dalgarno or Kozak sequence may be adjacent to an open reading frame (ORF). In some embodiments the translation initiation sequence may be placed in either or both sides of each open reading frame. In some embodiments, the translation initiation sequence may provide a conformational flexibility to the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide.
In some embodiments the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may include at least one start codon and translation may initiate on the first start codon or may initiate downstream the first start codon. In some embodiments the translation of the polyribonucleotide may initiate at an alternative translation initiation sequence, such as, but not limited to, AAG, ACG, AGG, ATA/AUA, ATT/AUU, CTG/CUG, GTG/GUG and TTG/UUG. In some embodiments, the translation may begin under selective conditions, such as, but not limited to, cell type, cell condition and stress conditions. In some embodiments the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may begin translation at a repeat-associated non-AUG (RAN) sequence.
In some embodiments the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may encode prokaryotic and/or eukaryotic peptides or polypeptides of interest, selected from, but not limited to, biologics, antibodies, antigens, therapeutic peptides or polypeptides, cell penetrating peptides, secreted proteins, plasma membrane anchored proteins, cytoplasmic proteins, cytoskeletal proteins, intracellular membrane bound proteins, nuclear proteins, proteins associated with any human disease, targeting moieties or those proteins encoded by the human genome for which no therapeutic indication has been identified but which nonetheless have utility in areas of research and discovery.
As indicated herein, “Therapeutic protein” refers to a protein that, when administered to an organism, has a therapeutic, diagnostic and/or prophylactic effect, and/or elicits a desired biological and/or pharmacological effect.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide disclosed herein, may encode one or more biologics. As used herein, a “biologic” is a peptide or polypeptide produced by the methods provided herein and which may be used to prevent, diagnose, mitigate, treat or cure a serious disease or medical condition.
According to the present invention, biologics include, but are not limited to, vaccines, monoclonal antibodies, bi-specific antibodies, tri-specific antibodies, chimeric antigen receptors, allergenics, blood components, gene therapies, mammalian organ, tissue or cellular products, cytokines, immunomodulators, growth factors and enzymes amongst others.
In some embodiments, one or more biologics can be encoded and translated by the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide of the present invention.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide can encode one or more antibody fragments and combinations thereof. The term “antibody” includes monoclonal antibodies, which may include full length antibodies with an Fc region, antibodies with poly epitope specificity, multispecific antibodies such as, but not limited to, bispecific antibodies, diabodies, single chain molecules, VHH antibodies and antibody fragments. As described herein, the term “immunoglobulin” (Ig) is used interchangeably with the term “antibody”. As described herein, the term “monoclonal antibody” refers to an antibody obtained from an identical population of antibodies. Monoclonal antibodies are highly specific, and are directed against a single antigenic site.
In some embodiments, monoclonal antibodies include “chimeric” antibodies in which a portion of the heavy and/or light chain is homologous to antibodies derived from a particular species or correspond to a particular antibody class or subclass.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may include any of the five classes of immunoglobulins, IgA, IgD, IgE, IgG and IgM, including the heavy chains designated alpha, delta, epsilon, gamma and mu respectively. In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may include any subclass of antibody such as, but not limited to, IgA1, IgA2, IgG1, IgG2, IgG3 and IgG4.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may encode whole antibodies or antibody fragments of interest which include, but are not limited to, Fab, Fab’, F(ab'), Fv, diabodies, linear antibodies, nanobodies, bi-specific antibodies, tri-specific antibodies, tetra-specific antibodies, single-chain fragment variable, multispecific antibodies or combinations thereof
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may encode one or more antibody sequences that are currently being on the market or in development.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may encode antibodies to treat or prevent diseases in several therapeutic areas such as, but not limited to, immunology, endocrinology, dermatology, oncology, musculoskeletal, respiratory, central nervous system, gastrointestinal and infectious.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide disclosed herein, may encode one or more vaccine antigens.
As indicated herein, “vaccine antigen” refers to a peptide or polypeptide that improves immunity to a particular disease or infectious agent.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may encode one or more vaccine antigens. In some embodiments, the antigens encoded may be used to treat or prevent diseases in many therapeutic areas such as, but not limited to, cancer, allergy and infectious diseases.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide of this invention may encode one or more antimicrobial peptides, which have been isolated from a wide range of animals such as, but not limited to, microorganisms, plants, invertebrates, amphibians, birds, fish, and mammals.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide of this invention may encode antimicrobial peptides that may block cell fusion of viral entry. In some embodiments, the antimicrobial peptides may be part of viral encoded proteins. In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide described in this invention, may encode one or more cell penetrating peptides. As used herein, “cell penetrating peptide” refers to a polypeptide which may facilitate the cellular uptake of other polypeptides or molecules. In some embodiments, the cell penetrating peptide, may be fully or partially labeled. In some embodiments, the cell penetrating peptide may include a signal peptide. As used herein, a “signal sequence” refers to a sequence of amino acid residues bound at the amino terminus of a nascent protein during protein translation, that enables the polypeptide for example to be secreted. In some embodiments, the signal sequence may be used to signal the secretion of the cell penetrating peptide.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide described in this invention may encode a fusion protein. In some embodiments, the fusion protein may be created by operably linking two proteins of interest. As used herein, “operably linked” refers to the connection of two proteins in such a way to permit the expression of the complex when introduced into a cell.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide encodes a plasma membrane anchored protein, a cytoplasmic protein or cytoskeletal protein. In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide encodes a nuclear protein.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide of this invention encodes and is able to express a protein associated with a mammalian disease. In some embodiments, the mammalian disease is a human disease.
In some embodiments the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may encode proteins such as, but not limited to, VEGF, Hemoglobin, FIX, cancer-specific antigens, gene editing enzymes, Zinc Fingers, Nucleases, Cas9 and TALENs.
As described herein, a polypeptide may be a single molecule, or a multimolecular complex such as a dimer, trimer, tetramer or n-mer. In some embodiments the polypeptide may also comprise single chain or multiple chain polypeptides that may be associated or linked.
In some embodiments, the coding sequences embedded in the parental plasmid DNA may be codon optimized to permit an efficient translation of peptides or polypeptides that may be derived from different organisms than the host organism.
In further embodiments, the sequence of interest is selected from the group consisting of peptides, polypeptides, vaccine antigens, monoclonal antibodies, mono-specific antibodies, bi-specific antibodies, tri-specific antibodies, tetra-specific antibodies, chimeric antigen receptors, allergens, blood components, gene therapies, organ or cellular products, cytokines, immunomodulators, growth factors, enzymes, antibody fragments, poly epitopes, multispecific antibodies, poly-epitope-specific antibodies, diabodies, single chain molecules, VHH antibodies, antibodies fragments, chimeric antibodies, fusion proteins, plasma membrane anchored proteins, cytoplasmic proteins, cytoskeletal proteins, nuclear proteins, gene editing proteins, zinc fingers, nucleases, Cas9, TALENs, dimers, trimers, tetramers, single-chain polypeptides, multi-chain polypeptides, and combinations thereof.
A riboswitch is an RNA sequence that can directly bind to a small target molecule, and the binding to that molecule directly affects the continuity of transcription of the riboswitch-bearing RNA itself and/or its own translation, thus the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide that includes a riboswitch is directly involved in regulating its own activity depending on the presence or absence of its target molecule.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide comprises one or more riboswitches.
In some embodiments, a riboswitch may have an aptamer-like affinity for a specific molecule, thus, these riboswitches may be used for sequestration of molecules from bulk amounts.
In some embodiments, the riboswitch may have an effect on gene expression, including, but not limited to, transcriptional termination, inhibition of translation initiation, mRNA self-cleavage, alteration of splicing pathways, binding to a trigger molecule to activate or inactivate gene expression.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may be subjected to conditions that may activate, deactivate or block the gene expression. Expression can be altered as a result of blocking the ribosome binding site.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide binding to a molecule or an analog thereof can, depending on the nature of the riboswitch, reduce or prevent, promote or increase the expression of the RNA molecule
An aptazyme is a genetic switch that can be used for conditional gene expression in which the aptamer region is used as an allosteric control element that can be coupled to a catalytic RNA ribozyme sequence.
In some embodiments, the hairpin loop-ended self-complementary double stranded covalently closed linear polyribonucleotide can comprise one or more aptazymes.
In some embodiments, the hairpin loop-ended self-complementary double stranded covalently closed linear polyribonucleotide comprises an aptazyme that is active under specific cell states. In some embodiments, the specific cell state may include, but is not limited to, viral infection, or the presence of viral nucleic acids or proteins.
A ribozyme, or ribonucleic acid enzyme, is an RNA molecule that catalyzes a chemical reaction. Many ribozymes that are found in nature catalyze the hydrolysis of their own phosphodiester bonds or the hydrolysis of other RNA or DNA bonds. Ribozymes are able to catalyze other types of chemical reactions such as, but not limited to, covalent modification of amino acid side chains, tyrosine azido-adenylation, phosphorylation, dephosphorylation, covalent modification of phosphorylated amino acid side chains, amide and ester hydrolysis, diels-alder reaction, glycosylation, porphyrin metalation, RNA cleavage, RNA ligation, thymine-dimer repair, DNA depuration, DNA phosphorylation, DNA capping, DNA cleavage and DNA ligation.
In some embodiments, it may be advantageous to place many copies of a ribozyme and aptamers within a non-coding RNA .
In some embodiments, non-limiting examples of ribozymes may include hammerhead ribozyme, VL ribozyme, leadzyme and hairpin ribozyme.
In some embodiments, the aptazyme encoded in the hairpin loop-ended self-complementary double stranded covalently closed linear polyribonucleotide may also permit the self-regulation of the gene expression. In some embodiments, the polyribonucleotide gene expression may be autoregulated by the presence of an aptazyme that is activated by the product encoded in the same polyribonucleotide sequence. In some embodiments, the activity of the aptazyme can be sensitive to the accumulation of the protein product encoded within the polyribonucleotide or any other cellular macromolecule.
Virus-derived self-cleaving 2A peptides are promising candidates for the production of multicistronic RNA therapeutic molecules due to their small size and self-cleavage ability. Most 2A peptides are composed of 16-20 amino acids and are taken from viral RNA. The most common 2A peptides used to produce multicistronic vectors are F2A (Foot-and-Mouth Disease virus), E2A (Equine Rhinitis Virus), P2A (Porcine Teschovirus-1), and T2A (Thosea Asigna Virus). 2A peptide cleavage is not mediated by proteases, but it is mediated via ribosomal skip mechanism. The cleavage sites of these peptides are located between the C- terminal glycine of 2A and the N-terminal proline of 2B. This process takes place inside the ribosome during protein synthesis, where the formation of a normal peptide bond between these amino acids at the cleavage site is inhibited, but the cleavage does not affect the translation of the subsequent protein, since ribosome dissociation does not occur.
The use of elongated 2A with the additional spacers in the N-terminus of 2A peptide sequence can provide an efficient cleavage reaction, since it leads to a decrease in the inhibition rate of the 2A cleavage reaction. The insertion of additional spacer sequences such as as the furin cleavage site or a glycine-serine- glycine (GSG) spacer flexible linker can yield better cleavage.
The combination of 2A peptide sequences in the following order, namely T2A, P2A, and E2A, is optimal when creating multicistronic vectors containing four genes.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide can comprise one or more 2A peptides.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may comprise at least one translation competent sequence that encodes a regulatory nucleic acid. In some embodiments the regulatory nucleic acid sequence may modify the expression of an endogenous or exogenous gene. In some embodiments, the regulatory nucleic acid may comprise at least one non-translation competent sequence that encodes a regulatory nucleic acid, such as, but not limited to, exRNA, hnRNA, IncRNA, microRNA, rRNA, shRNA, snoRNA, snRNA, siRNA, scaRNA, Y RNA, piRNA, miRNA, tRNA and rRNA, iRNA. In some embodiments the regulatory non-coding RNA sequences may hibridize to an endogenous genetic sequence or an exogenous genetic sequence such as viral DNA or RNA.
In some embodiments the non-coding regulatory RNA sequences can comprise a guide RNA (gRNA) or a single guide RNA (sgRNA) that may recognize specific DNA sequences. In some embodiments the sgRNA mimics a naturally occurring crRNA-tracrRNA complex and may contain both a tracrRNA that binds to the nuclease, and at least one crRNA that guides the nuclease to the sequence targeted for editing. In some embodiments, the gRNA and sgRNA is used as a part of the CRISPR/Cas system for gene editing.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide can comprise a sequence encoding one or more peptides or polypeptides, one or more regulatory sequences, one or more regulatory nucleic acids, one or more non-coding RNAs, one or more expression sequences, and any combination thereof.
In some embodiments, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide of this invention, may include a “safety switch system”. This “safety switch” is represented by a regulatable expression unit comprising at least one inducible promoter such as, but not limited to, tetracycline-inducible promoter, cumate-inducible promoter, rapamycin-inducible promoter, at least one copy of a suitable operator, at least one Kozak sequence, at least a coding sequence encoding a transcriptional activator-like effector nuclease (TALEN)-Fok1 fusion protein, where the DNA recognition domain is designed to target a specific n-mer sequence termed “S4” (Safety Switch System Sequence), plus a transcription terminator.
In other embodiments, another expression unit drives the expression of the regulatory rTetT-VP16 fusion protein which regulates the expression of the aforementioned tetracycline-inducible unit, comprising a constitutive promoter, a Kozak' sequence, and the cds corresponding to said regulatory fusion protein, followed by a transcription terminator. The hairpin loop ended self-complementary double-stranded linear DNA molecule bears two mutually facing S4 n-mers strategically placed to direct a double stranded cleavage in the molecule, when expression of TALEN-Fok1 is induced by tetracycline or doxycycline. In summary, administration of tetracycline/doxycycline will induce the expression of a TALEN-Fok1 fusion protein that selectively cleaves the hairpin loop ended self-complementary double-stranded linear DNA molecule in the region between the two S4 n-mers, rendering dsDNA free 5'and 3' ends which are susceptible to exonucleolytic degradation by naturally occurring cellular DNAses, thus, ending the physiological activity of the therapeutic DNA molecule.
In an alternative approach, the inducible promoter drives the expression of a cytotoxic or pro-apoptotic gene which promotes cell death upon induction by administration of a drug such as -but not limited to- tetracycline/doxycycline, cumate or rapamycin, thus ending the physiological activity of the therapeutic strategy. One example of these cytotoxic proteins is Herpes simplex’s thymidine-kinase, which metabolizes the nontoxic drug ganciclovir into ganciclovir-triphosphate, which incorporates into DNA in replicative cells, inhibiting DNA synthesis and resulting in cell death.
In a preferred embodiment, the enhancer sequence is selected from the group consisting of constitutive mammalian enhancers.
In another preferred embodiment, the post-transcriptional regulatory element is selected from the group consisting of Hepatitis B virus (HPRE) regulatory element and Woodchuck Hepatitis virus (WPRE) regulatory element.
In a preferred embodiment, the synthetic or wild-type IRES sequence is selected from the group consisting of encephalomyocarditis virus (ECMV) IRES, Coxackievirus B3 (CVB3) IRES, Taura Syndrome Virus (TSV) IRES, Triatoma Virus (TV), Human mastadenovirus C (HAdV-C), Human adenovirus 5 (HAdV5), Human adenovirus 7 (HAdV7), Human mastadenovirus B (HAdV-B), Hepatitis GB virus B (HGBV-B), HPV31, Human Poliovirus 3 (HPV-3), Human papillomavirus type 11 (HPV11), Human immunodeficiency virus 1 (HIV-1), Human immunodeficiency virus 2 (HIV-2), Simian T-lymphotropic virus 1 (STLVs-1), Human Parvovirus B19 (HPB19), Human betaherpesvirus 6B (HHV-6B), Human alphaherpesvirus 3 (HHV-3), Human papillomavirus type 41 (HPV41), Human papillomavirus type 6b (HPV6b), Alphapapillomavirus 7, Friend murine leukemia virus, Heilovirus (ThV), Rous sarcoma virus (RSV), Human mastadenovirus F (HAdv-F), Human papillomavirus type 4 (HPV4), Human papillomavirus type 63 (PHV63), Human mastadenovirus A, Feline immunodeficiency virus (FIV), Human T-lymphotropic virus 2 (HTLV-2), Jaagsiekte sheep retrovirus (JSRV), Spleen focus-forming virus (SFFV), Mouse mammary tumor virus (MMTV), Murine osteosarcoma virus (MOV), Ovine lentivirus (OLV/OvLV), Squirrel monkey retrovirus (SMRV), Human papillomavirus type 16 (HPV16), Human papillomavirus type 5 (HPV5), Human polyomavirus 1, Rabies lyssavirus, Simian immunodeficiency virus (SIV), Human papillomavirus type 10 (HPV10), Human papillomavirus type 26 (HPV26), Human papillomavirus type 32 (HPV32), Human papillomavirus type 34 (HPV34), Marburg marburgvirus, Enterovirus A, Rhinovirus A, Human betaherpesvirus 6A (HHV-6A), Macaca mulatta polyomavirus 1, Snakehead retrovirus (SnRV), Bovine foamy virus (BFV), Drosophila C virus (DCV), Hendra henipavirus (HeV), Aichi virus 1 (AiV-1), Influenza B virus (IBV), Zaire ebolavirus (ZEBOV), Human coronavirus 229E (HCoV-229E), Nipah henipavirus (NiV), Human respirovirus 1 (HPIV-1), Modoc virus (MODV), Sudan ebolavirus, Human parvovirus 4 G1, Rotavirus C, Human gammaherpesvirus 4 (Epstein-Barr virus), eukaryotic IRES derived from Homo sapiens, Aplysia californica, Canis lupus familiaris, Drosophila melanogaster, Gallus gallus,Mus pahari,Mus musculus, Saccharomyces cerevisiae, Zea mays, Rattus norvegicus, Ovis aries, SYTE-IRES1, SYTE-IRES2, and combinations thereof.
The synthetic transcriptional unit may further comprise a plurality of permuted Intron-Exon sequences.
In a further preferred embodiment, the permuted Intron-Exon sequence is selected from the group consisting of self-splicing group I intron fragment, group II intron fragment introns, and combinations thereof.
In a preferred embodiment, the at least one DNA nuclear targeting sequence included in the parental plasmid, and/or plasmids pDNAv_A and pDNAv_B is an SV40 enhancer DNA nuclear targeting sequence (SEQ ID NO 2).
In another embodiment, the at least one DNA nuclear targeting sequence included in plasmids pDNAv_A and pDNAv_B is the NFkB importin DNA nuclear targeting sequence (SEQ ID NO 3).
In order to obtain a long-term expression of a protein of interest, DNA molecules can be engineered to maintain a certain stability in the cell nucleus. For that purpose, several viral-derived and autologous DNA sequences can be added to the linear DNA molecules. Chromatin of eukaryotic cells have Matrix Attachment Regions (MARs) that are operationally defined as A/T-rich DNA elements that bind specifically to the protein nuclear matrix. They are frequently located in gene-rich areas associated with transcribed units and there is evidence that they can reduce or eliminate some forms of gene silencing. Genes flanked by MAR sequences have been shown to exhibit less variation in expression among independent transgenic plants or cell culture lines. In particular, MARs appear to prevent silencing caused by multiple transgene copies in cis. There is conjecture as to the precise role of MAR sequences in transgene cassettes, but current hypothesis suggest that they may act as boundary elements facilitating the formation of decondensed chromatin loops that in turn enhance transgene transcription and reduce some forms of gene silencing (
Therefore, it is contemplated the addition of S/MAR sequences to the parental plasmids of the present invention (
In a preferred embodiment, the S/MAR sequence is SEQ ID NO 4.
The placement of an SV40 origin of replication in the vicinity of an S/MAR sequence in a genetic construct has the capability of driving replication of said genetic construct at very low copy numbers, in a cell cycle-synchronized fashion, and is stably maintained without selection for more than 100 generations. This becomes especially relevant when considering a therapeutic approach that requires transfection of replicative cells with the hairpin loop ended self-complementary double stranded covalently closed linear DNA molecule of this invention.
Therefore, the addition of an SV40 origin of replication to the parental plasmid of the present invention is contemplated.
MCS1 and MCS2 are composed by at least one restriction enzyme recognition sequence known in the art, such as, but not limited to, EcoRI, XbaI, BamHI, XhoI, XbaI, HindIII, EcoRV, PstI, KpnI, StuI, Sa1I, NcoI, NdeI, NotI, SpeI and BsteI.
In a preferred embodiment the restriction recognition sites for MCS1 are Sa1I (SEQ ID NO 5), PstI (SEQ ID NO 6) and NseI (SEQ ID NO 7).
In a preferred embodiment the restriction recognition sites for MCS2 are EcoRI (SEQ ID NO 8), SdaI (SEQ ID NO 9) and XhoI (SEQ ID NO 10).
Origin of replication 1 can be selected from any inducible origin of replication such as, but not limited to, Pir-R6K, trfA-oriV, etc.
In a preferred embodiment of this invention, the selected Origin of replication is R6K and the replication protein 1 is Pi protein coding sequence (SEQ ID NO 11).
Appropriate Origin of replication 2 derived from a bacteria, phage, or combinations can be selected from any “rolling circle” replication system such as, but not limited to, M13 Ori (M13 g2p), PhiX174 Ori (protein A), pT181 (RepC).
In a preferred embodiment, the origin of replication of this invention comprises the M13 origin of replication and, in particular, the M13 Ori(+) (SEQ ID NO 12), to enable the generation of single stranded covalently closed circular DNA molecules.
In some embodiments, a recombinant transcriptional unit comprises a promoter operably linked to a second nucleic acid sequence that may be an open reading frame that encodes a genetic product. In some embodiments, a recombinant transcriptional unit may comprise nucleic acids combined in such a way that these nucleic acids are extremely unlikely to be found in nature.
In some embodiments, the transcriptional unit comprises a terminator or termination sequence operably linked to the open reading frame.
In some embodiments, the transcriptional unit comprises at least one synthetic or non-natural promoter, at least one synthetic or non-natural coding sequence and at least one synthetic or non-natural terminator.
In some embodiments the promoters and regulatory elements may include novel proprietary sequences.
According to a second aspect of the present invention, a helper plasmid (illustrated in
In a preferred embodiment, the helper plasmid also contains an endonuclease recognition sequence to permit the elimination of the Helper Plasmid prior to the purification of the linear DNA molecules.
In some embodiments, constitutive promoters are selected from the group consisting of P(Bla) constitutive promoter, P(Cat) constitutive promoter, P(Kat) constitutive promoter, Reverse lambda cI-regulated promoter, pBAD reverse constitutive promoter, lacQ constitutive promoter, GlnRS constitutive promoter, and combinations thereof.
In some embodiments, the inducible promoters are selected from the group consisting of L-Arabinose inducible promoter, IPTG inducible promoter, Lactose inducible promoter, Glucose inducible promoter, Tetracycline inducible promoter, Temperature inducible promoter, pH inducible promoter, HSL inducible promoter, Maltose inducible promoter, Xylose inducible promoter, Phosphate inducible promoter, Cold-Shock inducible promoter, glnG transcription enhancer inducible promoter, and combinations thereof.
In some embodiments, the Endonuclease Recognition Sequence is selected from the group consisting of I- SceI endonuclease recognition sequence, F-SceI endonuclease recognition sequence, F-SceII endonuclease recognition sequence, I-AniI endonuclease recognition sequence, I-CeuI endonuclease recognition sequence, I-ChuI endonuclease recognition sequence, I-CpaI endonuclease recognition sequence, I-CpaII endonuclease recognition sequence, I-CreI endonuclease recognition sequence, I-CsmI endonuclease recognition sequence, I-DmoI endonuclease recognition sequence, I-PorI endonuclease recognition sequence, I-ScaI endonuclease recognition sequence, I-SceII endonuclease recognition sequence, I-SceIII endonuclease recognition sequence, I-SceIV endonuclease recognition sequence, PI-SceI endonuclease recognition sequence, PI-PspI endonuclease recognition sequence, PI-TliI endonuclease recognition sequence, F-SuvI endonuclease recognition sequence, F-TevI endonuclease recognition sequence, F-TevII endonuclease recognition sequence, I-DirI endonuclease recognition sequence, I-HmuI endonuclease recognition sequence, I-HmuII endonuclease recognition sequence, I-PpoI endonuclease recognition sequence, I- TevIII endonuclease recombination sequence, and combinations thereof.
The helper plasmid may further comprise a bacterial genome integration cassette.
Examples of suitable recombinases for use in the present invention can be selected from, but are not limited to, Cre, Flp, phiC31, Lambda, ResT, whose target sequences are: locus of X-over P1 (loxp); short flippase recognition target (FRT); attachment sites B/P/L/R (attB/P/L/R); cos site; respectively.
The expression of recombinases, DNA replicating proteins and endonucleases is under the control of inducible promoters selected form the group consisting of E. coli σ70 promoters such as -but not limited to- L-Arabinose inducible promoter, IPTG inducible promoter, Lactose inducible promoter, Glucose inducible promoter, Tetracycline inducible promoter, Temperature inducible promoter, pH inducible promoter, HSL inducible promoter, Maltose inducible promoter, Xylose inducible promoter, Phosphate inducible promoter, or where the inducible promoter comprises E. coli σ32 promoters such as Cold-Shock inducible promoter, or where the inducible promoter comprises E. coli σ54 promoters such as glnG transcription enhancer inducible promoter, and combinations thereof.
The expression of the repressor proteins is under the control of constitutive promoters selected from the group consisting of E. coli σ70 promoters such as -but not limited to P(Bla) constitutive promoter, P(Cat) constitutive promoter, P(Kat) constitutive promoter, Reverse lambda cI-regulated promoter, pBAD reverse constitutive promoter, lacQ constitutive promoter, GlnRS constitutive promoter, and combinations thereof.
According to one embodiment of this invention, the constitutive promoter of the repressor protein operon is the constitutive promoter pTac, (SEQ ID NO 14).
In other embodiments of this invention, the first repressor protein CDS is LacIq (SEQ ID NO 15), which represses the first inducible promoter from the first expression cassette and permits the expression of the Recombinase 1 protein.
In further embodiments, the second repressor protein CDS is AraC (SEQ ID NO 16) that represses the second inducible promoter from the second expression cassette and permits the expression of the Endonuclease and DNA Replication proteins.
In alternative embodiments, the third repressor protein CDS is Lambda repressor protein (SEQ ID NO 17) that represses the third inducible promoter from the third expression cassette and permits the expression of the Recombinase 2 protein.
Exemplary “rolling circle” replication proteins from the second expression cassette comprise, M13 g2p, Staphylococcus aureus RepC, Pyrococcus abyssi pGT5, Streptococcus pneumoniae pLS1, amongst others.
In a preferred embodiment of this invention the replication protein is M13 g2p (SEQ ID NO 18).
In accordance to possible embodiments, the Recombinase 2 CDS comprises at least one phage-derived telomerase such as N15 phage telomerase TelN, PY54 phage telomerase Tel, VP882 phage TelK, ΦHAP-1 phage Tel, or wherein the Recombinase comprises at least one bacteria-derived telomerase such as B. burgdorferi telomerase, B. hermsii telomerase, B. parkeri telomerase, B. recurrentis telomerase, B. turicatae telomerase, B. anserine telomerase, A. tumefaciens C58 ResT telomerase, or combinations thereof, and the circular covalently closed plasmid DNA vector incorporates the target binding sequence for at least said recombinase and said system produces a linear covalently closed DNA molecule.
In a further preferred embodiment, the Recombinase 2 coding sequence is TelN (SEQ ID NO 19)
According to some embodiments, the Recombinase 2 coding sequence from the first expression cassette in the recombinant Escherichia coli is selected from the group consisting of the ResT site, the loxP site, φpK02 telRL site, the FRT site, phiC31 attP site and the λ attP site.
In a preferred embodiment of this invention, the Recombinase 2 coding sequence corresponds to PhiC31 (SEQ ID NO 20).
In some embodiments, the Endonuclease coding sequence of the second expression cassette comprises at least one of the “LAGLIDADG” homing endonuclease family such as I- SceI endonuclease, F-SceI endonuclease, F-SceII endonuclease, 1-AniI endonuclease, I-CeuI endonuclease, I-ChuI endonuclease, I-CpaI endonuclease, I-CpaII endonuclease, I-CreI endonuclease, I-CsmI endonuclease, I-DmoI endonuclease, I-PorI endonuclease, I-ScaI endonuclease, I-SceII endonuclease, I-SceIII endonuclease, 1-SceIV endonuclease, PI-SceI endonuclease, PI-PspI endonuclease, PI-TliI endonuclease, F-SuvI, or the endonuclease comprises at least one the GIY-YIG endonuclease family such as F-TevI endonuclease, F-TevII endonuclease, or the endonuclease comprises at least one of the His-Cys-His Box endonuclease family such as I-DirI, I-HmuI endonuclease, I-HmuII endonuclease, I-PpoI endonuclease, I- TevIII, or combinations thereof.
In a preferred embodiment of this invention, the Endonuclease coding sequence of the second expression cassette is I-SceI endonuclease (SEQ ID NO 21).
In a preferred embodiment the genetic sequence of the Helper Plasmid is SEQ ID NO 13.
It is contemplated the use of this helper plasmid within a recombinant bacterium as a system together with parental plasmids, wherein said recombinant bacteria is a non -genome recombinant bacteria.
According to a third aspect of the present invention, a recombinant cell is provided, as illustrated in
Suitable cells for use in the present invention may be bacterial cells such as Escherichia coli cells, yeast cells such as Saccharomyces cerevisiae, and mammalian cells such as HEK-293 cells.
In an embodiment of this invention, the recombinant cells are Escherichia coli, K12 strain derivative, as shown in
Genome-Recombinant Cells of the invention may be engineered based on techniques well-established in the art, for example, using bacteriophage integrases, which integrate Helper Plasmid DNA sequences into bacterial chromosomes in a single recombination step.
Alternatively, Episomically-Recombinant Cells of the invention may be engineered based on transformation of the Helper Plasmid using techniques well-established in the art, for example, CaC12/heat shock transformation or electroporation.
As depicted in
According to a fourth aspect of the invention, a hairpin loop ended self-complementary double-stranded linear covalently closed DNA vector production system (depicted in
In some specific embodiments, the production system comprises the recombinant cell and the parental plasmids hereinafter referred to as pDNAv_A and pDNAv_B.
Both pDNAv_A (illustrated in
In embodiments of the present invention, pDNAv _A and/or pDNAv_B further comprise at least one additional ultra-high copy number bacterial origin of replication that can generate more than 5,000 copies per cell.
In embodiments of the invention, pDNAv_A has two bacterial origins of replication. Preferably, one of the bacterial origins of replication for pDNAv_A is R6K origin of replication (SEQ ID NO 22).
In embodiments of the invention, the selection system for pDNAv_A is Ampicillin Resistance Cassette (SEQ ID NO 23).
In embodiments of the invention, the preferred selection system for pDNAv_B is Chloramphenicol Resistance Cassette (SEQ ID NO 24).
Also preferably, the other bacterial origin of replication for pDNAv_A is M13 origin of replication (SEQ ID NO 12).
In further embodiments, pDNAv_B has one bacterial origin of replication. Preferably, the bacterial origin of replication of pDNAv_B is the R6K origin of replication (SEQ ID NO 22).
In a preferred embodiment the genetic sequence of pDNAv_A is SEQ ID NO 25 (illustrated in
In another preferred embodiment the genetic sequence of pDNAv_B is SEQ ID NO 26 (illustrated in
According to a fifth aspect of the invention, a hairpin loop ended self-complementary double-stranded linear covalently closed DNA vector production process is provided, starting from the aforementioned system, wherein said process comprises the stages of:
In possible embodiments of the invention, the process particularly comprises the steps of:
The above process is depicted in
In preferred embodiments, pDNAv_A (
After the selection of the correct clones pDNAv_Ac and pDNAv_Bc, both are subjected to a first in vitro recombination step using the first recombinase recognition sequence. Alternatively, pDNAv_Ac and pDNAv_Bc can be subjected to a first in vitro recombination by using restriction enzyme digestion of the endonuclease recognition sequence embedded in the first recombinase recognition sequence, and subjecting digested pDNAv_Ac and pDNAv_Bc to ligation. Alternatively, pDNAv_Ac and pDNAv_Bc can be subjected to a first in vitro recombination by performing a PCR amplification of each plasmid and subjecting the amplicons generated to in vitro commercial recombination systems, in order to generate the cointegrate plasmid pDNAv_ABc (
After performing the cointegration step in vitro, the cointegrate vector pDNAv _ABc is transformed in the recombinant cell which harbors the Helper Plasmid, and incubated to allow the theta replication and expression of antibiotic resistance genes. After appropriate incubation, recombinant clones are selected.
Once the recombinant clones are selected, the cointegrate plasmid pDNAv_ABc is subjected to a second in vivo recombination by inducing the expression of the First Recombinase Protein that recognizes the Second Recombinase Recognition Sequence in the co-integrate, in order to generate two derived circular covalently closed DNA molecules: pDNAv_ABc1 (
After the generation of pDNAv_ABc1 and pDNAv_ABc2 circular covalently closed DNA molecules, the posterior induction of the DNA replicating protein expression in this step allows the initiation of the “rolling circle” replication of pDNA_ABc1 (
Once pDNAv_ABc2 and the helper plasmid are degraded, the third recombinase protein is induced in order to cleavage and eliminate the loops at both ends of the hairpin loop-ended double-stranded linear DNA molecules, producing the final hairpin loop-ended self-complementary double-stranded linear covalently closed DNA molecules denoted as SYTEL-DNA (
This last step involves the isolation of the resulting hairpin loop-ended self-complementary double-stranded linear covalently closed DNA molecules (SYTEL-DNA) from the recombinant cell, in order to proceed with the formulation for final administration into the target organism or cell.
The isolation can be carried out by methods known in the art for plasmid DNA purification such as Alkaline Lysis, Boiling Lysis, Affinity Column Based Methods among others.
In other embodiments of the invention, the process particularly comprises the steps of:
The above process is depicted in
In preferred embodiments, pDNAv_A (
After the selection of the correct clones pDNAv_Ac and pDNAv_Bc, both are subjected to a first in vitro recombination step using the first recombinase recognition sequence. Alternatively, pDNAv_Ac and pDNAv_Bc can be subjected to a first in vitro recombination by using restriction enzyme digestion of the endonuclease recognition sequence embedded in the first recombinase recognition sequence, and subjecting digested pDNAv_Ac and pDNAv_Bc to ligation. Alternatively, pDNAv_Ac and pDNAv_Bc can be subjected to a first in vitro recombination by performing a PCR amplification of each plasmid and subjecting the amplicons generated to in vitro commercial recombination systems, in order to generate the cointegrate plasmid pDNAv_ABc (
After performing the cointegration step in vitro, the cointegrate vector pDNAv _ABc is transformed in the recombinant cell which harbors the helper plasmid, and incubated to allow the theta replication and expression of antibiotic resistance genes. After appropriate incubation, recombinant clones are selected.
Once proper recombinant clones are selected, the recombinant bacteria are subjected to an incubation step in order to allow theta replication of plasmid pDNAv_ABc.
After a suitable incubation time, the expression of the second recombination protein is induced in order to produce the hairpin loop-ended double-stranded linear DNA molecules denoted as SYTEL-DNA (
To degrade the resulting residual parental recombination product and the Helper Plasmid, the endonuclease protein expression is induced in order to allow the cleavage of the corresponding endonuclease recognition sites located in the aforementioned DNA molecules.
This last step involves the isolation of the resulting hairpin loop-ended self-complementary double-stranded linear covalently closed DNA molecules from the recombinant cell, in order to proceed with the formulation for final administration into the target organism or cell.
The isolation can be carried out by methods known in the art for plasmid DNA purification such as Alkaline Lysis, Boiling Lysis, Affinity Column Based Methods among others.
In some embodiments of the invention the entire process is carried out in vivo.
In other possible embodiments, the entire process is carried out in vitro.
In preferred embodiments, all the steps are performed in vivo, except from the step of subjecting both parental plasmids (such as pDNAv_Ac and pDNAv_Bc) to a first recombination with the first recombination site in order to generate the co-integrated plasmid pDNAv_ABc that is performed in vitro.
In a sixth aspect of this invention, a hairpin loop ended self-complementary double-stranded linear DNA molecule is provided, wherein such molecule is produced by any of the aforementioned processes.
In some embodiments, as illustrated in
In still a further embodiment and with reference to
Preferably, the genetic elements of the DNA vector are operably linked in the following order: a single-stranded DNA loop, optionally at least one DNA nuclear targeting sequence, optionally at least one S/MAR sequence, optionally at least one eukaryotic origin of replication, at least one promoter, optionally at least one 5’ untranslated region sequence, at least one coding sequence or sequence of interest, optionally one or more IRES, MIRES and/or 2A-peptide sequences, optionally at least one 3’ untranslated region sequence, at least one transcriptional terminator sequence, optionally at least one S/MAR sequence, optionally at least one eukaryotic origin of replication, optionally at least one DNA nuclear targeting sequence, a single-stranded DNA loop, with suitable DNA spacer sequences optionally placed between at least two of the aforementioned elements.
In a most preferred embodiment, as shown in
By way of example, episomal maintenance sequences can be selected from S/MAR sequences, SV40T mutated sequence, β-Globin Replicator sequence, Epstein-Barr virus (EBV) elements oriP and EBNA1.
Preferably, the episomal maintenance sequence is β-Interferon S/MAR sequence (SEQ ID NO 4).
The resulting hairpin loop ended self-complementary double-stranded covalently closed linear DNA vectors may be used as a DNA-based platform for the expression of a wide range of genetic constructs such as, but not limited to, conventional mRNA molecules, self-amplifying RNA molecules, small interfering RNA (siRNA), circular RNA molecules (circRNA), microRNA Sponges, and combinations thereof.
In an embodiment of this invention, the molecules expressed may be used for the prevention or treatment of:
In some embodiments, in order to achieve the expected efficacy, the hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide may be formulated with suitable delivery vehicles through known methods. These delivery vehicles include but are not limited to: polymers, polymeric carriers, membrane lipid bilayers, liposomes, lipid nanoparticles, protein nanoparticles, viral-derived envelopes and/or capsids. In some embodiments, delivery methods may include transfection, electroporation and or fusion.
In some embodiments a pharmaceutical composition or formulation refers to a form of administration, e.g., systemic, local, intratumoral, peritumoral, subcutaneous, intramuscular. Suitable forms, in part, depend upon the use or the route of entry, for example oral, inhalation or injection. These compositions or formulations are prepared according to any method known in the art for the manufacture of pharmaceutical compositions.
This invention articulates with other available technologies to produce hairpin loop ended linear covalently closed DNA molecules since it can generate them in great quantity and quality for their administration to mammalian cells, tissues and organs. But, in addition, it overcomes the obstacles posed by such other available technologies. Particularly, because this solution uses a bacterial biological system to produce the hairpin-loop ended linear covalently-closed DNA molecules. This mainly avoids the need for large amounts of recombinant enzymes for use in vitro, which are expensive and represent a limiting step to sustain a qualitative and quantitative competitive production with a standardizable, reproducible and transferable procedure.
Regarding to the distinctive characteristics of this invention, it is important to highlight that the disclosed technology is more easily scalable to virtually all volumes in which it has been possible to cultivate Escherichia coli in high densities, without requiring an expensive infrastructure (widely used in the traditional biotech industry), consequently reducing time, human resources, dependence on important supplies (which are irreplaceable in the other procedures, such as recombinant enzymes), and procedural steps, since our biological system performs all the necessary tasks to generate the molecules of interest. Thus, the recombinant biological system proposed here, in combination with the particular and specific design of the plasmid vectors of this invention, contains all the engineering that replaces the infrastructure required in other extant systems producing similar DNA molecules. Thus, our system provides a fast and simple way to generate different active ingredients for the development of gene therapies or vaccines, which can then be rapidly produced on a scale suitable for conceptual evaluation (laboratory scale), for its testing in animal models (pre-clinical scale), for its clinical trial in humans or its direct formulation for products registered and approved by regulatory agencies (clinical GMP scale).
Adding to the originality of the present invention, we propose the incorporation of several specific modular genetic elements to the final functional hairpin loop ended linear covalently closed DNA molecules, such as SV40 enhancer, S/MAR and SV40 origin of replication sequences. The combination of these elements allows the encoded transcriptional unit in the molecule to achieve an unsurpassed therapeutic potential by maximizing its nuclear localization, and enhancing its transcriptional activity and stability through the attachment of the molecule to the nuclear matrix. In addition, the functional combination of an SV40 origin of replication with the S/MAR sequence, provides continuous replicative capability in a cell cycle-synchronized fashion, which is essential to avoid the gradual dilution of the hairpin loop ended linear covalently closed DNA molecule as the transfected cells proliferate, thus maintaining the therapeutic effect unaltered as the replication process continues, maximizing the chances for each cell of the progeny of receiving at least one copy of the therapeutic molecule. This feature is of the utmost importance in strategies such as CAR-T or CAR-NK cell therapies, wherein the transformed cells (T-cells or NK lymphocytes) undergo active proliferation upon activation, or for the treatment of diseases like atopic dermatitis or psoriasis, where the transformed cells (keratinocytes, in this example) also proliferate constantly as the skin peels off.
The aforementioned original innovations ensure the achievement of the maximum expression potential of the hairpin loop ended linear covalently closed DNA molecule of this invention as a therapeutic approach, potentially minimizing the need for high concentrations of nucleic acids administered per dose, or even the need for redosings after a single administration.
The assays described below are valid for all the aforementioned aspects of the invention.
The expression cassettes and the repressor proteins' operon were designed in-silico and chemically synthesized, molecularly cloned and sequenced by GeneUniversal Inc. Previous to the full sequence insertion into the Escherichia coli genome, a full genetic construction was achieved (e.g., Helper Plasmid;
After the PCR was performed, a gel-agarose electrophoresis was carried out, as can be appreciated in
To further demonstrate the functioning of the bacteria, template plasmids were subjected to enzymatic activity from the genes that were inserted in the bacterial genome.
In addition, in order to demonstrate the function of the PhiC31 enzyme, a cointegrated plasmid was generated from pDNAv_A (
To demonstrate the function of the I-SceI enzyme, genome-recombinant Escherichia coli having the co-integrate plasmid were subjected to an inductor and after appropriate incubation, extraction, and purification of DNA by alkaline lysis, this was subjected to an agarose gel electrophoresis and stained with ethidium bromide. Only the linear DNA was present as can be seen in
To further test the recombinant bacterial strain, the “rolling circle” replication was tested using a proper DNA template, showing the expected increment in the copies of DNA molecules (
To further validate generation of hairpin loop ended self-complementary double-stranded covalently closed linear DNA vectors, resulting linear DNA molecules from the process described in this invention, were subjected to exonuclease and/or endonuclease activity.
A transformation efficiency assay of the linear DNA molecules was performed (
The percentage of transfection was assessed after 48 h post transfection as GFP-positive cells/total cells, and the results are shown in
To validate the in vivo expression of the Linear DNA molecules, 20 µg of SYTEL-EGFP were formulated with non-targeted lipid nanoparticles and injected in mice biceps femoris muscles. Six days after the administration, the muscle was harvested, properly dissected and subjected to a fluorescence microscopy survey.
SYTEL-EGFP in vivo transfection leads to the expression of the recombinant protein EGFP as can be visualized in (
Eukaryotic cell replication follows a molecular schedule known as the cell cycle, which is controlled by a set of replication proteins such as cyclins and cyclin-dependent kinases, among others. Besides the expression of said replication proteins, the replication of DNA molecules in the cell requires a sequence within the DNA molecule that serves as an origin of replication, where the process starts by the formation of a replication bubble.
A replication competent linear DNA molecule was designed, synthesized and manufactured using the method described in this patent. The replication competent linear DNA was engineered to contain at least one origin of replication that is able to replicate the molecules inside mammalian cells. In a preferred embodiment of this patent, the origin of replication is selected from the group consisting of, but not limited to the SV40 origin of replication. This genetic element orchestrates the replication of the DNA molecule where the SV40 origin is located, provided that it is located in the vicinity of an appropriate S/MAR sequence, in a fashion that shows complete synchronization with the host cell cycle. This allows the linear DNA molecule to be stably maintained in the progeny of the originally transformed cell.
To demonstrate the replication of the linear DNA molecule inside mammalian cells, a set of experiments were performed. First, 1 µg of Linear DNA encoding the reporter protein EGFP (denoted as SYTEL-EGFP;
The safety switch assay was performed in cell lines to see if the expression of the recombinant protein can be stopped and the molecule can be self-eliminated in a safe way. For this assay, a proteasome-tagged EGFP-encoding linear DNA molecule was used. The expression of proteasome-tagged EGFP proteins is preferred over the classical EGFP protein because the former one can be recycled in a rapid fashion, so that if the safety switch works, the fluorescence will rapidly fade in the cells as can be visualized by fluorescence microscopy.
For the assay, 1 µg of SYTEL-TgEGFP was complexed with Lipofectamine and administered to HEK-293 cells according to the manufacturer’s instructions. After 48 hs, transfected cells were visualized under fluorescence microscopy to validate the correct expression of EGFP (
In an alternative approach, a second SYTEL-DNA molecule expressing CFP (Cerulean Fluorescent Protein) was used. In this case, a conditional Bax expressing cassette was built into the linear DNA molecule. When doxycycline is administered externally, the expression of the pro-apoptotic protein Bax is induced, triggering the apoptotic cascade, leading to the death of the transfected cell.
For this second assay, 1 µg of SYTEL-CFP (Bax-armed) was complexed with Lipofectamine and administered to HEK-293 (human embryonic kidney) cells. This cell line was cultured in Dulbecco’s Modified Eagle Medium (DMEM) with 10% FBS at 37° in 5% v/v CO2 and in a humidified atmosphere. All experiments were performed with cells in passage 3 cells according to the manufacturer’s instructions. After 48 hs, transfected cells were visualized under fluorescence microscopy to validate the correct expression of CFP (
As used herein, the article “a”, “an” , and “the'” means “at least one”, unless the context in which the article is used clearly indicates otherwise. The term “nucleic acid” as used herein means a polymer composed of nucleotides, e.g., deoxyribonucleotides or ribonucleotides.
The terms “ribonucleic acid”, “RNA” an “polyribonucleotide” as used herein mean a polymer composed of ribonucleotides.
The terms “circular polyribonucleotide”, “circRNA” and “circular RNA” are used interchangeably throughout this patent.
The terms “deoxyribonucleic acid”, “DNA” an “polydeoxyribonucleotide” as used herein mean a polymer composed of deoxyribonucleotides.
The term “recombinase”, as used herein, refers to an enzymatic protein able to catalyze DNA exchange at specific target sites.
The term “circular covalently closed DNA”, “circular covalently closed plasmid DNA”, “pDNA” or “plasmid DNA” are used interchangeably throughout this patent.
The term “hairpin loop ended self-complementary double stranded covalently closed linear polyribonucleotide”, “Linear DNA”, “Loop-Ended Linear DNA” or “L-DNA” are used interchangeably throughout this patent.
The term “oligonucleotide” denotes single or double stranded nucleotide multimers that goes from about 2 to up to about 200 nucleotides in length. Suitable oligonucleotides may be prepared by a chemical method such as the phosphoramidite method or an enzymatic method.
The term “polynucleotide” refers to a single or double stranded polymer composed of nucleotide monomers.
The term “polypeptide” refers to a compound made up of a single chain of D-configured or L- configured amino acids joined by peptide bonds.
The term “complementary” refers to an interacting surface of a molecule and its target. Thus, the probe and the target can be described as complementary.
The term “anneal” refers to the process by which a single-stranded nucleic sequence pairs by hydrogen bonds to a complementary sequence, thus forming a double stranded nucleic acid sequence. This includes the reformation of complementary strands that were thermally denatured, known as renaturation.
The term “target” refers to a molecule with an affinity for a given probe. Targets may be natural or man-made molecules.
The term “promoter”, “regulatory element” or “regulatory sequence” refers to a region or genetic sequence located upstream or downstream from the start of the transcription site and which are involved in the recognition and binding of RNA polymerase and other proteins to initiate the transcription process. Selected promoters may include, but are not limited to, bacterial promoters, viral promoters, mammalian promoters, yeast promoters or combinations thereof.
The term “regulatory element” is intended to include promoters, enhancers, internal ribosome entry sites (IRES) and other expression control elements such as, but not limited to, transcription termination signals, polyadenylation signals and poly-U sequences. Regulatory elements may include those that direct constitutive expression of a nucleic acid sequence in a wide variety of cell types, tissues or organs and those that direct expression of a nucleic acid sequence only in certain cell types, tissues or organs, such as, but not limited to, cell specific regulatory sequences, tissue specific regulatory sequences and organ specific regulatory sequences.
A cell specific promoter may direct the expression of a nucleic acid sequence in a desired cell type of interest such as, but not limited to, lymphocytes, muscle cells, keratinocytes and fibroblasts. A tissue specific promoter may direct the expression of a nucleic acid sequence in a desired tissue of interest such as, but not limited to, muscle, neuron, bone, skin and blood. An organ specific promoter may direct the expression of a nucleic acid sequence in a desired organ such as, but not limited to, liver, brain, heart, lungs and kidney.
Regulatory elements may also direct expression in a temporal dependent manner such as, but not limited to, cell cycle and developmental stage.
Also encompassed by the term “regulatory element” are enhancer elements that include, but are not limited to, WPRE enhancer, CMV enhancer and HTLV-LTR enhancer.
The term “recombinant” refers to a manipulated nucleic acid. In reference to a protein (“recombinant protein”), a protein encoded by a recombinant nucleic acid.
The term “transcriptional unit” refers to a nucleic acid construct, which when introduced into a cell, results in transcription and / or translation to an RNA or polypeptide.
The term “codon optimized” refers to the alteration of codons in the open reading frame of nucleic acid sequences to reflect the typical codon usage of a selected organism without altering the polypeptide encoded by the sequence. The optimization includes replacing at least one, or more than one, or a significant number of codons with one or more codons that are more frequently used in the genes of that selected organism.
A nucleic acid sequence is “operably linked” when it is placed into a functional relation with another nucleic acid sequence.
As used herein a “host” means an individual, and can include domesticated animals such as, but not limited to, cats and dogs; livestock such as but not limited to, cattle, horses, pigs, sheep and goats; laboratory animals such as, but not limited to, mouse, rabbit, rat and guinea pig; mammalians such as, but not limited to, humans, non-human primates and primates; other animals such as, but not limited to, rodents, birds, reptiles, amphibians and fish.
LENGTH: 5.951
TYPE: DNA
ORGANISM: Artificial
FEATURE: Parental Plasmid
OTHER INFORMATION: Parental Plasmid / pDNAv_A
SEQUENCE: 1
LENGTH: 73
TYPE: DNA
ORGANISM: Artificial
FEATURE: DNA Nuclear Localization Sequence
OTHER INFORMATION: SV40E
SEQUENCE: 2
LENGTH: 42
TYPE: DNA
ORGANISM: Artificial
FEATURE: DNA Nuclear Localization Sequence
OTHER INFORMATION: NF-kB importin
SEQUENCE: 3
LENGTH: 2.268
TYPE: DNA
ORGANISM: Artificial
FEATURE: DNA Scaffold Matrix Attachment Region
OTHER INFORMATION: β-Interferon Scaffold Matrix Attachment Region
SEQUENCE: 4
LENGTH: 6
TYPE: DNA
ORGANISM: Artificial
FEATURE: Restriction Site
OTHER INFORMATION: SalI
SEQUENCE: 5
LENGTH: 6
TYPE: DNA
ORGANISM: Artificial
FEATURE: Restriction Site
OTHER INFORMATION: PstI
SEQUENCE: 6
LENGTH: 6
TYPE: DNA
ORGANISM: Artificial
FEATURE: Restriction Site
OTHER INFORMATION: NseI
SEQUENCE: 7
LENGTH: 6
TYPE: DNA
ORGANISM: Artificial
FEATURE: Restriction Site
OTHER INFORMATION: EcoRI
SEQUENCE: 8
LENGTH: 8
TYPE: DNA
ORGANISM: Artificial
FEATURE: Restriction Site
OTHER INFORMATION: SdaI
SEQUENCE: 9
LENGTH: 6
TYPE: DNA
ORGANISM: Artificial
FEATURE: Restriction Site
OTHER INFORMATION: XhoI
SEQUENCE: 10
LENGTH: 918
TYPE: DNA
ORGANISM: Artificial
FEATURE: DNA Replication Protein
OTHER INFORMATION: PI Protein
SEQUENCE: 11
LENGTH: 179
TYPE: DNA
ORGANISM: Artificial
FEATURE: Phage Origin of Replication
OTHER INFORMATION: M13 Origin of Replication (+)
SEQUENCE: 12
LENGTH: 14.657
TYPE: DNA
ORGANISM: Artificial
FEATURE: Helper Plasmid
OTHER INFORMATION: pSYTEL-Helper
SEQUENCE: 13
LENGTH: 29
TYPE: DNA
ORGANISM: Artificial
FEATURE: Promoter
OTHER INFORMATION: pTac
SEQUENCE: 14
LENGTH: 1.083
TYPE: DNA
ORGANISM: Artificial
FEATURE: Repressor Protein
OTHER INFORMATION: LacIq
SEQUENCE: 15
LENGTH: 879
TYPE: DNA
ORGANISM: Artificial
FEATURE: Repressor Protein
OTHER INFORMATION: AraC
SEQUENCE: 16
LENGTH: 714
TYPE: DNA
ORGANISM: Artificial
FEATURE: Repressor Protein
OTHER INFORMATION: Lambda Repressor
SEQUENCE: 17
LENGTH: 1.233
TYPE: DNA
ORGANISM: Artificial
FEATURE: “rolling circle” Replication Protein
OTHER INFORMATION: M13 g2p
SEQUENCE: 18
LENGTH: 1.896
TYPE: DNA
ORGANISM: Artificial
FEATURE: Recombinase
OTHER INFORMATION: N15 TelN Recombinase
SEQUENCE: 19
LENGTH: 1.818
TYPE: DNA
ORGANISM: Artificial
FEATURE: Recombinase
OTHER INFORMATION: PhiC31 Recombinase
SEQUENCE: 20
LENGTH: 708
TYPE: DNA
ORGANISM: Artificial
FEATURE: Endonuclease
OTHER INFORMATION: I-SceI Endonuclease
SEQUENCE: 21
LENGTH: 357
TYPE: DNA
ORGANISM: Artificial
FEATURE: Origin of Replication
OTHER INFORMATION: R6K Origin of Replication
SEQUENCE: 22
LENGTH: 1.136
TYPE: DNA
ORGANISM: Artificial
FEATURE: DNA Antibiotic Resistance Cassette
OTHER INFORMATION: Ampicillin Resistance Cassette
SEQUENCE: 23
LENGTH: 806
TYPE: DNA
ORGANISM: Artificial
FEATURE: DNA Antibiotic Resistance Cassette
OTHER INFORMATION: Chloramphenicol Resistance Cassette
SEQUENCE: 24
LENGTH: 5.951
TYPE: DNA
ORGANISM: Artificial
FEATURE: Parental Plasmid
OTHER INFORMATION: pDNAv A
SEQUENCE: 25
LENGTH: 5.603
TYPE: DNA
ORGANISM: Artificial
FEATURE: Parental Plasmid
OTHER INFORMATION: Parental Plasmid / pDNAv B
SEQUENCE: 26
This application claims the benefit of Provisional U.S. Pat. Application No. No.: 63/230,603 filed Aug. 06, 2021, entitled “HAIRPIN LOOP ENDED SELF-COMPLEMENTARY DOUBLE STRANDED COVALENTLY CLOSED LINEAR DNA VECTOR MANUFACTURING SYSTEM AND PROCESS, AND USES OF SAID RESULTING DNA VECTOR,” the entire content and disclosure of which, both express and implied, is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
63230603 | Aug 2021 | US |