The present invention relates to the field of retroviral vectors. In particular, the invention relates to nucleic acid constructs and methods for producing retroviral vectors and to packaging cells and producer cells for use in such methods.
Retroviral vectors are relevant for a range of applications, including gene therapy. However, progress in lentiviral gene therapy, for example, has been hampered by the requirement for production of purified lentiviral vectors with high titre.
Methods for generating retroviral packaging cell lines are known in the art (see WO 92/05266, for example). Such packaging cell lines may be used to create producer cell lines for the production of retroviral vector particles.
Lentiviral production may be performed using transient transfection of 293T cells with four plasmids: the viral genome vector plasmid and three helper plasmids which supply Gagpol, Rev and Env glycoprotein. However, transient transfection technology cannot be used to generate virus at an appropriate scale for clinical applications because very large scale transfections are not practicable.
Stable lentiviral packaging cell lines can be generated by stably integrating all four of the above mentioned plasmids into a cell line (e.g. a 293T cell line) by transient transfection, followed by a serendipitous integration event. Cells which have integrated the plasmid are selected by the co-expression of an antibiotic resistance gene from the integrated plasmid. This approach is limited for a number of reasons. Firstly, stable integration following transient transfection is very inefficient. Second, antibiotic selection works well for a limited range of antibiotics, for example Zeocin™ and puromycin, but options are limited when attempting to select for the integration of four different genes and the available selection systems are not optimal. Thirdly, it is difficult to determine the relative expression obtained from each individual integration event.
Thus there is a need for retroviral, in particular lentiviral, production methods which are not associated with the disadvantages outlined above.
The present inventors provide a retroviral production system which is based on the co-expression of viral proteins and detectable marker proteins which can be quantified at the level of a single cell. In particular, the system enables the expression level of different retroviral proteins in packaging cells to be tracked and packaging cells with ideal expression ratios for different retroviral components to be selected. For example, the expression level of the detectable marker may be determined using flow cytometry to identify and select packaging cells. Because the marker protein is co-expressed with the viral protein, packaging cell which express a particular viral protein or a combination of viral proteins at levels or ratios which are advantageous for a retroviral production system can be identified and selected.
In a first aspect the present invention provides a nucleic acid construct comprising:
The nucleic acid construct may comprise the following structure:
A-X-B
in which
The co-expression sequence may comprise an IRES sequence or a sequence encoding a self-cleaving peptide.
The co-expression sequence may be selected from a 2A self-cleaving peptide from an aphtho- or a cardiovirus, a 2A-like peptide, an internal ribosome entry site, a furin cleavage site or a Tobacco Etch Virus cleavage site.
The co-expression sequence may be a 2A self-cleaving peptide from an aphtho- or a cardiovirus or a 2A-like peptide.
The retroviral protein may be selected from gag-pol, env or rev.
The cell surface protein may be less than 200 amino acids.
The membrane targeting domain of the cell surface protein may comprise (i) a transmembrane domain and an endodomain; or (ii) a GPI anchor. The transmembrane domain may comprise a CD8 stalk or a part thereof.
The extracellular domain of the cell surface protein may comprise HA, V5, RQR8 or MYC
The cell surface protein may comprise an amino acid sequence shown as SEQ ID NO: 21-26 or a variant thereof which has at least 80% sequence identity.
The nucleic acid construct may further comprise transposon sequences flanking the A-X-B nucleic acid sequence. The transposon sequences may comprise the sequence shown as SEQ ID NO: 60 and 61 or a variant thereof sharing at least 80% sequence identity.
In a second aspect the present invention provides a kit comprising a plurality of nucleic acid constructs according to the present invention.
The kit may comprise:
The kit may further comprise:
The kit may also comprise:
In a third aspect the present invention provides a plasmid comprising a nucleic acid construct of the present invention.
In a fourth aspect the present invention provides a packaging cell comprising a nucleic acid sequence of the invention which expresses at least one detectable marker which is a cell surface protein comprising an extracellular domain and a membrane targeting domain.
The term “co-expresses” means that the retroviral protein and the detectable marker are expressed together or at the same time. As a result, the expression level of the retroviral protein can be determined indirectly by assaying the expression level of the detectable marker with which it is co-expressed.
The packaging cell may co-express each of gag-pol, env and optionally rev with a different detectable marker.
The packaging cell may comprise:
The packaging cell may further comprise:
In a fourth aspect the present invention provides a producer cell which comprises:
The nucleic acid construct(s) may be stably integrated into the cell genome.
The packaging or producer cell may be a HEK293, HEK293-T, TE671 or HT1080, 3T3, or K562 cell.
In another aspect the present invention provides a method for making a packaging cell or producer cell of the invention which comprises the step of introducing one or more nucleic acid construct(s) of the invention, one or more plasmid(s) of the invention, a kit of nucleic acid constructs of the invention, or a kit of plasmids of the invention, into a cell.
In another aspect the present invention provides a method for selecting a packaging or producer cell of the invention by selecting for expression of the or each detectable marker encoded by the or each nucleic acid construct.
The cell may express a plurality of detectable markers and may be selected using multi-parameter flow cytometry.
The method may comprise the following steps:
Step (a) may further comprise introducing into the plurality of cells:
Step (a) may further comprise introducing into the plurality of cells:
Step (a) may also comprise introducing a nucleic acid sequence encoding piggyBAC transposase.
In another aspect the invention provides a method for producing a retroviral vector which comprises the steps of culturing a producer cell of the present invention and isolating the retroviral vector.
The retroviral vector may be a lentiviral vector.
The present invention provides a nucleic acid construct comprising
The nucleic acid construct may comprise the structure: A-X-B; in which A is the first nucleic acid sequence, B is the second nucleic acid sequence and X is a co-expression sequence.
The nucleic acid sequence may be an RNA or DNA sequence or a variant thereof.
The concept of using viral vectors for gene therapy is well known (Verma and Somia (1997) Nature 389:239-242). As used herein the term “retroviral vector”, when referring to a retroviral vector system also includes a retroviral vector particle capable of transducing a recipient cell with a nucleotide of interest (NOI).
A retroviral vector particle includes the following components: a vector genome, which may contain one or more NOIs, a nucleocapsid encapsidating the nucleic acid, and a membrane surrounding the nucleocapsid.
The term “nucleocapsid” refers to at least the group specific viral core proteins (gag) and the viral polymerase (pol) of a retrovirus genome. These proteins encapsidate the packagable sequences and are themselves further surrounded by a membrane containing an envelope glycoprotein.
The term “vector genome” refers to both to the RNA construct present in the retroviral vector particle and the integrated DNA construct. The term also embraces a separate or isolated DNA construct capable of encoding such an RNA genome. A retroviral genome should comprise at least one component part derivable from a retrovirus. The term “derivable” is used in its normal sense as meaning a nucleotide sequence or a part thereof which need not necessarily be obtained from a virus such as a lentivirus but instead could be derived therefrom. By way of example, the sequence may be prepared synthetically or by use of recombinant DNA techniques.
There are many retroviruses. For the present application, the term “retrovirus” includes, but is not limited to: murine leukemia virus (MLV), human immunodeficiency virus (HIV), equine infectious anaemia virus (EIAV), mouse mammary tumour virus (MMTV), Rous sarcoma virus (RSV), Fujinami sarcoma virus (FuSV), Moloney murine leukemia virus (Mo-MLV), FBR murine osteosarcoma virus (FBR MSV), Moloney murine sarcoma virus (Mo-MSV), Abelson murine leukemia virus (A-MLV), Avian myelocytomatosis virus-29 (MC29), and Avian erythroblastosis virus (AEV) and all other retroviridiae including lentiviruses.
A detailed list of retroviruses may be found in Coffin et al (“Retroviruses” 1997 Cold Spring Harbour Laboratory Press Eds: J M Coffin, S M Hughes, H E Varmus pp 758-763).
In a preferred embodiment, the retroviral vector is derivable from a lentivirus.
Lentiviral vectors are major tools for gene delivery, providing efficient transduction of a wide variety cell types such as hematopoietic stem cells, neurons and endothelial cells. The advantages of lentiviral vectors over other systems are the ability to infect both dividing and non-dividing cells in vivo and in vitro and their greater packaging capacity that enables the expression of larger RNA transcripts.
The lentivirus group can be split into “primate” and “non-primate”. Examples of primate lentiviruses include the human immunodeficiency virus (HIV), the causative agent of human acquired immunodeficiency syndrome (AIDS), and the simian immunodeficiency virus (SIV). The non-primate lentiviral group includes the prototype “slow virus” visna/maedi virus (VMV), as well as the related caprine arthritis-encephalitis virus (CAEV), equine infectious anaemia virus (EIAV) and the more recently described feline immunodeficiency virus (FIV) and bovine immunodeficiency virus (BIV).
Details on the genomic structure of some lentiviruses may be found in the art. By way of example, details on HIV and EIAV may be found from the NCBI Genbank database (i.e. Genome Accession Nos. AF033819 and AF033820 respectively). Details of HIV variants may also be found at http://hiv-web.IanI.gov. Details of EIAV variants may be found through http://www.ncbi.nIm.nih.gov.
Lentiviruses have three main genes coding for the viral proteins in the order: 5″-gag-pol-env-3′. There are two regulatory genes, tat and rev. There are additional accessory genes depending on the virus (e.g., for HIV-1: vif, vpr, vpu, nef) whose products are involved in regulation of synthesis and processing viral RNA and other replicative functions. The Long terminal repeat (LTR) is about 600 nt long, of which the U3 region is 450, the R sequence 100 and the U5 region some 70 nt long.
Viral proteins involved in early stages of replication include Reverse Transcriptase and Integrase. Reverse Transcriptase is the virally encoded RNA-dependent DNA polymerase. The enzyme uses the viral RNA genome as a template for the synthesis of a complementary DNA copy. Reverse transcriptase also has RNaseH activity for destruction of the RNA-template. Integrase binds both the viral cDNA generated by reverse transcriptase and the host DNA. Integrase processes the LTR before inserting the viral genome into the host DNA. Tat acts as a trans-activator during transcription to enhance initiation and elongation. The Rev responsive element acts post-transcriptionally, regulating mRNA splicing and transport to the cytoplasm.
The retroviral protein may be Gag, Pol, Env or Rev.
Group-specific antigen (gag) proteins are major components of the viral capsid, which are about 2000-4000 copies per virion. Pol proteins are responsible for synthesis of viral DNA and integration into host DNA after infection. Env proteins play a role in association and entry of virions into the host cell. A functional copy of an env gene is what makes retroviruses distinct from retroelements. The ability of the retrovirus to bind to its target host cell using specific cell-surface receptors is given by the surface component (SU) of the Env protein, while the ability of the retrovirus to enter the cell via membrane fusion is imparted by the membrane-anchored trans-membrane component (TM). Thus the Env protein is what enables the retrovirus to be infectious.
The retroviral protein may be gagpol. The nucleic acid sequence which encodes gagpol may comprise the nucleic acid sequence shown as SEQ ID NO: 9. The nucleic acid sequence which encodes gagpol may comprise a variant of SEQ ID NO: 9 which shares at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% or at least 99% sequence identity to SEQ ID NO: 9 and encodes functional Gag and Pol proteins.
GATAGGGGGGCAGCTCAAGGAGGCTCTCCTGGCCACCGGAGCAGACGACACCGTGCTGGAGGAGATGTCACTCCCTGGTCGGTGGAAGCCTAAG
ATGATTGGTGGTATAGGGGGCTTCATTAAGGTGCGGCAATACGACCAAATCTTGATCGAGATTTGCGGCCACAAGGCCATCGGCACCGTGCTGG
TGGGCCCCACCCCCGTGAATATCATCGGCCGGAACCTCCTCACCCAAATCGGCTGTACCCTGAACTTCCCTATCTCTCCCATCGAAACCGTGCC
CGTGAAGCTGAAACCCGGCATGGACGGGCCCAAGGTGAAGCAGTGGCCCCTCACCGAGGAGAAGATCAAGGCCCTGGTGGAGATCTGCACCGAA
ATGGAGAAAGAGGGCAAGATCAGCAAGATCGGCCCCGAGAACCCCTATAACACCCCCGTGTTCGCTATCAAAAAGAAGGATTCCACCAAGTGGC
GGAAGCTGGTGGACTTTCGGGAGTTGAACAAACGGACCCAGGATTTTTGGGAGGTGCAGCTGGGCATCCCCCACCCTGCCGGCCTGAAGAAAAA
GAAGAGCGTGACCGTGCTCGACGTCGGCGACGCCTACTTCAGCGTGCCTCTGGACGAGGATTTTCGCAAATACACCGCCTTCACAATCCCCTCC
ATCAATAACGAAACCCCCGGCATCCGGTACCAATATAACGTCTTGCCCCAAGGCTGGAAGGGCAGCCCCGCCATCTTTCAGTCCTCTATGACCA
AGATTCTGGAACCCTTCCGGAAGCAGAACCCCGATATCGTGATTTACCAGTATATGGACGACCTCTACGTGGGCAGCGATCTGGAGATCGGCCA
ACACCGGACCAAGATCGAAGAACTCCGGCAGCACCTCCTCCGCTGGGGCTTGACAACCCCCGATAAGAAGCACCAAAAGGAGCCTCCCTTTTTG
TGGATGGGCTACGAGTTGCACCCCGACAAGTGGACCGTGCAACCCATCGTCCTCCCCGAGAAGGATTCTTGGACCGTGAACGATATCCAAAAAC
TGGTCGGCAAGCTCAACTGGGCCTCCCAAATCTATCCCGGCATCAAGGTGCGCCAGCTGTGCAAGTTGTTGCGGGGCACAAAGGCGTTGACCGA
GGTGATCCCCTTGACCGAGGAGGCCGAATTGGAGCTCGCCGAGAATCGGGAAATCTTGAAGGAGCCCGTGCACGGCGTCTACTACGATCCCAGC
AAGGATCTGATCGCCGAGATCCAAAAACAAGGCCAGGGGCAGTGGACCTACCAGATCTACCAGGAACCCTTCAAGAACCTCAAGACCGGCAAGT
ACGCCCGGATGAGAGGCGCCCATACCAACGACGTGAAGCAGCTGACCGAAGCCGTCCAGAAGATCACAACCGAGTCTATCGTGATCTGGGGCAA
AACCCCCAAGTTCAAGCTCCCTATCCAGAAGGAAACGTGGGAAACCTGGTGGACCGAATACTGGCAGGCTACATGGATTCCCGAATGGGAGTTC
GTGAACACACCCCCTCTGGTCAAGCTGTGGTATCAACTGGAAAAGGAGCCTATCGTGGGCGCCGAGACATTTTACGTGGACGGCGCTGCCAATC
GCGAAACCAAGCTGGGCAAGGCCGGCTACGTGACCAATCGGGGCCGCCAGAAGGTGGTGACATTGACCGATACCACCAACCAGAAAACCGAACT
GCAGGCCATCTACTTGGCCCTCCAAGACAGCGGCCTGGAGGTGAATATCGTGACCGATAGCCAGTACGCCCTGGGCATTATCCAGGCCCAGCCC
GACCAGTCCGAGAGCGAACTGGTGAACCAGATCATCGAACAACTGATCAAGAAAGAGAAAGTGTACCTCGCCTGGGTGCCCGCCCATAAGGGGA
TCGGCGGCAACGAGCAGGTGGACAAGCTGGTGTCCGCCGGCATTCGCAAGGTGTTGTTCCTGGACGGCATCGACAAAGCTCAGGACGAGCACGA
AAAGTACCATTCCAACTGGCGGGCCATGGCCTCCGACTTCAATTTGCCACCCGTGGTGGCCAAGGAGATCGTGGCTTCTTGCGACAAGTGCCAA
TTGAAGGGCGAGGCTATGCACGGCCAGGTGGATTGCTCCCCCGGCATCTGGCAGTTGGACTGCACCCACCTGGAGGGCAAGGTGATTCTCGTGG
CCGTGCACGTGGCTTCCGGCTACATCGAGGCTGAGGTGATCCCGGCCGAAACCGGCCAAGAGACTGCCTACTTCTTGCTGAAGCTGGCCGGCAG
GTGGCCCGTAAAGACCATCCACACCGATAACGGGTCTAACTTTACATCCGCCACCGTGAAAGCTGCTTGCTGGTGGGCAGGCATTAAACAAGAG
TTCGGCATCCCTTATAACCCTCAGTCCCAGGGCGTGGTGGAGAGCATGAACAAGGAGCTGAAAAAGATCATCGGCCAAGTGCGGGACCAAGCCG
AGCACTTGAAAACCGCCGTGCAGATGGCCGTGTTTATTCATAACTTCAAGCGGAAGGGCGGCATCGGCGGCTATTCCGCCGGTGAGCGGATCGT
GGATATCATCGCCACCGATATCCAGACCAAGGAGCTGCAGAAGCAGATCACCAAGATCCAGAACTTCAGAGTGTACTATCGCGATTCTCGGAAC
CCCTTGTGGAAGGGGCCAGCCAAATTGTTGTGGAAGGGGGAGGGCGCCGTGGTGATCCAGGACAACTCCGATATCAAGGTGGTCCCGCGGAGGA
AGGCCAAAATTATCCGCGACTACGGCAAGCAAATGGCCGGCGACGACTGCGTCGCCTCCCGGCAAGACGAGGACTGA
The retroviral env protein may be the RD114 SU or TM protein or the RDPRO SU or TM protein. In one embodiment, the env protein is VSV-G (glycoprotein G of the Vesicular stomatitis virus (VSV)). VSV-G is a commonly used env protein for lentiviral pseudotyping as it is capable of transducing all cell types.
The retroviral env protein may comprise the sequence shown as SEQ ID NO: 10. The retroviral env protein may comprise a variant of SEQ ID NO: 10 which retains ability to provide a functional env protein. The variant may share at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% or at least 99% sequence identity to SEQ ID NO: 10.
Suitably, ‘Retroviral envelope protein’ refers to the SU and/or TM proteins, as described above in relation to lentiviruses.
In one embodiment the packaging or producer cell is used to produce a lentivirus vector and retroviral protein may be Rev. Rev is specifically expressed by lentivirus and acts post-transcriptionally, regulating mRNA splicing and transport to the cytoplasm.
The detectable marker is a cell surface protein. The cell surface protein may be any cell surface protein which is not natively expressed on the surface of the cell type which is used as the packaging cell or producer cell.
The cell surface protein comprises a membrane targeting domain and an extracellular domain. When expressed at the cell surface at least one domain of the cell surface protein is exoplasmic (i.e. on the exterior of the cell). This domain of the cell surface protein is therefore accessible for antibody binding.
In one embodiment, the detectable marker is a cell surface protein where the expression level can be determined at the single cell level. In particular, the detectable marker may be detected without disrupting the cell. Because the expression level of the detectable marker can be determined at the single cell level, the co-expression of the detectable marker with a retroviral protein means that the expression level of the retroviral protein within a packaging cell can also be determined.
This is in contrast to the selectable antibiotic resistance markers previously used to identify packaging cells expressing a retroviral protein. In particular, although the use of selectable antibiotic resistance markers enables cells which express the resistance marker—and therefore the retroviral protein—to be selected, it is difficult to determine the relative expression obtained from each individual integration event and thus the level of expression of each retroviral protein in a particular cell.
Accordingly, the detectable marker of the present invention is not an antibiotic resistance marker.
For example, the expression level of the detectable marker may be determined using flow cytometry. Methods for using techniques such as flow cytometry to determine the expression levels of proteins are well known in the art. As such, the detectable marker may be a cell surface protein which can be detected using flow cytometry by using a reagent(s) which allows expression of the cell surface protein to be determined. For example the reagent may be an antibody, for example a labelled antibody such as a fluorescently labelled antibody which specifically binds the cell surface protein. The detectable marker may be a fluorescent protein which is inherently detectable by flow cytometry due to its fluorescent characteristics.
In one embodiment the cell surface protein is less than 200 amino acids in length. The use of a cell surface protein of less than 200 amino acids is advantageous because the smaller nucleic acid sequences which encode such proteins are easier to transfect and stably integrate into cells.
A ‘membrane targeting domain’ is an entity which preferentially localises to the membrane and therefore anchors the cell surface protein to the membrane of, for example, a packaging cell.
A transmembrane domain is a hydrophobic alpha helix which spans a cell membrane and typically found in transmembrane proteins. It may comprise a hydrophobic alpha helix. The transmembrane domain may be derived from CD8, CD28 or human IgG.
The transmembrane domain may be derived from any type I transmembrane protein. The transmembrane domain may be a synthetic sequence predicted to form a hydrophobic helix.
The transmembrane domain may comprise the sequence shown as SEQ ID NO: 12.
The membrane targeting domain may be a transmembrane domain and an endodomain, which orientates to the interior of the packaging cell. The endodomain may comprise polar residues which anchor the cell surface protein to the membrane.
The membrane targeting domain may be a GPI anchor.
GPI anchoring is a post-translational modification which occurs in the endoplasmic reticulum. Preassembled GPI anchor precursors are transferred to proteins bearing a C-terminal GPI signal sequence (see Kinoshita et al.; J Biochem; 122, 251-257 (1997)). During processing, the GPI anchor replaces the GPI signal sequence and is linked to the target protein via an amide bond. The GPI anchor targets the mature protein to the membrane.
The present tagging protein may comprise a GPI signal sequence. For example, the tagging protein may comprise a sequence shown as SEQ ID NO: 13.
The extracellular domain is the part of the cell surface protein that exoplasmic (i.e. on the exterior of the cell) when the cell surface protein is expressed at the cell surface.
The extracellular domain may be any polypeptide sequence which comprises an epitope that can be specifically bound by an antibody.
In one embodiment the extracellular domain is less than 200 amino acids in length.
The extracellular domain may comprise a HA, V5, RQR-8 or MYC tag.
Human influenza hemagglutinin (HA) is a surface glycoprotein required for the infectivity of the human virus. The HA tag is derived from the HA-molecule corresponding to amino acids 98-106. It has been extensively used as a general epitope tag in expression vectors. The HA tag may comprise the sequence shown as SEQ ID NO: 14.
The V5 epitope tag (V5) is derived from a small epitope (Pk) present on the P and V proteins of the paramyxovirus of simian virus 5 (SV5). The V5 tag may comprise the sequence shown as SEQ ID NO: 15 or 16.
A myc tag (MYC) is a polypeptide protein tag derived from the c-myc gene product. The MYC tag may comprise the sequence shown as SEQ ID NO: 17.
The extracellular domain may comprise a domain which comprises a rituximab-binding epitope (R epitope) and/or a Qbend10 epitope (Q epitope). A rituximab-binding epitope refers to an epitope which specifically binds rituximab. For example, the rituximab-binding epitope may be based on the CD20 B-cell antigen.
The Rituximab-binding epitope sequence from CD20 is CEPANPSEKNSPSTQYC (SEQ ID NO: 18)
The CliniMACS CD34 selection system utilises the QBEnd10 monoclonal antibody to achieve cellular selection. The present inventors have previously mapped the QBEnd10-binding epitope from within the CD34 antigen (see WO 2013/153391) and determined it to have the amino acid sequence shown as SEQ ID NO: 19.
The binding domain of the cell surface protein may comprise a QBEnd10-binding epitope having the amino acid sequence shown as SEQ ID NO: 19 or a variant thereof which retains QBEnd10-binding activity.
The extracellular domain may comprise a binding domain which comprises or consists of 136 amino acid sequence shown as SEQ ID NO: 20.
The cell surface marker protein may comprise or consist of a sequence shown as SEQ ID NO: 21 to 26.
The cell surface protein may comprise a variant of an extracellular domain or a cell surface marker protein as described herein. For example, the extracellular domain may be based on a sequence provided herein but comprises one or more amino acid mutations, such as amino acid insertions, substitutions or deletions, provided that the epitope retains binding activity of the entity or epitope on which it is based. In particular, the sequence may be truncated at one or both terminal ends by, for example, one or two amino acids.
Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as long as binding activity of the epitope is retained. For example, negatively charged amino acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; and amino acids with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine, valine, glycine, alanine, asparagine, glutamine, serine, threonine, phenylalanine, and tyrosine.
Conservative substitutions may be made, for example according to the Table below. Amino acids in the same block in the second column and in the same line in the third column may be substituted for each other.
The binding domain may, for example, contain 5 or fewer, 4 or fewer, 3 or fewer, 2 or fewer or 1 amino acid mutation(s) compared to a binding domain sequence as shown herein.
The extracellular domain may consist essentially of a domain sequence as shown herein or a variant thereof which retains the ability to bind an antibody which binds to the domain on which it is based. The extracellular domain may consist of a sequence as shown herein or a variant thereof which retains the ability to bind an antibody which binds to the domain on which it is based.
The cell surface protein may comprise the sequence shown as SEQ ID NO: 14 to 26 or a variant thereof having 80% identity thereto which retains the ability to be expressed at the cell surface and recognised by reagent, for example, an antibody which binds to one of SEQ ID NO: 14 to 26.
The cell surface protein may comprise the sequence shown as SEQ ID NO: 14 to 26 or a variant thereof having 85% identity thereto which retains the ability to be expressed at the cell surface and recognised by an antibody or a ligand which binds to one of SEQ ID NO: 14 to 26
The cell surface protein may comprise the sequence shown as SEQ ID NO: 14 to 26 or a variant thereof having 90% identity thereto which retains the ability to be expressed at the cell surface and recognised by an antibody or a ligand which binds to one of SEQ ID NO: 14 to 26
The cell surface protein may comprise the sequence shown as SEQ ID NO: 14 to 26 or a variant thereof having 95% identity thereto which retains the ability to be expressed at the cell surface and recognised by an antibody or a ligand which binds to one of SEQ ID NO: 14 to 26
The cell surface protein may comprise the sequence shown as SEQ ID NO: 14 to 26 or a variant thereof having 99% identity thereto which retains the ability to be expressed at the cell surface and recognised by an antibody or a ligand which binds to one of SEQ ID NO: 14 to 26
The cell surface protein may also comprise a signal sequence so that when the protein is expressed inside a cell the nascent protein is directed to the endoplasmic reticulum (ER).
The core of the signal peptide may contain a long stretch of hydrophobic amino acids that has a tendency to form a single alpha-helix. The signal peptide may begin with a short positively charged stretch of amino acids, which helps to enforce proper topology of the polypeptide during translocation. At the end of the signal peptide there is typically a stretch of amino acids that is recognized and cleaved by signal peptidase. Signal peptidase may cleave either during or after completion of translocation to generate a free signal peptide and a mature protein. The free signal peptides are then digested by specific proteases.
The signal peptide may be at the amino terminus of the molecule.
The signal peptide may comprise the SEQ ID NO: 1 or 2 or a variant thereof having 5, 4, 3, 2 or 1 amino acid mutations (insertions, substitutions or additions) provided that the signal peptide still functions to cause secretion of the bi-specific molecule.
The signal peptides of SEQ ID NO: 1 and 2 are compact and highly efficient. They are predicted to give about 95% cleavage after the terminal glycine, giving efficient removal by signal peptidase.
The cell surface marker protein as described herein may lack a signal sequence. For example, the cell surface marker shown as SEQ ID NO: 21-26 may not include the sequence annotated of the signal sequence. In particular, the cell surface marker protein will not include the signal sequence when the protein is present at the cell surface.
In one embodiment the retroviral protein and the detectable marker are transcribed as a nucleic acid which comprises a nucleic acid sequence which encodes the retroviral transfer vector or retroviral protein, a co-expression sequence and a nucleic acid sequence which encodes the detectable marker, wherein the co-expression sequence enables the retroviral transfer vector or retroviral protein and the detectable marker to be expressed as separate polypeptides.
The co-expression sequence may be a 2A self-cleaving peptide from an aphtho- or a cardiovirus, a 2A-like peptide, an internal ribosome entry site, a self-cleaving peptide, a furin cleavage site or a Tobacco Etch Virus cleavage site.
An internal ribosome entry site (IRES) is a nucleotide sequence that allows for translation initiation in the middle of a messenger RNA (mRNA).
The co-expression sequence may be a sequence encoding a cleavage site positioned between nucleic acid sequences which encode the retroviral protein and detectable marker, such that the retroviral protein and detectable marker can be expressed as separate entities.
The cleavage site may be any sequence which enables the polypeptide comprising the retroviral protein and detectable marker to become separated.
The term “cleavage” is used herein for convenience, but the cleavage site may cause the retroviral protein and detectable marker ‘to separate into individual entities by a mechanism other than classical cleavage. For example, for the Foot-and-Mouth disease virus (FMDV) 2A self-cleaving peptide (see below), various models have been proposed for to account for the “cleavage” activity: proteolysis by a host-cell proteinase, autoproteolysis or a translational effect (Donnelly et al (2001) J. Gen. Virol. 82:1027-1041). The exact mechanism of such “cleavage” is not important for the purposes of the present invention, as long as the cleavage site, when positioned between nucleic acid sequences which encode retroviral protein and detectable marker, causes the retroviral protein and detectable marker to be expressed as separate entities.
The cleavage site may be a furin cleavage site.
Furin is an enzyme which belongs to the subtilisin-like proprotein convertase family. The members of this family are proprotein convertases that process latent precursor proteins into their biologically active products. Furin is a calcium-dependent serine endoprotease that can efficiently cleave precursor proteins at their paired basic amino acid processing sites. Examples of furin substrates include proparathyroid hormone, transforming growth factor beta 1 precursor, proalbumin, pro-beta-secretase, membrane type-1 matrix metalloproteinase, beta subunit of pro-nerve growth factor and von Willebrand factor. Furin cleaves proteins just downstream of a basic amino acid target sequence (canonically, Arg-X-(Arg/Lys)-Arg′) (SEQ ID NO: 3) and is enriched in the Golgi apparatus.
The cleavage site may be a Tobacco Etch Virus (TEV) cleavage site.
TEV protease is a highly sequence-specific cysteine protease which is chymotrypsin-like proteases. It is very specific for its target cleavage site and is therefore frequently used for the controlled cleavage of fusion proteins both in vitro and in vivo. The consensus TEV cleavage site is ENLYFQ\S (SEQ ID NO: 4) (where ‘\’ denotes the cleaved peptide bond). Mammalian cells, such as human cells, do not express TEV protease. Thus in embodiments in which the present nucleic acid construct comprises a TEV cleavage site and is expressed in a mammalian cell—exogenous TEV protease must also expressed in the mammalian cell.
The cleavage site may encode a self-cleaving peptide.
A ‘self-cleaving peptide’ refers to a peptide which functions such that when the polypeptide comprising the retroviral protein and detectable marker and the self-cleaving peptide is produced, it is immediately “cleaved” or separated into distinct and a discrete retroviral protein and a detectable marker without the need for any external cleavage activity.
The self-cleaving peptide may be a 2A self-cleaving peptide from an aphtho- or a cardiovirus. The primary 2A/2B cleavage of the aptho- and cardioviruses is mediated by 2A “cleaving” at its own C-terminus. In apthoviruses, such as foot-and-mouth disease viruses (FMDV) and equine rhinitis A virus, the 2A region is a short section of about 18 amino acids, which, together with the N-terminal residue of protein 2B (a conserved proline residue) represents an autonomous element capable of mediating “cleavage” at its own C-terminus.
The C-terminal 19 amino acids of the longer cardiovirus protein, together with the N-terminal proline of 2B mediate “cleavage” with an efficiency approximately equal to the apthovirus FMDV 2a sequence. Cardioviruses include encephalomyocarditis virus (EMCV) and Theiler's murine encephalitis virus (TMEV).
Mutational analysis of EMCV and FM DV 2A has revealed that the motif DxExNPGP (SEQ ID NO: 5) is intimately involved in “cleavage” activity (Donelly et al (2001) as above).
The cleavage site of the present invention may comprise the amino acid sequence: Dx1Ex2NPGP, where x1 and x2 are any amino acid (SEQ ID NO: 5). Suitably, X1 may be selected from the following group: I, V, M and S. X2 may be selected from the following group: T, M, S, L, E, Q and F (as shown in SEQ ID NO: 8).
For example, the cleavage site may comprise one of the amino acid sequences shown in Table 1.
The cleavage site, based on a 2A sequence may be, for example 15-22 amino acids in length. The sequence may comprise the C-terminus of a 2A protein, followed by a proline residue (which corresponds to the N-terminal proline of 2B).
Mutational studies have also shown that, in addition to the naturally occurring 2A sequences, some variants are also active. The cleavage site may correspond to a variant sequence from a naturally occurring 2A polypeptide, have one, two or three amino acid substitutions, which retains the capacity to induce the “cleavage” of a polyprotein sequence into two or more separate proteins.
The cleavage sequence may be selected from the following which have all been shown to be active to a certain extent (Donnelly et al (2001) as above):
Based on the sequence of the DxExNPGP “a motif, “2A-like” sequences have been found in picornaviruses other than aptho- or cardioviruses, ‘picornavirus-like’ insect viruses, type C rotaviruses and repeated sequences within Trypanosoma spp and a bacterial sequence (Donnelly et al (2001) as above). The cleavage site may comprise one of these 2A-like sequences, such as:
The cleavage site may comprise the 2A-like sequence shown as SEQ ID NO: 55 (RAEGRGSLLTCGDVEENPGP).
It has been shown that including an N-terminal “extension” of between 5 and 39 amino acids can increase activity (Donnelly et al (2001) as above). In particular, the cleavage sequence may comprise one of the following sequences or a variant thereof having, for example, up to 5 amino acid changes which retains cleavage site activity:
Various transposon elements are well known in the art. Any suitable transposon may be used in the present invention.
DNA transposons are genetic elements that can mobilize from one location to another in the host genome. Their use in engineering mammalian cells was initially hampered by the lack of active transposons in mammals. Over a decade ago, the first active DNA transposon in mammals, Sleeping Beauty, was reconstructed from fossilized transposon sequences found in the salmonid genome. This discovery made transposon mediated genetic modification of mammalian cells possible. This method is much more efficient at generating cell lines with modified genomes than simple transfection.
Since the generation of the Sleeping Beauty transposon, a number of transposons from different families have been reported to show active transposition in mammalian cells (Skipper et al, J of Biomed Sci 2013, 20:92). Currently, along with Sleeping Beauty, the piggyBac (PB) transposon isolated from cabbage looper moth Trichoplusia ni is most promising because of a variety of unique characteristics, namely exhibiting the most efficient transposition in mammalian cells, the ability of the transposase to form functional protein fusions, large cargo capacity, and traceless excision. More importantly, PB has been found capable of mediating stable integration of up to 4 independent transposons simultaneously in human cells following a single transfection. A more active form of piggyback has been described (Yusa et al, PNAS, 2011, vol. 108, no.4).
In one embodiment of the invention, the transposon element or sequence comprises (or consists of) a piggyBac transposon. During transposition, the PB transposase recognizes transposon-specific inverted terminal repeat sequences (ITRs) located on both ends of the transposon vector and efficiently moves the contents from the original sites and integrates them into TTAA chromosomal sites.
Unique features of PiggyBac transposons are that there is relatively no cargo limit as it can carry a transgene of up to 14 kbp and it is also reversible: genomes containing an inserted PiggyBac vector can be transiently re-transfected with the PB tranposase expression vector and the transposase will remove the transposons from the genome, footprint-free.
In one embodiment, the nucleic acid construct may comprise a PiggyBAC 5′ terminal repeat and a PiggyBAC 3′ terminal repeat such that the construct can insert into the host genome. The PiggyBAC 5′ and 3′ terminal repeats may flank the sequence which is to be inserted into the host genome.
The PiggyBAC 5′ Terminal Repeat may comprise or consist of a variant which shares at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% sequence identity with SEQ ID NO: 60 and retains the ability to recruit an RdRp. The PiggyBAC 5′ Terminal Repeat may comprise or consist of SEQ ID NO: 60.
The PiggyBAC 5′ Terminal Repeat may comprise or consist of a truncated version of SEQ ID NO: 60 or truncated version of a variant of SEQ ID NO: 60 which retains ability to facilitate insertion into the host genome.
The PiggyBAC 3′ Terminal Repeat may comprise or consist of a variant which shares at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% sequence identity with SEQ ID NO: 61 and retains the ability to recruit an RdRp. The PiggyBAC 3′ Terminal Repeat may comprise or consist of SEQ ID NO: 61.
The PiggyBAC 3′ Terminal Repeat may comprise or consist of a truncated version of SEQ ID NO: 61 or truncated version of a variant of SEQ ID NO: 61 which retains ability to facilitate insertion into the host genome.
In one aspect the present invention provides a plasmid comprising a nucleic acid construct of the present invention.
The term plasmid covers any DNA transcription unit comprising a nucleic acid construct according to the invention and the elements necessary for its in vivo expression in a desired cell; and, in this regard, it is noted that a supercoiled or non-supercoiled, circular plasmid, as well as a linear form, are intended to be within the scope of the invention.
A plasmid may comprise the nucleic acid construct according to the invention, operably linked to a promoter or under the control of a promoter or dependent upon a promoter. In general, it is advantageous to employ a strong promoter functional in eukaryotic cells. The strong promoter may be, but not limited to, the immediate early cytomegalovirus promoter (CMV-IE) of human or murine origin, or optionally having another origin such as the rat or guinea pig.
In more general terms, the promoter has either a viral, or a cellular origin. A strong viral promoter other than CMV-IE that may be usefully employed in the practice of the invention is the early/late promoter of the SV40 virus or the LTR promoter of the Rous sarcoma virus. A strong cellular promoter that may be usefully employed in the practice of the invention is the promoter of a gene of the cytoskeleton, such as e.g. the desmin promoter (Kwissa et al., 2000), or the actin promoter (Miyazaki et al., 1989).
As to the polyadenylation signal (polyA) for the plasmids use can be made of the poly(A) signal of the bovine growth hormone (bGH) gene (see U.S. Pat. No. 5,122,458), or the poly(A) signal of the rabbit β-globin gene or the poly(A) signal of the SV40 virus.
The present invention further provides a kit comprising a plurality of nucleic acid constructs of the invention.
In one embodiment the kit may comprise:
The kit may further comprise:
The kit may also comprise:
In another aspect the present invention provides a kit comprising a plurality of plasmids according to the invention.
As used herein, the term “packaging cell” refers to a cell which contains those elements necessary for production of infectious recombinant virus which are lacking in the RNA genome. Such packaging cells are capable of expressing viral structural proteins (such as gag-pol and env, which may be codon optimised) but they do not contain a packaging signal. The term “packaging signal” which is referred to interchangeably as “packaging sequence” or “psi” is used in reference to the non-coding, cis-acting sequence required for encapsidation of retroviral RNA strands during viral particle formation. In HIV-1, this sequence has been mapped to loci extending from upstream of the major splice donor site (SD) to at least the gag start codon.
As used herein, the term “producer cell” refers to a cell which contains all the elements necessary for production of retroviral vector particles.
The producer cells/packaging cells of the present invention may be any suitable cell type. Producer cells are generally mammalian cells but can be, for example, insect cells.
By using producer/packaging cell lines, it is possible to propagate and isolate quantities of retroviral vector particles (e.g. to prepare suitable titres of the retroviral vector particles) for subsequent transduction of a site of interest.
The packaging cell lines are useful for providing the gene products necessary to encapsidate and provide a membrane protein for a high titre vector particle production. The packaging cell may be a cell cultured in vitro such as a tissue culture cell line. Suitable cell lines include but are not limited to mammalian cells such as murine fibroblast derived cell lines or human cell lines. The packaging cell line may be a human cell line, such as for example: HEK293, 293-T, TE671, HT1080.
There are two common procedures for generating producer cells. In one, the sequences encoding retroviral Gag, Pol and Env proteins are introduced into the cell and stably integrated into the cell genome; a stable cell line is produced which is referred to as the packaging cell line. As used herein, the term “stably integrated” means that the foreign genes become integrated into the cell's genome. The packaging cell line produces the proteins required for packaging retroviral RNA but it cannot bring about encapsidation due to the lack of a psi region.
However, when a vector genome (having a psi region) is introduced into the packaging cell line, the helper proteins can package the psi-positive recombinant vector RNA to produce the recombinant virus stock. This can be used to transduce recipient cells. The recombinant virus whose genome lacks all genes required to make viral proteins can infect only once and cannot propagate. Hence, a nucleic acid sequence can be introduced into a host cell genome without the generation of potentially harmful retrovirus.
The second approach is to introduce the three different DNA sequences that are required to produce a retroviral vector particle (i.e. the env coding sequences, the gag-pol coding sequence and the defective retroviral genome containing one or more NOIs) into the cell at the same time by transient transfection and the procedure is referred to as transient triple transfection. WO 94/29438 describes the production of producer cells in vitro using this multiple DNA transient transfection method. WO 97/27310 describes a set of DNA sequences for creating retroviral producer cells either in vivo or in vitro for re-implantation.
The components of the viral system which are required to complement the vector genome may be present on one or more “producer plasmids” for transfecting into cells.
The present invention provides a packaging cell comprising a nucleic acid construct according to the invention, and which cell expresses at least one detectable marker which is a cell surface protein comprising an extracellular domain and a membrane targeting domain.
In one embodiment, the packaging cell comprises:
The packaging cell may further comprise:
The invention further provides a producer cell which comprises:
The nucleic acid construct(s) may be stably integrated into the cell genome.
The packaging or producer cell may be a HEK293, HEK293-T, TE671 or HT1080, 3T3, or K562 cell.
In one aspect the present invention provides a method for making a packaging cell or producer cell of the invention which comprises the step of introducing one or more nucleic acid construct(s) of the invention, one or more plasmid(s) of the invention, a kit of nucleic acid constructs of the invention, or a kit of plasmids of the invention, into a cell.
The nucleic acid construct may be introduced into a cell by a variety of methods which are known in the art, for example using standard transfection methods such as electroporation or lipofection.
The nucleic acid constructs may be introduced into the cell during the same step of the method (e.g. transfected at the same time) or at different steps in the method (e.g. transfected at different times).
In another aspect the present invention provides a method for selecting a packaging or producer cell of the invention by selecting for expression of the or each detectable marker encoded by the or each nucleic acid construct.
The method may comprise the following steps:
Step (a) may further comprise introducing into the plurality of cells:
Step (a) may further comprise introducing into the plurality of cells:
Step (a) may also comprise introducing a nucleic acid sequence encoding piggyBAC transposase.
In another aspect, the present invention provides a method for selecting a packaging cell or producer cell according to the invention by selecting for expression of the or each detectable marker encoded by the or each nucleic acid construct.
In one embodiment, the screening is performed by fluorescence-activated cell sorting (FACS).
For example, by sorting vector, gagpol, env and rev expressing cells using flow cytometry, it is possible to monitor the expression levels of the different retroviral components and select cells with optimal ratios of expression of each component for more productive retroviral production.
The cell may express a plurality of detectable markers and may be selected using multi-parameter flow cytometry.
The method may comprise the following steps:
Step (a) may further comprise introducing into the plurality of cells:
Step (a) may further comprise introducing into the plurality of cells:
Step (a) may also comprise introducing a nucleic acid sequence encoding piggyBAC transposase.
In another aspect the invention provides a method for producing a retroviral vector which comprises the steps of culturing a producer cell of the present invention and isolating the retroviral vector.
The producer cell may be cultured using standard culture conditions suitable for the particular cell type used to make the producer cell. During culture, retrovirus vector particles are released into the culture supernatant. The retrovirus vector may be isolated using standard techniques, for example, ultracentrifugation, ultrafiltration or affinity purification.
The retroviral vector may be a lentiviral vector.
Definitions of terms appear throughout the specification. Before the exemplary embodiments are described in more detail, it is to be understood that this disclosure is not limited to particular embodiments described, and as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present disclosure will be limited only by the appended claims.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Singleton, et al., DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY, 20 ED., John Wley and Sons, New York (1994), and Hale & Marham, THE HARPER COLLINS DICTIONARY OF BIOLOGY, Harper Perennial, NY (1991) provide one of skill with a general dictionary of many of the terms used in this disclosure.
This disclosure is not limited by the exemplary methods and materials disclosed herein, and any methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments of this disclosure. Numeric ranges are inclusive of the numbers defining the range.
Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limits of that range is also specifically disclosed. Each smaller range between any stated value or intervening value in a stated range and any other stated or intervening value in that stated range is encompassed within this disclosure. The upper and lower limits of these smaller ranges may independently be included or excluded in the range, and each range where either, neither or both limits are included in the smaller ranges is also encompassed within this disclosure, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in this disclosure.
It must be noted that as used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “an enzyme” includes a plurality of such candidate agents and equivalents thereof known to those skilled in the art, and so forth.
The terms “comprising”, “comprises” and “comprised of” as used herein are synonymous with “including”, “includes” or “containing”, “contains”, and are inclusive or open-ended and do not exclude additional, non-recited members, elements or method steps. The terms “comprising”, “comprises” and “comprised of” also include the term “consisting of”.
The invention will now be further described by way of Examples, which are meant to serve to assist one of ordinary skill in the art in carrying out the invention and are not intended in any way to limit the scope of the invention.
Possible compact surface marker genes are detailed in
293T cells were co-transfected with a piggyBAC transposase expression plasmid and a lentiviral rev expression plasmid which also co-expressed V5-8 marker gene. Some 293T cells were also transfected with just the rev expression plasmid alone as a control. These latter cells allow determination of the contribution to expression from transient transfection as opposed to permanent insertion. At 10 days, both populations of cells as well as non-transfected controls were stained with a fluorescent antibody which recognizes the V5 epitope tag. These cells were then analysed by flow-cytometery. A clear population of V5-8 expression cells were seen in the transposase co-transfected cells, while expression in the rev expression alone 293T cells was lower (
Different lentiviral elements require different means to optimally co-express a marker gene. Lentiviral gagpol was tagged in two different ways: HA-8 tag marker gene was co-expressed with a FMD-2A like peptide, or with an IRES sequence (
The helper cassettes must supply rev, RD-PRO and gagpol. Suggested optimal cassettes are shown in
An expression cassette was generated where lentiviral gagpol could be transposed (the cassette was flanked by piggyBAC terminal repeats—see
Stable transposed gagpol could result in vector production. Low expressing cells generated higher titer. This suggests that careful gating of cell populations possible during flow sorting may help to identify particularly productive cell populations or clones (
A lentiviral transfer vector was generated which was tagged with a surface-expressed epitope marker which was flanked by piggyBAC terminal repeats (see
Titration showed that viral vector was produced. Productivity did not correlate with high expression of marker gene (see
All publications mentioned in the above specification are herein incorporated by reference. Various modifications and variations of the described methods and system of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific preferred embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention which are obvious to those skilled in the art are intended to be within the scope of the following claims.
Number | Date | Country | Kind |
---|---|---|---|
1603372.2 | Feb 2016 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB2017/050481 | 2/24/2017 | WO | 00 |