The present invention is based on the discovery that large parts of the genome of nucleopolyhedrovirus (NPV)-alpha baculovirus clade Ia viruses can be deleted without deleterious effect on the usability of the virus comprising such genome in the infection of cells in cell culture. Accordingly, the present invention provides NPV-alpha baculovirus clade Ia genomes which are reduced in size in comparison to the respective native NPV-alpha baculovirus clade Ia genomes, such genomes comprising heterologous nucleotides, viruses comprising either of these genomes, cells infected with such viruses and methods for producing such viruses and cells.
The understanding of the cellular machinery has increased tremendously in recent years mainly due to astounding progress in ‘omics’ research (genomics, proteomics and glycomics) (Nie Y, et al (2009) Curr. Genomics 10:558-72). Comprehensive genomics datasets are now available for many organisms including human, and focus has now shifted to elucidating the cellular proteome in correlation with the live cellular functionality and morphology. One essential lesson learned is that proteins in eukaryotic cells typically do not work in isolation but coexist in large and highly diverse assemblies of ten or more interlocking subunits. These stably or transiently associated multiprotein assemblies additionally work together with separate proteins or multiprotein assemblies to carry out essential cellular processes including signaling, energy generation, and transport of food, water or waste.
A considerable number of accessory proteins typically accompany any individual multiprotein complex at various stages of its production, trafficking, active life and degradation. For example, chaperones are often critical for proper assembly of complexes, while other proteins are required for proper targeting and activation through post-translational modification. The activity of complexes is often fine-tuned by the incorporation of isoforms of individual subunits, for example to mediate tissue-specific functions. To fully understand biology, it is clear that new methods are needed to unlock the assembly, structure and mechanism of all of the complexes that exist in our cells. This is not only essential for basic research, but equally important for enabling novel approaches in the pharmaceutical and biotech industries to drive development of new and better drugs that more specifically modulate cellular functions. An imposing bottleneck that obstructs progress in these areas stems from the typically low abundance and high heterogeneity of protein complexes in their native cells. Apart from a handful of notable examples, most human multiprotein complexes remain virtually inaccessible to date.
Recombinant overexpression can provide a solution to this problem. However, until recently, the production challenge for eukaryotic (especially human) multiprotein complexes has not been systematically addressed. The provision of human multiprotein complexes in the quality and quantity required for mechanistic studies and drug design poses particular challenges due to the complexity of the machinery at work in our cells. Technical factors for heterologous protein production including protein yield, stoichiometric ratio between subunits, post-translational modifications, folding, and stability are all of critical importance, and ideally a highly flexible heterologous expression system should be available that can provide these functions for a wide range of protein complexes. An attractive solution could be mammalian expression systems, which naturally provide the required functions to accurately reflect what takes place in our cells, and heterologous expression in mammalian systems has become increasingly popular, especially for secreted proteins such as therapeutic antibodies (Nettleship J E, et al. (2010) J Struct. Biol. 172:55-65). However, mammalian systems often do not provide acceptable yields for intracellular proteins, and multiprotein expression technologies for mammalian cells are still in their infancy, albeit progress has been made recently, opening interesting options to depict with hitherto unattainable precision entire pathways in mammalian cells, for example for pharmacological screening studies (Kriz A, et al. (2010) Nat. Commun. 2010; 1:120).
An attractive alternative to mammalian systems is heterologous expression using recombinant baculoviruses to infect insect cell cultures. This method was pioneered three decades ago (Summers M D (2006) Adv. Virus Res. 68:3-73) and has become a method of choice for producing high levels of many eukaryotic proteins including a large number of proteins of pharmaceutical interest (Kost T A, et al. (2005) Nat. Biotechnol. 23:567-75 and Jarvis D L (2009) Methods Enzymol. 463:191-222). A significant advance over existing baculovirus expression vector systems (BEVS) came with the introduction of MultiBac, an advanced BEVS particularly tailored for producing eukaryotic multiprotein complexes for structural and functional studies (Bieniossek C et al. (2012) Trends Biochem. Sci. 37:49-57; Bieniossek C, et al. (2009) Nat Methods 6:447-50; Bieniossek C, et al. (2008) Curr. Protoc. Protein Sci. Chapter 5: Unit 5.20 and Fitzgerald D J, et al. (2006) Nat. Methods 3:1021-32). MultiBac consists of a baculovirus genome that has been engineered for optimized protein production by deleting protease and apoptosis activities (Barger I, et al. (2004) Nat. Biotechnol. 22:1583-7). In a subsequent improvement of the system, a new suite of transfer vectors was introduced to facilitate introduction of many heterologous genes into one recombinant MultiBac baculovirus by a method called Tandem Recombineering (TR), involving sequence-and-ligation-independent cloning (SLIC) and Cre-LoxP recombination (Trowitzsch S, et al. (2010) J. Struct. Biol. 172:45-5414; Vijayachandran L S, et al. (2011) J. Struct. Biol. 2011; 175:198-208). More recently, the design of transfer plasmids has been further refined, resulting in small, easy to handle plasmids containing only the functional DNA elements required for protein expression, expression cassette multiplication and plasmid concatenation by TR (Vijayachandran L S, et al. (2011) J. Struct. Biol. 175:198-208). Multigene transfer vectors created in this way are introduced into the MultiBac baculovirus genome by the Tn7 transposon, in E. coli strains modified for this purpose (Trowitzsch S, et al. (2010) supra).
As a step forward relative to previous systems, the original MultiBac system already provided the option to integrate accessory functionalities that may be required for proper functioning of a multiprotein complex, by means of a second entry site engineered into the virus genome that is independent of, and distal to the main site of integration that relies on the Tn7 transposition. This feature has been exploited to integrate additional functional modules into the viral genome, including post translational modification enzymes, and fluorescent proteins that allow easy monitoring of virus performance and protein production following transfection and during virus amplification (Vijayachandran L S, et al. (2011) and Fitzgerald D J, et al. (2007) Structure 15:275-9). More recently, this approach has been used to create SweetBac, allowing for the production of mammalian-like glycoproteins in insect cells (Palmberger D et al. (2012) PloS One 7:e34226 and Palmberger D et al. (2012) Bioengineered 2012; 4). MultiBac is now in use at more than 600 laboratories world-wide, in academia and industry, and a broad range of multiprotein complexes have been produced in high quality and quantity for diverse applications by using the MultiBac system (Trowitzsch S et al. (2012) supra; Nettleship J E et al. (2010) supra; Kriz A et al. (2010) supra; Summers M D (2005) supra; Jarvis D L (2009) Methods Enzymol. 463:191-222 and Bieniossek C (2012) Trends Biochem. Sci. 37:49-57).
Currently, two approaches for integrating heterologous expression cassettes into the baculovirus genome dominate the field. One of these approaches requires the presence of the baculoviral genome as a bacterial artificial chromosome (BAC) in E. coli cells, together with Tn7 transposase activity present in these same cells which recombine transformed transfer plasmids into a Tn7 attachment site on the BAC. Invitrogen's Bac-to-Bac system and also the more advanced MultiBac system both utilize this approach. The recombinant, composite baculovirus DNA is then purified from these E. coli cells by alkaline lysis, and used to transfect insect cells. In contrast, the original method of choice to integrate heterologous expression cassettes into the baculovirus genome relied on homologous recombination mediated by regions in the transfer plasmid that were homologous to two genes on the baculovirus genome (Orf1629 and lef2/603) that flank the baculoviral polh locus which had been inactivated. This method is still offered by a large number of commercial providers (Novagen BacVector series, Pharmingen BaculoGold, Abvector, others). By this method, homologous recombination occurs in insect cells following transfection the baculovirus genomic DNA together with the transfer vector. The efficiency of recombination is increased by linearization of the baculovirus genome, but still remains a less efficient method to rapidly generate recombinant baculovirus than transforming Tn7-produced composite BACs. A further improvement on the homologous recombination in insect cell method came by truncation of the essential Orf1629 gene on the baculovirus genome which is then repaired by co-transfecting complete Orf1629-containing transfer vectors (FlashBac system, Oxford Expresion Technologies, UK).
Currently, BEVS applications including MultiBac rely on a large baculovirus genome (130 kb) derived from wild-type Autographa californica multicapsid nuclear polyhedrosis virus (AcMNPV). This genome has been intensively researched for many years. Genes that are essential for propagation in cell culture and genes which are detrimental for foreign protein production were delineated by several research groups (Harrison R L t al. (2003) J. Gen. Virol. 2003; 84:1827-42; Pijlman G P et al. (2001) Virology 283:132-8; Pijlman G P et al. (2002) J. Virol. 76:5605-11; Pijlman G P t al. (2003) J. Gen. Virol. 84:2041-9; Pijlman G P et al. (2006), J. Biotechnol. 123:13-21; Pijlman G P at al. (2003) J. Gen. Virol. 2003; 84:2669-78 and Pijlman G P t al. (2003) J. Invertebr. Pathol. 84:214-9).
The inherent DNA instability of the currently used baculovirus genome poses a problem, in particular at expression scales relevant for pharmaceutical production. Simply speaking, as the virus replicates during expression scale up, it progressively suffers from deletion of bits and pieces of its genome, preferentially in the highly expressed, (non-essential) heterologous protein expression cassette, as was shown already for laboratory scale production (Fitzgerald D J (2006) Nat. Methods 3:1021-32; Pijlman G P (2001) Virology 283:132-8; Pijlman G P et al. (2002), J. Virol. 76:5605-11). This is exacerbated for the viruses of the BAC/Tn7 type by the fact that the insertion site which is targeted by the Tn7 transposon is actually a mutational hotspot (Carstens E B t al. (1987) J. Gen. Virol. 68:901-5; Roelvink P W et al. (1992) J. Gee. Virol. 73 (Pt 6):1481-9).
Accordingly, there is a need in the art to provide expression systems which allow efficient protein expression, accommodate large heterologous nucleotide inserts, e.g. to express multiple proteins and which are less prone to rearrangement of the genome, i.e. with improved genomic stability.
In a first aspect the present invention relates to a nucleopolyhedrovirus (NPV)-alpha baculovirus clade Ia genome, wherein the number of base pairs is reduced in comparison to a native NPV-alpha baculovirus clade Ia genome by more than 18%, preferably an Autographa californica multicapsid nucleopolyhedrovirus (AcMNPV) genome, wherein the number of base pairs of the genome is reduced in comparison to a native BmNPV genome by at least at least 25.7% or a Bombyx mori nucleopolyhedrovirus (BmNPV) genome, wherein the number of base pairs of the genome is reduced in comparison to a native BmNPV genome by at least 18.31% and which in each case assembles into an infectious baculovirus.
In a second aspect the present invention relates to a NPV alpha baculovirus clade Ia genome according to the first aspect further comprising a nucleotide sequence heterologous to the NPV alpha baculovirus clade Ia genome.
In a third aspect the present invention relates to an infectious NPV alpha baculovirus clade Ia virus comprising a genome according to the first or second asp t of the invention.
In a fourth aspect the present invention relates to a cell infected with a virus according to the third aspect of the invention.
In a fifth aspect the present invention relates to a method for producing an NPV alpha baculovirus clade Ia genome according to the first or second aspect of the invention comprising the step of chemically synthesizing all or part of the genome.
In a sixth aspect the present invention relates to a method for producing an NPV alpha baculovirus clade Ia virus by introducing a genome according to the first or second aspect of the invention or producible according to the method of the fifth aspect into a cell.
Before the present invention is described in detail below, it is to be understood that this invention is not limited to the particular methodology, protocols and reagents described herein as these may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention which will be limited only by the appended claims. Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art.
In the following, the elements of the present invention will be described. These elements are listed with specific embodiments, however, it should be understood that they may be combined in any manner and in any number to create additional embodiments. The variously described examples and preferred embodiments should not be construed to limit the present invention to only the explicitly described embodiments. This description should be understood to support and encompass embodiments which combine the explicitly described embodiments with any number of the disclosed and/or preferred elements. Furthermore, any permutations and combinations of all described elements in this application should be considered disclosed by the description of the present application unless the context indicates otherwise.
Several documents are cited throughout the text of this specification. Each of the documents cited herein (including all patents, patent applications, scientific publications, manufacturer's specifications, instructions, etc.), whether supra or infra, are hereby incorporated by reference in their entirety. Nothing herein is to be construed as an admission that the invention is not entitled to antedate such disclosure by virtue of prior invention.
To practice the present invention, unless otherwise indicated, conventional methods of chemistry, biochemistry, and recombinant DNA techniques are employed which are explained in the literature in the field (cf., e.g., Molecular Cloning: A Laboratory Manual, 2th Edition, J. Sambrook et al. eds., Cold Spring Harbor Laboratory Press, Cold Spring Harbor 1989).
Throughout this specification and the claims which follow, unless the context requires otherwise, the word “comprise”, and variations such as “comprises” and “comprising”, will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps. As used in this specification and the appended claims, the singular forms “a”, “an”, and “the” include plural referents, unless the content clearly dictates otherwise.
The terms “polynucleotide” and “nucleic acid” are used interchangeably herein and are understood as a polymeric or oligomeric macromolecule made from nucleotide monomers.
Nucleotide monomers are composed of a nucleobase, a five-carbon sugar (such as but not limited to ribose or 2′-deoxyribose), and one to three phosphate groups. Typically, a polynucleotide is formed through phosphodiester bonds between the individual nucleotide monomers. In the context of the present invention referred to nucleic acid molecules include but are not limited to ribonucleic acid (RNA), deoxyribonucleic acid (DNA), and mixtures thereof such as e.g. RNA-DNA hybrids. The nucleic acids, can e.g. be synthesized chemically, e.g. in accordance with the phosphotriester method (see, for example, Uhlmann, E. & Peyman, A. (1990) Chemical Reviews, 90, 543-584). “Aptamers” are nucleic acids which bind with high affinity to a polypeptide. Aptamers can be isolated by selection methods such as SELEmir146-a (see e.g. Jayasena (1999) Clin. Chem., 45, 1628-50; Klug and Famulok (1994) M. Mol. Biol. Rep., 20, 97-107; U.S. Pat. No. 5,582,981) from a large pool of different single-stranded RNA molecules. Aptamers can also be synthesized and selected in their mirror-image form, for example as the L-ribonucleotide (Nolte et al. (1996) Nat. Biotechnol., 14, 1116-9; Klussmann et al. (1996) Nat. Biotechnol., 14, 1112-5). Forms which have been isolated in this way enjoy the advantage that they are not degraded by naturally occurring ribonucleases and, therefore, possess greater stability.
The terms “protein” and “polypeptide” are used interchangeably herein and refer to any peptide-bond-linked chain of amino acids, regardless of length or post-translational modification. Proteins usable in the present invention (including protein derivatives, protein variants, protein fragments, protein segments, protein epitops and protein domains) can be further modified by chemical modification. This means such a chemically modified polypeptide comprises other chemical groups than the 20 naturally occurring amino acids. Examples of such other chemical groups include without limitation glycosylated amino acids and phosphorylated amino acids. Chemical modifications of a polypeptide may provide advantageous properties as compared to the parent polypeptide, e.g. one or more of enhanced stability, increased biological half-life, or increased water solubility.
The term “sequence identity” is used throughout the specification with regard to polypeptide and polynucleotide sequence comparisons. In case where two sequences are compared and the reference sequence is not specified in comparison to which the sequence identity percentage is to be calculated, the sequence identity is to be calculated with reference to the longer of the two sequences to be compared, if not specifically indicated otherwise. If the reference sequence is indicated, the sequence identity is determined on the basis of the full length of the reference sequence indicated by SEQ ID, if not specifically indicated otherwise. For example, a polypeptide sequence consisting of 200 amino acids compared to a reference 300 amino acid long polypeptide sequence may exhibit a maximum percentage of sequence identity of 66.6% (200/300) while a sequence with a length of 150 amino acids may exhibit a maximum percentage of sequence identity of 50% (150/300). If 15 out of those 150 amino acids are different from the respective amino acids of the 300 amino acid long reference sequence, the level of sequence identity decreases to 45%. The similarity of nucleotide and amino acid sequences, i.e. the percentage of sequence identity, can be determined via sequence alignments. Such alignments can be carried out with several art-known algorithms, preferably with the mathematical algorithm of Karlin and Altschul (Karlin & Altschul (1993) Proc. Natl. Acad. Sci. USA 90: 5873-5877), with hmmalign (HMMER package, http://hmmer.wustl.edu/) or with the CLUSTAL algorithm (Thompson, J. D., Higgins, D. G. & Gibson, T. J. (1994) Nucleic Acids Res. 22, 4673-80) available e.g. on http://www.ebi.ac.uk/Tools/clustalw/or on httpJ/www.ebi.ac.uk/Tools/clustalw2/index.html or on http://npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_clustalw.htmi. Preferred parameters used are the default parameters as they are set on http://www.cbi.ac.uktTools/clustalw/or http://www.ebi.ac.ukfTools/clustalw2/index.html. The grade of sequence identity (sequence matching) may be calculated using e.g. BLAST, BLAT or BlastZ (or BlastX). A similar algorithm is incorporated into the BLASTN and BLASTP programs of Altschul et al. (1990) J. Mol. Biol. 215: 403-410. BLAST polynucleotide searches are performed with the BLASTN program, score=100, word length=12. BLAST protein searches are performed with the BLASTP program, score=50, word length=3. To obtain gapped alignments for comparative purposes, Gapped BLAST is utilized as described in Altschul et al. (1997) Nucleic Acids Res. 25: 3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs are used. Sequence matching analysis may be supplemented by established homology mapping techniques like Shuffle-LAGAN (Brudno M., Bioinformatics 2003b, 19 Suppl 1:154-162) or Markov random fields. When percentages of sequence identity are referred to in the present application, these percentages are calculated in relation to the full length of the longer sequence, if not specifically indicated otherwise. “Hybridization” can also be used as a measure of sequence identity or homology between two nucleic acid sequences. A nucleic acid sequence encoding F, N, or M2-1, or a portion of any of these can be used as a hybridization probe according to standard hybridization techniques. Hybridization conditions are known to those skilled in the art and can be found, for example, in Current Protocols in Molecular Biology, John Wiley & Sons, N. Y., 6.3.1-6.3.6, 1991. “Moderate hybridization conditions” are defined as equivalent to hybridization in 2× sodium chloride/sodium citrate (SSC) at 30° C., followed by a wash in 1×SSC, 0.1% SDS at 50° C. “Highly stringent conditions” are defined as equivalent to hybridization in 6× sodium chloride/sodium citrate (SSC) at 45° C., followed by a wash in 0.2×SSC, 0.1% SDS at 65° C.
As used herein, the terms “resistance gene” refers to a gene conferring resistance to a toxin and/or an antibiotic”. Accordingly, such a gene may also be referred to as “toxin-resistance gene” or “antibiotica-resistance gene”. The functional inactivation of a toxin or antibiotic may be achieved by expressing a marker gene which carries mutation(s) rendering the respective gene product insensitive to a toxin or antibiotic. Alternatively, the functional inactivation of a toxin or antibiotic may be achieved by expressing a marker gene which inhibits the toxin or antibiotic e.g. by interacting or binding to it. The functional inactivation of a toxin or antibiotic may also be achieved by expressing a marker gene which counteracts the effects of the toxin or antibiotic.
Antibiotic compounds include but are not limited to tetracyclines, sulfonamides, penicillins, cephalosporins, ansamycins, carbapenems, macrolides, quinolones, aminonucleoside, aminoglycosides, peptides, glycopeptides, and lipopeptides. Exemplified, hygromycin B, neomycin, kanamycin, gentamicin, and G418 (also known as Geneticin) are aminoglycoside antibiotics which are similar in structure. In general, neomycin and kanamycin are used for prokaryotes, whilst G418 is needed for eukaryotes. Kanamycin is isolated from Streptomyces kanamyceticus and interacts with the 30S subunit of prokaryotic ribosomes thereby inducing mistranslation and indirectly inhibiting translocation during protein synthesis. Neomycin is produced naturally by the bacterium Streptomyces fradiae whilst G418 is produced by Micromonospora rhodoranges. Neomycin blocks protein biosynthesis by binding to the 30S subunit of the 70S-ribosome. G418 blocks polypeptide synthesis by binding to the 80S-ribosome and thereby inhibiting the elongation step in both prokaryotic and cukaryotic cells. Resistance to neomycin and G418 is conferred by the Neor gene from transposon Tn5 encoding an aminoglycoside 3′-phosphotransferase, APH 3′ II which phosphorylates neomycin or geneticin on a hydroxygroup and thereby, inhibits its function. Hygromycin B is produced by the bacterium Streptomyces hygroscopicus and kills bacteria, fungi and higher eukaryotic cells by inhibiting protein synthesis. The hygromycin resistance gene Hph encodes the hygromycin B phosphotransferase which inactivates hygromycin B through phosphorylation. Blasticidin is an antibiotic that is produced by Streptomyces griseochromogenes and prevents the growth of both eukaryotic and prokaryotic cells by inhibiting peptide bond formation by the ribosome. The three genes bis from Streptoverticillum sp., bsr from Bacillus cereus, and BSD from Aspergillus terreus, confer resistance to blasticidin by enabling the cells continue protein production even in the presence of blasticidin. Puromycin is an aminonucleoside antibiotic, derived from the Streptomyces alboniger that causes premature chain termination during translation taking place in the ribosome. The expression of the report gene Puror encodes a puromycin N-acetyl-transferase which conveys resistance to the antibiotic puromycin. In the context of the present invention genes conveying antibiotic-resistance are particularly suitable, including but not limited to genes conveying resistance to neomycin, puromycin, blasticidin and hygromycin.
To provide improved expression systems the present inventors decided to rewire the entire baculovirus genome to maximize its performance. The aim was the redesign and restructuring of the baculovirus genome to provide among other advantages enhanced DNA stability and efficient protein production. The genes and DNA elements which are dispensable under laboratory culture conditions and unnecessary for efficient budded viral production, which is the major virus type used for protein expression in cell culture (Bieniossek C et al. (2008) supra and Fitzgerald D J et al (2006) Nat. Methods 3:1021-32) have been determined by the present inventors. This allowed engineering an improved baculovirus genome by removing non-essential genes and regions prone to mutation. Such engineered viruses provide among other advantages improved virus DNA stability, increased ability of accommodating very large foreign gene insertions, without compromising the ease of handling and superior protein production properties.
The present inventors surprisingly found that the size of the genome of a NPV-alpha baculovirus clade Ia genome can be significantly reduced without affecting its ability to infect and propagate in cells. Viruses comprising such an optimized genome provide enhanced DNA stability and more efficient protein production. Thus, in a first aspect this invention provides a genome of a nucleopolyhedrovirus (NPV)-alpha baculovirus clade Ia virus, wherein the number of base pairs is reduced in comparison to a native NPV-alpha baculovirus clade Ia genome by more than 18% and which assembles into an infectious baculovirus, e.g. under the conditions outlined below, preferably a genome of an Autographa californica multicapsid nucleopolyhedrovirus (AcMNPV), wherein the number of base pairs of the genome is reduced in comparison to a native AcMNPV genome by at least 25.7% or a Bombyx mori nucleopolyhedrovirus (BmNPV) genome, wherein the number of base pairs of the genome is reduced in comparison to a native BmNPV genome by at least 18.31% and which in each case assembles into an infectious baculovirus.
Preferably, the genome assembles into an infectious baculovirus capable of expressing heterologous proteins, once it infects a permissible cell.
The genomes of baculoviruses belonging to the group of NPV alpha baculovirus clade Ia are highly conserved. Thus, in the following the coding segments (CDS), 5′ untranslated regions (UTRs), spacers and/or repeat sequences (br) to be deleted or maintained are always indicated with reference to the genome of AcMNPV or BmNPV. If nucleotide positions are indicated these are either with reference to the genome of AcMNPV according to SEQ ID NO: 1 or of BmNPV according to SEQ ID NO: 4. Based on this information the skilled person can determine without undue burden the corresponding CDS, UTR, spacer or hr sequence of another NPV alpha baculovirus clade Ia virus by using standard alignment tools as set out above. The result of such an approach is exemplary described for the ptp gene encoding a protein tyrosine phosphatase of AcMNPV (UniProt P24656, SEQ ID NO: 13). When the amino acid sequence of SEQ ID NO: 13 is used in a PBLAST search of non-redundant protein sequences the homologs of the protein tyrosine phosphatase of AcMNPV from other NPV alpha baculovirus clade Ia viruses are identified, e.g. a homolog from Rachiplusia ou multiple nucleopolyhedrovirus (RoMNPV) (Accession NO: NP—702993; SEQ ID NO: 14), a homolog from BmNPV (Accession No: AAG31657; SEQ ID NO: 15), a homolog from Maruca vitrata multiple nuclopolyhcdrovirus (MaviMNPV) (Accession No.: YP—950853; SEQ ID NO: 16), a homolog from Choristoneura fumiferana defective nucleopolyhedroviru (CfDefMNPV) (Accession No.: NP—932617; SEQ ID NO: 17), a homolog of Anticarsia gemmatalis multiple nucleopolyhedrovirus (AgMNPV) (Accession No.: YP—803403; SEQ ID NO: 18), a homolog from Epiphyas postvittana nucleopolyhedrovirus (EppoNPV) (Accession No: NP—203176; SEQ ID NO: 19), a homolog from Antheraea pernyi nucleopolyhedrovirus (AnpeNPV) (Accession No.: YP—611104; SEQ ID NO: 20), a homolog from Choristoneura Fumiferana Multinucleocapsid nucleopolyhedrovirus (CfMNPV) (Accession No.: NP—848321; SEQ ID NO: 21), a homolog from Orgyia Pseudotsugata Multicapsid nucleopolyhedrovirus (OpMNPV) (Accession No.: NP—046166; SEQ ID NO: 22), and a bomolog from Hyphantria cunea nucleopolyhedrovirus (HycuNPV) (Accession No: YP—473330; SEQ ID NO: 23). The alignment of these sequences using CLUSTALW2 is shown in
Preferably the size of the NPV alpha baculovirus clade Ia genome is reduced by at least 20%, more preferably by at least 22%, at least 24%, at least 26%, at least 28% at least 30% at least 32%, at least 34%, at least 36% at least 38%, or at least 40% % by following the teaching in this application, which genes to maintain and to delete with respect to the native genome of the respective virus to arrive at the genome with a reduced size of the present invention.
Preferably the size of genome of AcMNPV is reduced by at least 26%, at least 28% at least 30% at least 32%, at least 34%, at least 36% at least 38%, at least 40%, Preferably the size of genome of BmNPV is reduced by at least 20%, more preferably by at least 22%, at least 24%, at least 26%, at least 28% at least 30% at least 32%, at least 34%, at least 36% at least 38%, or at least 40%.
Preferably the size of genome of RoMNPV is reduced by at least 20%, more preferably by at least 22%, at least 24%, at least 26%, at least 28% at least 30% at least 32%, at least 34%, at least 36% at least 38%, or at least 40%.
Preferably the size of genome of MaviMNPV is reduced by at least 20&, more preferably by at least 22%, at least 24%, at least 26%, at least 28% at least 30% at least 32%, at least 34%, at least 36% at least 38%, or at least 40%.
Preferably the size of genome of CfDefMNPV is reduced by at least 20%, more preferably by at least 22%, at least 24%, at least 26%, at least 28% at least 30% at least 32%, at least 34%, at least 36% at least 38%, or at least 40%.
Preferably the size of genome of AgMNPV is reduced by at least 20%, more preferably by at least 22%, at least 24%, at least 26%, at least 28% at least 30% at least 32%, at least 34%, at least 36% at least 38%, or at least 40%.
Preferably the size of genome of EppoNPV is reduced by at least 20%, more preferably by at least 22%, at least 24%, at least 26%, at least 28% at least 30% at least 32%, at least 34%, at least 36% at least 38%, or at least 40%.
Preferably the size of genome of AnpeNPV is reduced by at least 20%, more preferably by at least 22%, at least 24%, at least 26%, at least 28% at least 30% at least 32%, at least 34%, at least 36% at least 38%, or at least 40%.
Preferably the size of genome of CfMNPV is reduced by at least 20%, more preferably by at least 22%, at least 24%, at least 26%, at least 28% at least 30% at least 32%, at least 34%, at least 36% at least 38%, or at least 40%.
Preferably the size of genome of OpMNPV is reduced by at least 20% o, more preferably by at least 22%, at least 24%, at least 26%, at least 28%, at least 30% at least 32%, at least 34%, at least 36% at least 38%, or at least 40%.
Preferably the size of genome of HycuNPV is reduced by at least 20%, more preferably by at least 22%, at least 24%, at least 26%, at least 28% at least 30% at least 32%, at least 34%, at least 36% at least 38%, or at least 40%.
Preferably the size of genome of Plutella xylostella nucleopolyhedrovirus (PlxyNPV) is reduced by at least 20%, more preferably by at least 22%, at least 24%, at least 26%, at least 28% at least 30%, at least 32%, at least 34%, at least 36% at least 38%, or at least 40%.
In above and below part of the description the reduction in size is always expressed as a percentage of the reduction in comparison to a native baculovirus genome. Alternatively the reduction in size may also be indicated in absolute values, i.e. reduction in size by X base pairs. The length of the CDS, 5′-UTR, spacer and hr segments, respectively, in base pairs are indicated in detail below in Tables 1 to 12 for AcMNPV and BmNPV (see column labeled “bp” in each Table). The sum of the bp of the respectively indicated elements will determine the number of base pairs that the baculovirus genome is shortened with respect to the native baculovirus genome, preferably those according to SEQ ID NO: 1 or 15. Using the teaching on how to identify homologous regions in other Clade Ia viruses by sequence alignment the skilled person can also determine the length of the homologous segments in other Clade Ia viruses and add the number of base pairs of the homologous sequences to be deleted to arrive at the value for absolute reduction of the length of the native genome of that particular Clade Ia baculovirus.
The present inventors have found that the CDS/proteins indicated below can be deleted without detrimeantal effect on the respective NPV alpha baculovirus clade Ia virus (all CDS and proteins are with reference to the genome of AcMNPV and BmNPV, respectively).
The present inventors found that the CDS indicated below in Tables 1a can be deleted without detrimental effect on AcMNPV. Further, the CDS indicated below in Tables 1b can be deleted without detrimental effect on BmNPV. These CDS are considered to belong to a group that is referred to as Type I of the respective NPV alpha baculovirus clade Ia virus (all CDS and proteins are with reference to AcMNPV). Accordingly, the deletion of these CDS (and preferably also the respective 5′-UTR, spacer and/or hr regions) are referred to as Type I deletions:
The total length of the CDS encoding these portions is approximately 34,362 base pairs in AcMNPV according to SEQ ID NO: 1. Thus, the native AcMNPV genome according to SEQ ID NO: 1 of a length of 133,894 base pairs is preferably shortened by deletion of the CDS encoding these proteins and, thus, the AcMNPV genome of the invention is preferably at least 34,363 base pairs shorter than the native AcMNPV genome. The total length of the CDS encoding these proteins is approximately 24,531 base pairs in BmNPV. Thus, the native BmNPV genome having a length of 128,413 base pairs is preferably shortened by deletion of the CDS encoding these proteins and, thus, the BmNPV genome of the invention is preferably at least 25,121 base pairs shorter than the native BmNPV genome.
Accordingly, the AcMNPV genome of the invention is at least 25.7% shorter than the native AcMNPV genome. Accordingly, the BmNPV genome of the invention is at least 18.31% shorter than the native BmNPV genome.
For MvMNPV, the genome of the invention is preferably at least 21,688 bases pairs shorter than the native MvMNPV genome. For BmaMNPV, the genome of the invention is preferably at least 20,235 bases pairs shorter than the native BmaMNPV genome. For BmNPV, the genome of the invention is preferably at least 24,531 bases pairs shorter than the native BmNPV genome. For PlxyMNPV, the genome of the invention is preferably at least 26,724 bases pairs shorter than the native PlxyMNPV genome. For RoMNPV, the genome of the invention is preferably at least 26,379 bases pairs shorter than the native RoMNPV genome.
It is more preferred that one or more of the 5′-UTRs of these CDS are also deleted. In AcMNPV the 5′-UTRs of these CDS have a length of 2,113 base pairs. In BmNPV the 5′-UTRs of these CDS have a length of 1,614 base pairs. The following Tables 2a and 2b indicate the position of the 5′UTRs of the CDS deleted in preferred genomes of the invention. The 5′-UTR precedes the respectively indicated CDS Start codon by the indicated number of base pairs:
The deletion of one or more of these 5′-UTRs from the genome of a NPV alpha baculovirus clade Ia virus leads to a further reduction of the size of the genome in comparison to the native genome of up to 1.58%. Accordingly, in an even more preferred embodiment the size of the genome of the AcMNPV virus of the invention is reduced by at least 27.24%. Accordingly, in an even more preferred embodiment the size of the genome of the BmNPV virus of the invention is reduced by at least 19.56%.
Alternatively or additionally the spacers of the AcMNPV CDS may be deleted, which amount to an additional deletion of 3,000 bp. In BmNPV the spacers of these CDS have a length of 3,047 base pairs. With reference to the nucleotide sequence of AcMNPV the spacers that may be deleted are the following:
The deletion of these spacers from the genome of a NPV alpha baculovirus clade Ia virus leads to a further reduction of the size of the genome in comparison to the native genome. Specifically, the deletion of these spacers from the genome of a AcMNPV virus leads to a further reduction of the size of the genome in comparison to the native genome of 2.24%. Accordingly, in an even more preferred embodiment the size of the genome of the AcMNPV virus of the invention is reduced by at least 29.48%.
The deletion of these spacers from the genome of a BmNPV virus leads to a further reduction of the size of the genome in comparison to the native genome of 2.33%. Accordingly, in an even more preferred embodiment the size of the genome of the BmNPV virus of the invention is reduced by at least 20.36%.
It is noted that some of the spacers indicated above overlap with the 5′UTRs indicated in Table 2a or 2b. Thus, if the 5′-UTR is deleted for a given gene and the spacer is deleted additionally this means that only that part of the spacer that is not overlapping with a 5′-UTR is deleted. Conversely, if a spacer is deleted that partially overlaps with a 5′-UTR the overlapping part is also deleted.
It is preferred that the NPV alpha baculovirus clade Ia genome from which above indicated CDS, 5′-UTRs, spacers and/or hr regions are deleted is based on the genome of a baculovirus selected from the group consisting of AcMNPV, PlxyNPV, RoMNPV, BmNPV, MaviMNPV, CfDefMNPV, AgMNPV, EppoNPV, AnpeNPV, CfMNPV, OpMNPV, and HycuNPV.
Exemplary genomes of these baculoviruses are accessible at the NIH and EBI databank. The phrase “based on the genome” means that the native genome of the respectively indicated baculovirus is used as a reference point and that the genome of the invention has that nucleotide sequence sans the CDS and/or 5′UTRs and preferably also spacers of the genes odv-e66, p43, odv-no42 or odv-e56, ptp, bro, ctx, orf603, polyhedrin, egt, bv/odv-e26, ac18, pif-2, env-prot, iap-1, sod, fgf, vubi, gp37, ac69, iap-2, pnk/pn1, ac91, odv-e28 pif-4, pif-3, pif-1, pk-2, chiA, v-cath, pp34, 94K, p26, p10, p74, ac145, and ac150 and/or hr regions.
To determine the extent of the deletion of the genome of the invention a reference genome is used, which is referred to as “native NPV-alpha baculovirus clade Ia genome”. This term is used to designate the genome of a naturally occurring NPV alpha baculovirus clade Ia virus and includes all silent mutations within open reading frames that do not impair functionality of the DNA elements. Preferably, this term comprises NPV-alphabaculovirus clade I a/b genomic sequences that exhibit at least 90% sequence identity to the nucleotide sequence of naturally occurring NPV alpha baculovirus clade Ia genome, e.g. those accessible at the NIH or EBI databank. It is preferred that the nucleotide sequence of the naturally occurring NPV alpha baculovirus clade Ia genome is for: (i) AcMNPV as set out in SEQ ID NO: 1 (NC—001623) with a length of 133,894 base pairs, (ii) PlxyNPV as set out in SEQ ID NO: 2 (NC—008349) with a length of 133,417 base pairs, (iii) RoMNPV as set out in SEQ ID NO: 3 (NC—004323) with a length of 131,526 base pairs, (iv) BmNPV as set our in SEQ ID NO: 4 (NC—001962) with a length of 128,413 base pairs, (v) MaviMNPV as set out in SEQ ID NO: 5 (NC—008725.1) with a length of base pairs 111,953, (vi) CfDefMNPV as set out in SEQ ID NO: 6 (NC—005137.2) with a length of 131,160 base pairs, (vii) AgMNPV as set out in SEQ ID NO: 7 (NC—008520.1) with a length of 132,239 base pairs, (viii) EppoNPV as set out in SEQ ID NO: 8 (NC—003083.1) with a length of 118,584 base pairs, (ix) AnpeNPV as set out in SEQ ID NO: 9 (NC—008035.3) with a length of 126,629 base pairs, (x) CfMNPV as set out in SEQ ID NO: 10 (NC—004778.3) with a length of 129,593 base pairs, (xi) OpMNPV as set out in SEQ ID NO: 11 (NC—001875.2) with a length of 131,995 base pairs, and (xii) HycuNPV as set out in SEQ ID NO: 12 (NC—007767.1) with a length of 132,959 base pairs. Accordingly, the reference native NPV-alpha baculovirus clade Ia genome preferably has at least 90% sequence identity to one of the sequences selected from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12, more preferably at least 92% sequence identity to one of the sequences selected from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12, more preferably at least 94% sequence identity to one of the sequences selected from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12, more preferably at least 96% sequence identity to one of the sequences selected from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12, even more preferably at least 98% sequence identity to one of the sequences selected from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 and most preferably 100% sequence identity to one of the sequences selected from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12. The most preferred genome is that of AcMNPV according to SEQ ID NO: 1.
The present inventors have found that the CDS indicated in Table 4a and 4b can be deleted without detrimental effect on the respective NPV alpha baculovirus clade Ia virus (all CDS and proteins are with reference to AcMNPV). These CDS are considered to belong to a group that is referred to as Type IT. Accordingly, the deletion of these CDS (and preferably also the respective 5′-UTR, spacer and/or hr regions) are referred to as Type II deletions:
The total length of the CDS encoding these proteins is approximately 6,246 base pairs in AcMNPV according to SEQ ID NO: 1. Thus, the native AcMNPV genome may be shortened by deletion of one or more, preferably of all of the CDS encoding the proteins of Table 4a. If all these CDSs are deleted from the AcMNPV genome, the genome of the invention is preferably at least 6,246 base pairs shorter than the native AcMNPV genome. The respective shortening attributable to the deletion of one specific CDS can be derived from the column labelled “bp”. These CDS correspond to approximately 4.7% of the native genome of AcMNPV.
The native BmNPV genome may be shortened by deletion of one or more, preferably of all of the CDS encoding the proteins of Table 4b. If all these CDSs are deleted from the BmNPV genome, the genome of the invention is preferably at least 2,967 base pairs shorter than the native BmNPV genome. The respective shortening attributable to the deletion of one specific CDS can be derived from the column labelled “bp”. These CDS correspond to approximately 2.3% of the native genome of BmNPV.
It is more preferred that one or more of the 5′-UTRs of the CDS are also deleted. In AcMNPV the 5′-UTRs of these CDS have a length of 446 base pairs. In BmNPV the 5′-UTRs of these CDS have a length of 19 base pairs. The following Table 5a and 5b indicate the position of the 5′UTRs of the CDS deleted in preferred genomes of the invention. The 5′-UTR precedes the respectively indicated CDS Start codon by the indicated number of base pairs:
The deletion of one or more of these 5′-UTRs from the genome of a NPV alpha baculovirus clade Ia virus leads to a further reduction of the size of the genome in comparison to the native genome. The deletion of one or more of these 5′-UTRs from the genome of a AcMNPV virus leads to a further reduction of the size of the genome in comparison to the native genome of up to 0.35%. The deletion of one or more of these 5′-UTRs from the genome of a BmNPV virus leads to a further reduction of the size of the genome in comparison to the native genome of up to 0.02%.
Alternatively or additionally the spacers of the AcMNPV CDS may be deleted, which amount to an additional deletion of 2,543 bp. In BmNPV the spacers of these CDS have a length of 3,028 base pairs. With reference to the nucleotide sequence of AcMNPV the spacers that may be deleted are the following:
The deletion of these spacers from the genome of a NPV alpha baculovirus clade Ia virus leads to a further reduction of the size of the genome in comparison to the native genome. Accordingly, in an even more preferred embodiment the size of the genome of the AcMNPV virus of the invention described above is reduced by up to a further 1.89%. In further preferred embodiments, the size of the genome of the BmNPV virus of the invention is reduced up to a further 2.36%
Accordingly, in a preferred embodiment the NPV alpha baculovirus clade Ia genome of the present invention comprises Type I deletions in addition to the type II deletions.
The AcMNPV genome of the invention is preferably at least 30.23% shorter than the native AcMNPV genome (if only Type I and II CDS are deleted), more preferably 35.16% shorter (if Type I and II CDS and 5′-UTRs are deleted) and more preferably 37.05% shorter (if Type I and II CDS, 5′-UTR and spacers are deleted).
The BmNPV genome of the invention is preferably at least 21.62% shorter than the native BmNPV genome (if only Type I and II CDS are deleted), more preferably 22.07% shorter (if Type I and II CDS and 5′-UTRs are deleted) and more preferably 24.38% shorter (if Type I and II CDS, 5′-UTR and spacers are deleted).
The present inventors have found that the CDS indicated in Table 7a and 7b can be deleted without detrimental effect on the respective NPV alpha baculovirus clade Ia virus (all CDS and proteins are with reference to AcMNPV). These CDS are considered to belong to a group that is referred to as Type II. Accordingly, the deletion of these CDS (and preferably also the respective 5′-UTR, spacer and/or hr regions) are referred to as Type III deletions:
The total length of the CDS encoding these proteins is approximately 5,625 base pairs in AcMNPV according to SEQ ID NO: 1. Thus, the native AcMNPV genome may be shortened by deletion of one or more, preferably of all of the CDS encoding these proteins. If all these CDSs are deleted from the AcMNPV genome, the genome of the invention is preferably at least 5,625 base pairs shorter than the native AcMNPV genome. The respective shortening attributable to the deletion of one specific CDS can be derived from the column labelled “bp”. These CDS correspond to approximately 4.2% of the native genome of AcMNPV.
The total length of the CDS encoding these proteins is approximately 3,173 base pairs in BmNPV. Thus, the native BmNPV genome may be shortened by deletion of one or more, preferably of all of the CDS encoding these proteins. If all these CDSs are deleted from the BmNPV genome, the genome of the invention is preferably at least 3,173 base pairs shorter than the native BmNPV genome. The respective shortening attributable to the deletion of one specific CDS can be derived from the column labelled “bp”. These CDS correspond to approximately 2,47% of the native genome of BmNPV.
Accordingly, in a preferred embodiment the NPV alpha baculovirus clade Ia genome of the present invention comprises (i) Type II deletions, (ii) Type I deletions and Type III deletions, (iii) Type II and Type III deletions or (iv) Type I, II and III deletions.
The AcMNPV genome of the invention is preferably at least 25.66% shorter than the native AcMNPV genome (if only Type I CDS are deleted and Type II CDS, 5′-UTRs and spacers are retained), more preferably 27.24% shorter (if Type I CDS and 5′-UTRs are deleted and Type II CDS, 5′-UTRs and spacers are retained) and more preferably 34.14% shorter (if Type I and II CDS, 5′-UTR and spacers are deleted).
The BmNPV genome of the invention is preferably at least 18.3% shorter than the native BmNPV genome (if only Type I CDS are deleted and Type II CDS, 5′-UTRs and spacers are retained), more preferably 19.55% shorter (if Type I CDS and 5′-UTRs are deleted and Type IT CDS, 5′-UTRs and spacers are retained) and more preferably 24.20% shorter (if Type I and II CDS, 5′-UTR and spacers are deleted).
It is more preferred that the 5′-UTRs of these CDS are also deleted. In AcMNPV the 5′-UTRs of these CDS have a length of 351 base pairs. In BmNPV the 5′-UTRs of these CDS have a length of 7 base pairs. The following Tables 8a and 8b indicate the position of the 5′UTRs of the CDS deleted in preferred genomes of the invention. The 5′-UTR precedes the respectively indicated CDS Start codon by the indicated number of base pairs:
The deletion of one or more of these 5′-UTRs from the genome of a AcMNPV leads to a further reduction of the size of the genome in comparison to the native genome of up to 0.26%.
The deletion of one or more of these 5′-UTRs from the genome of a BmNPV leads to a further reduction of the size of the genome in comparison to the native genome of up to 0.01%. Alternatively or additionally the spacers of the CDS in AcMNPV may be deleted, which amount to an additional deletion of 2183 bp. In BmNPV the spacers of these CDS have a length of 3021 base pairs. With reference to the nucleotide sequence of AcMNPV the spacers that may be deleted are the following:
The deletion of these spacers from the genome of a AcMNPV virus leads to a further reduction of the size of the genome in comparison to the native genome of 1.63%. Accordingly, in an even more preferred embodiment the size of the genome of the AcMNPVvirus of the invention is reduced by up to a further 1.63%.
The deletion of these spacers from the genome of a BmNPV virus leads to a further reduction of the size of the genome in comparison to the native genome of 2.35%. Accordingly, in an even more preferred embodiment the size of the genome of the BmNPVvirus of the invention is reduced by up to a further 2.35%.
The present inventors have found that the CDS indicated in Table 10a and 10b can be deleted without detrimental effect on the respective NPV alpha baculovirus clade Ia virus (all CDS and proteins are with reference to AcMNPV). These CDS are considered to belong to a group that is referred to as Type IV. Accordingly, the deletion of these CDS (and preferably also the respective 5′-UTR, spacer and/or hr regions) are referred to as Type IV deletions:
The total length of the CDS encoding these proteins is approximately 3,027 base pairs in AcMNPV and approximately 1,230 base pairs in BmNPV. Thus, the native NPV alpha baculovirus clade Ia genome may be shortened by deletion of one or more, preferably of all of the CDS encoding these proteins.
If all these CDSs are deleted from the AcMNPV genome, the genome of the invention is preferably at least 3,027 base pairs shorter than the native AcMNPV genome. The respective shortening attributable to the deletion of one specific CDS can be derived from the column labelled “bp”. These CDS correspond to approximately 2.26% of the native genome of AcMNPV.
If all these CDSs are deleted from the BmNPV genome, the genome of the invention is preferably at least 823 base pairs shorter than the native BmNPV genome. The respective shortening attributable to the deletion of one specific CDS can be derived from the column labelled “bp”. These CDS correspond to approximately 0.64% of the native genome of BmMNPV.
It is more preferred that one or more of the 5′-UTRs of these CDS are also deleted. In AcMNPV the 5′-UTRs of these CDS have a length of 446 base pairs. In BmNPV the 5′-UTRs of these CDS have a length of 263 base pairs. The following Table 11a and 11b indicates the position of the 5′UTRs of the CDS deleted in the preferred AcMNPV and BmNPV genomes of the invention, respectively. The 5′-UTR precedes the respectively indicated CDS Start codon by the indicated number of base pairs:
The deletion of these 5′ UTRs from the genome of a AcMNPV virus leads to a further reduction of the size of the genome in comparison to the native genome of up to 0.25%.
The deletion of these 5′-UTRs from the genome of a BmNPV virus leads to a further reduction of the size of the genome in comparison to the native genome of up to 0.20%.
Alternatively or additionally the spacers of the CDS in AcMNPV may be deleted, which amount to an additional deletion of 1,847 bp. In BmNPV the spacers of these CDS have a length of 2,760 base pairs. With reference to the nucleotide sequence of AcMNPV the spacers that may be deleted are the following:
The deletion of these spacers from the genome of a NPV alpha baculovirus clade Ia virus leads to a further reduction of the size of the genome in comparison to the native genome. Accordingly, in an even more preferred embodiment the size of the genome of the AcMNPV virus of the invention is reduced by up by a further 1.38%. In further preferred embodiments the size of the genome of the BmNPV virus is reduced 2.15%
In a preferred embodiment the NPV alpha baculovirus clade Ia genome, preferably the AcMNPV or the BmNPV genome, of the invention further lacks all 5′-UTR and/or 3′-UTR of the genes of (i) Type I, (ii) Type I and Type II, (iii) Type I and Type III, (iv) Type I and Type IV, (v) Type II and Type III, (vi) Type II and Type IV, (vii) Type II and Type IV, (viii) Type I, Type II and Type I, (ix) Type I, Type II and Type IV, (x) Type I, Type III and Type IV, (xi) Type II, Type I and Type IV, or (xii) Type I, Type II, Type III and Type IV.
In a preferred embodiment the NPV alpha baculovirus clade Ia genome, preferably the AcMNPV or the BmNPV genome, of the invention further lacks the spacers 5′ and/or 3′ of the genes of (i) Type I, (ii) Type I and Type II, (iii) Type I and Type m, (iv) Type I and Type IV, (v) Type II and Type I, (vi) Type II and Type IV, (vii) Type III and Type IV, (viii) Type I, Type II and Type II, (ix) Type I, Type II and Type IV, (x) Type I, Type III and Type IV, (xi) Type II, Type I and Type IV, or (xii) Type I, Type II, Type III and Type IV.
In a preferred embodiment the NPV alpha baculovirus clade Ia genome, preferably the AcMNPV or the BmNPV genome, of the invention further lacks one or more of the heterologous repeat (HR) sequences.
The preferred NPV alpha baculovirus clade Ia genomes, preferrably the AcMNPV or the BmNPV genome, lack the following heterologous repeat sequences:
(i) for AcMNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides 1 to 445, 7,747 to 7,864, 26,293 to 26,961, 48,679 to 48,708, 70,468 to 71,133, 93,456 to 93,605, 97,396 to 97,881, 102,606 to 102,635, 117,479 to 117,987 and 133,883 to 133,894 of the genome sequence according to SEQ ID NO: 1,
(ii) for PlxyNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides 1 to 441, 7,725 to 7,842, 27,391 to 28,255, 49,952 to 49,981, 50,439 to 51,255, 72,580 to 73,046, 94,061 to 94,210, 98,016 to 98,409, 103,134 to 103,163, 117,943 to 118,456, and 134,406 to 133,417 of the genome sequence according to SEQ ID NO: 2,
(iii) for RoMNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides 1 to 326, 6,454 to 6,491, 24,805 to 25,141, 46,826 to 46,855, 68,628 to 69,279, 91,594 to 91,623, 95,394 to 95,750, 100,470 to 100,499, 115,321 to 115,714, and 131,515 to 131,526 of the genome sequence according to SEQ ID NO: 3,
(iv) for BmNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides, 22,497 to 23,200, 24,012 to 24,278, 29,485 to 29566, 43821 to 43857, 64802 to 65350, 65499 to 65573, 86431 to 86648, 89552 to 90142, 94426 to 94483, 106947 to 107561 and 123706 to 124297 of the genome sequence according to SEQ ID NO: 4,
(v) for MaviMNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides 20436 to 21162, 56264 to 57313, 77148 to 77878, 92877 to 93652, and 109272 to 110071 of the genome sequence according to SEQ ID NO: 5,
(vi) for CfDefMNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides 6450-6723, 15714-16013, 22164 to 22959, 37020 to 37175, 65930 to 65959, 76886 to 77072, 86479 to 86720, 96698 to 96936, 100566 to 100716 and 105456 to 105485 of the genome sequence according to SEQ ID NO: 6, (vii) for AgMNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides 19648 to 19685, 52009 to 52064, 126402 to 126447, 126568 to 127008 and 128754 to 128960 of the genome sequence according to SEQ ID NO: 7,
(viii) for EppoNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides 3992 to 4094, 18956 to 19121, 88438 to 88950, 95415 to 95715, and 103275 to 103722 of the genome sequence according to SEQ ID NO: 8,
(ix) for AnpeNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides 18636 to 19241, 41040 to 41134, 48378 to 48521, 65768 to 65862, 65894 to 66019, 75128 to 75220, 78778 to 79048, 92649 to 92655, 110552 to 110905, 117548 to 117631, 120778 to 120838, 122220 to 122259 and 125492 to 125506 of the genome sequence according to SEQ ID NO: 9,
(x) for CfMNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides 6460 to 6723, 15714 to 16013, 22164 to 22959, 37020 to 37175, 65930 to 65959, 76886 to 77072, 86479 to 86720, 96698 to 96936, 100566 to 100716, 105456 to 105485, 113424 to 113779, 125448 to 125477 and 126985 to 127146 of the genome sequence according to SEQ ID NO: 10,
(xi) for OpMNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides 103528 to 103833, 127459 to 130270 and 141587 to 142185 of the genome sequence according to SEQ ID NO: 11, and
(xii) for HycuNPV one or more, preferably all of the heterologous repeat sequences corresponding to the regions spanning nucleotides 4710 to 5602, 18642 to 19810, 27465 to 28918, 35645 to 36379, 66095 to 66778, and 112869 to 113601 of the genome sequence according to SEQ ID NO: 12.
The relative reductions in genome size of the genomes of the invention that may be achieved in comparison to the native NPV-alpha baculovirus clade Ia genome are indicated in percent in Table 13a and 13b, i.e. the following table indicates preferred reduction in size that are achieved for the genomes of the present invention. The absolute reductions can be calculated for each of the NPV-alpha baculovirus clade Ia genome of the invention on the basis of the lengths of the respective elements indicated exemplary above for AcMNPV.
In each of the cases indicated above in Tables 13a and 13b one or more, preferably all of the hr regions are also deleted. For AcMNPV this leads to a further reduction in size of up to 3,115 base pairs (including overlaps with a CDS at positions 48,679 to 48,708) or up to 3,085 (excluding the overlap with the CDS), which equates to a size reduction of up to 2.33% and 2.30%, respectively. For BmNPV the size reduction of deleting one or more, preferably all of the hr regions leads to a deletion of up to 3,766 base pairs, which equates to a relative size reduction of up to 2.93%.
The present inventors have also discovered that certain genes of the NPV alpha baculovirus clade Ia genome are important to maintain vital functions of the virus. These genes are involved in various aspects, e.g. transcription, replication, assembly, packaging, and infectivity of the virus. It is preferred that these genes are left intact in the genomes of the invention.
Accordingly, in a preferred embodiment the AcMNPV genome of the invention comprises at least one of the genes encoding helicase, 38K, lef-5, 49K and odv-e18+28. Preferably, the genome of the invention comprises all of these genes. These genes are indicated in Table 14a with reference to the genome of AcMNPV and are designated as vital genes of category I:
Accordingly, in a preferred embodiment the BmNPV genome of the invention comprises at least one of the genes encoding helicase, 38K, lef-5, 49K and odv-e18+28. Preferably, the genome of the invention comprises all of these genes. These genes are indicated in Table 14b with reference to the genome of BmNPV and are designated as vital genes of category I:
The helicase is required for replication, lef5 is required for transcription and 38K, 49K and odv-e18 are required for viral structure, packaging and assembly.
The present inventors have identified further genes that are important for viral function. Accordingly, in a preferred embodiment the NPV alpha baculovirus clade Ia genome of the invention comprises at least one of the genes encoding lcf-2, lef-1, p47, lef-8 vp1054, lef-9, dnapo1, ac66, vlf-1, gp41, ac81, p95, capsid, lef-4, p33, p18, odv-c25, p6.9, odv-eo43, alk-exo, and odv-ec27. Preferably, the genome of the invention comprises all of these genes. These genes are indicated in Table 15a with reference to the genome of AcMNPV and are designated as vital genes of category II:
Accordingly, in a preferred embodiment the BmNPV genome of the invention comprises at least one of the genes encoding lef-1, pif-2, p47, lef-8, vp1054, lef-9, dnapo1, bm56 ac68, vlf-1, gp41, bm67 ac81, p95, capsid, lef-4, p33, p6.9, odv-ec43, pif-3, pif-1, alk-exo, p74, odv-ec27, odv-c56, and lef-2. Preferably, the genome of the invention comprises all of these genes. These genes are indicated in Table 15b with reference to the genome of BmNPV and are designated as vital genes of category II:
Lef-2, lef-1 and dna1 are required for DNA replication, p47, lef-8, lef-9 and lef-4 are required for transcription, vp1054, vlf-1, gp41, p95, capsid, p33, p6.9, odv-o43 and alk-exo are required for viral packaging and assembly, ac66, ac81, and odv-ec27 are required for host interaction and odv-e25 required for viral structure (ODV envelope).
The present inventors have identified further genes that are important for viral function. Accordingly, in a preferred embodiment the NPV alpha baculovirus clade Ia genome of the invention comprises at least one of the genes encoding pk-1, 38.7K, dbp, lef-6, ac29, 39K, lef-11, ac38, ac53, fp, lef-3, ac75, ac76, ac78 tlp20, p40, p12, p48, ac106/107 Ni, ac106/107 Ct, ac110, me53 and ie-1. Preferably, the genome of the invention comprises all of these genes. These genes are indicated in Table 16a with reference to the genome of AcMNPV and are designated as vital genes of category III:
Accordingly, in a preferred embodiment the NPV alpha baculovirus clade Ia genome of the invention comprises at least one of the genes encoding polyhedrin, pk-1, 38.7K ac13, env-prot, dbp, lef-6, bm20 ac29, v-ubi, 39k, lef-11, bm29 ac38, bm42 ac53, fp, bm54 ac66, lef-3, bm61 ac75, bm62 ac76, bm64 ac78, tlp20, p18, odv-e25, p40, p12, p45 Ac p48, bm90 ac106/107, bm92a ac110, me53, bm121 ac145, bm122 ac146 and io-1. Preferably, the genome of the invention comprises all of these genes. These genes are indicated in Table 16b with reference to the genome of BmNPV and are designated as vital genes of category III:
Dbp, lef-11, ac38, lef-3, me53 and ic-1 are required for replication, 38.7K, lef-6 and 39K are required for transcription, fp, ac75, p40 required for virus structure, pk-1, ac53 and p12 are required for host interaction and ac29 ac76, ac78, tlp20, p48, ac106/107 Nt, ac106/107 Ct, and ac110 are of unknown function.
The present inventors have identified further genes that are important for viral function. Accordingly, in a preferred embodiment the NPV alpha baculovirus clade Ia genome of the invention comprises at least one of the genes encoding ac12, ac34, ac55, and ac108. Preferably, the genome of the invention comprises all of these genes. These genes are indicated in Table 17a with reference to the genome of AcMNPV and are designated as vital genes of category IV:
Accordingly, in a preferred embodiment the NPV alpha baculovirus clade Ia genome of the invention comprises at least one of the genes encoding egt, bm25 ac34, lef-10, bm44 ac55, chaB, bm48 ac60, vp80, bm91 ac108, p24, pp34 and ic-0. Preferably, the genome of the invention comprises all of these genes. These genes are indicated in Table 17b with reference to the genome of BmNPV and are designated as vital genes of category IV:
These genes are of unknown function.
The present inventors have identified further genes that are important for viral function.
Accordingly, in a preferred embodiment the NPV alpha baculovirus clade Ia genome of the invention comprises at least one of the genes encoding ac4, ac5, orf1629, ac17+45, ac19, arif-1 Ct, arif-1 Nt, pkip, ac26, lcf-12, ao43, ac48, bjdp, ac72, ac73, ac74, ac79, ac111, ac114, ac120, ac124, lcf-7, gp67, gp16, ac132, ie-2, pe38. Preferably, the genome of the invention comprises all of these genes. These genes are indicated in Table 18a with reference to the genome of AcMNPV and are designated as vital genes of category V:
The present inventors have identified further genes that are important for viral function. Accordingly, in a preferred embodiment the NPV alpha baculovirus clade Ia genome of the invention comprises at least one of the genes and, preferably, the genome of the invention comprises all of these genes orf1629, bm4 ac11, bv/odv-e26, bm9 ac17, bm10 ac18, bm11 ac19, arif-1, pkip, bm17 ac26, iap-1, bm21 ac30, lef-12, bm34 ac43, bjdp, gp37, iap-2, bm58a ac72, bm59 ac73, bm60 ac74, bm65 ac79, bm93 ac111, bm94 ac114, bm96, bm98 ac120, bm101 ac124, lef-7, gp64/67, p116, bm109 ac132, p26, p10, io-2, pe38, ptp, bm133 ac4 and bm134 ac5. These genes are indicated in Table 18b with reference to the genome of BmNPV and are designated as vital genes of category V:
Ac79 and lef-7 are required for replication, lef-12 and bjdp are required for transcription, orf1629 is required for virus structure ao4, arif-1 Ct/Nt, pkip, gp67, ie-2 and pe38 are required for host interaction and ac5, ac17+45, ac19, ac26, ac43, ac48, ac72, ac73, ac74, ac111, ac114, ac120, ac124, gp16, and ac132 are of unknown.
The present inventors have identified further genes that are important for viral function. Accordingly, in a preferred embodiment the NPV alpha baculovirus clade Ia genome of the invention comprises at least one of the genes encoding ac45, ao47, ac52+71, he65, 35K and ac154. Preferably, the genome of the invention comprises all of these genes. These genes are indicated in Table 19a with reference to the genome of AcMNPV and are designated as vital genes of category VI:
These genes are indicated in Table 19b with reference to the genome of BmNPV and are designated as vital genes of category VI:
ac52+71 and he65 appear required for replication, 35K appears required for host interaction and ac45, ao47, and he65 are of unknown function.
It is preferred that the genome of the present invention comprises one or all, preferably all of the following genes: (i) vital genes of category I; (ii) vital genes of category II; (iii) vital genes of category III; (iv) vital genes of category IV; (v) vital genes of category V: (vi) vital genes of category VI; (vii) vital genes of category I and II; (viii) vital genes of category I and III; (ix) vital genes of category I and IV; (x) vital genes of category I and V; (xi) vital genes of category I and VI; (xii) vital genes of category I, II and m; (xiii) vital genes of category I, II and IV; (xiv) vital genes of category I, II and V; (xv) vital genes of category I, II and VI; (xvi) vital genes of category I, III and IV; (xvii) vital genes of category I, III and V; (xviii) vital genes of category I, III and VI; (xix) vital genes of category I, IV and V; (xx) vital genes of category I, IV and VI; (xxi) vital genes of category I, V and VI; (xxii) vital genes of category I, II, II and IV; (xxiii) vital genes of category I, II, III and V; (xxiv) vital genes of category I, II, III and VI; (xxv) vital genes of category I, II, IV and V; (xxvi) vital genes of category I, II, IV and VI; (xvii) vital genes of category I, II, V and VI; (xxix) vital genes of category I, II, III, IV and V; (xxx) vital genes of category I, II, III, IV and VI; (xxxi) vital genes of category I, III, IV, V and VI; (xxxii) vital genes of category I, II, IV, V and VI; (xxxii) vital genes of category I, II, III, V and VI; (xxxiii) vital genes of category I, II, III, IV, V and VI.
In each of above cases it is preferred that the genome comprises both the CDS as well as any 5′-UTR and/or 3′-UTR flanking the CDS.
The present inventors also found that the following CDSs of the hr4-5 section (bp 99182-121072) can be deleted completely without detrimental effect on AcMNPV: p26, p10, p74, pif-3, ac116, ac117, ac118, ac121, ac122, pk-2, v-cath, pp34 and/or 94K. Also, the CDSs of the genes pif-1 and chiA of the hr4-5 section can be deleted partially (truncated) without detrimental effect on AcMNPV. This is because these genes are not essential for virus infection and propagation, but since the promoter regions of adjacent genes overlap with their CDSs, a portion of pif-1 and chiA has to remain. For pif-1, the portion that may be deleted is bp 100699 to bp102199 of the AcMNPV genome (pif-1 itself extends from bp 100699 to bp 102291 of the AcMNPV genome), and for chiA, this portion is bp 105560 to bp 106937 of the AcMNPV genome (chiA itself extends from bp 105282 to bp 106937 of the AcMNPV genome). In other words, the portion that must remain of pif-1 in the genome is bp 102200 to bp 102291 of the AcMNPV genome and the portion that must remain of chiA in the genome is bp 105282 to bp 105559 of the AcMNPV genome. Accordingly, pif-1 and chiA are partially deleted such that the promoter regions of the adjacent genes remain. This applies to all embodiments mentioned herein, which relate to pif-1 and/or chiA. In a further general embodiment, genes taught to be deletable herein may be deleted only in as far as portions of the sequence remain which are part of the CDS or the 5′/3′ UTR of a gene taught to be essential.
Thus, in a further embodiment, the NPV alpha baculovirus clade Ia genome according to the first aspect of the invention lacks at least one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen or fourteen genes, preferably all genes of the group consisting of pif-1, p26, p10, p74, pif-3, ac116, ac117, ac118, ac121, ac122, pk-2, chiA, v-cath, pp34 and 94K, wherein pif-1 and chiA are partially deleted such that the promoter regions of the adjacent genes remain. The adjacent genes are ac120 for pif-1 and lef-7 for chiA and the promoter region of these genes that overlap with pif-1 and chiA, respectively, are defined above. Preferably, said genome also lacks (i) the 5′-L TR and/or 3′-UTR (as defined above) and/or (ii) the spacers 5′ and/or 3′ (as defined above) of at least one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen or preferably all of these genes The present inventors also found that the following CDSs of the hr4-5 section (bp 99182-121072) are important to maintain vital functions of AcMNPV: ac120, ac124, lef-7, gp64/67, p24, gp16, ac132, alk-exo, and 35K. Thus, in a further embodiment, the NPV alpha baculovirus clade Ia genome according to the first aspect of the invention comprises at least one, two, three, four, five, six, seven, eight preferably all genes of the group consisting of ac120, ac124, lef-7, gp64/67, p24, gp16, ac132, alk-exo, and 35K. Preferably, said genome comprises also (i) the 5′-UTR and/or 3′-UTR (as defined above) and/or (ii) the spacers 5′ and/or 3′ (as defined above) of at least one, two, three, four, five, six, seven, eight, or preferably all of these genes.
Thus, in a preferred embodiment, the NPV alpha baculovirus clade Ia genome according to the first aspect of the invention (a) lacks at least one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen or fourteen genes, preferably all genes of the group consisting of pif-1, p26, p10, p74, pif-3, ac116, ac1117, ac118, ac121, ac122, pk-2, chiA, v-cath, pp34 and 94K, wherein pif-1 and chiA are only partially deleted such that the promoter regions of the adjacent genes remain; and (b) comprises at least one, two, three, four, five, six, seven, eight or preferably all genes of the group consisting of ac120, ac124, lcf-7, gp64/67, p24, gp16, ac132, alk-exo, and 35K. Preferably, said genome lacks/comprises also (i) the 5′-UTR and/or 3′-UTR and/or (ii) the spacers 5′ and/or 3′ of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 16, 17, 19, 20, 21, 22, 23 or preferably all of these genes.
In a second aspect the present invention provides a NPV alpha baculovirus clade Ia genome according to the first aspect of the invention further comprising a nucleotide sequence heterologous to the NPV alpha baculovirus clade Ia genome. The term “heterologous” in this context refers to nucleotide sequence not natively occurring in the genome of the respective NPV alpha baculovirus clade Ia genome or not occurring at that position. Accordingly, a native promoter of that baculovirus that is rearranged within the genome is also considered heterologous. Preferably, the term refers to nucleotide sequence not natively comprised in a baculovirus, more preferably a eukaryotic gene or cDNA. Particularly, preferred the gene or cDNA is of mammalian, e.g. mouse, rat, rabbit, dog, cat, human origin. It is particularly preferred that the heterologous nucleotide sequence comprises more than one gene. Due to the large size reduction the genome of the present invention can accommodate large sections of heterologous nucleotide sequences. The inserts can be as big as the native sequence removed. Accordingly, Table 9 can serve as an indication of the size of the heterologous nucleotide sequence that can be present in the genome of the invention. The expression of the genes comprised in the heterologous nucleotide sequence is preferably driven by IE1 or polyhedrin or p10 promoter.
Other preferred heterologous nucleotide sequence that may be comprised in the genome are a Tn7 site, a nucleotide sequence encoding a resistance gene, a Homing endonuclease site, a mutated fp gene, loxP sites, IE1/polyhedrin/p10 promoter, a nucleotide sequence encoding a fluorescent protein. Preferred fluorescent proteins comprise green, red or yellow fluorescent proteins or mCherry.
In a third aspect the present invention provides an infectious NPV alpha baculovirus clade Ia virus comprising a genome according to the first or the second aspect of the invention.
Using improved methodology reported by Smith and colleagues (Smith H O et al PNAS 2003) on accurate assembly of ˜10 Kb DNA segments, we aim to assemble our computationally identified assembled gene segments from synthetic long polynucleotide fragments using molecular biology techniques for DNA assembly (Stemmer W P C et al., Gene 1995, Gibson et al., Nature Methods 2009). PCR amplification would then be used to obtain large amounts of pure full-length genomes (15-Kb DNA segments) and finally, gel electrophoresis will be used to purify amplified gene segments. Having engineered and amplified the gene segment of interest, the wild-type sequence with this synthetic DNA fragment could be replaced in the current “wild-type” baculoviral genome (which itself has been already engineered by classical means). In principle, the synthetic gene segment could be inserted into baculoviruses using E. coli/yeast based recombination and in-vitro ligation (Zhao Y. et al., Nucleic Acids Res 2003). As reported by Gibson and colleagues on synthetic genome assembly strategy in yeast (Gibson D G et al., Science 2010), three stage of genome assembly using transformation and homologues recombination in yeast. As a first stage, 10 kb synthetic gene segments and vector would be recombined in yeast/E. coli and transferred to E. coli. Then, the vector with assembled segments will be isolated for positive selection. At the second and third stage, the multiple 10 kb fragments and wild-type genome fragments would be transformed in the yeast/E. coli which would produce larger second-stage and final stage assembly intermediate. Here in order to generate semi-synthetic genome assembly, yeast/E. coli would be co-transformed with the baculovirus wild-type gene fragments and PCR amplified vector with overlapping ends of the synthetic inserts.
Once the hybrid genome is constructed and isolated from the yeast, it will be characterized by screening with multiplex PCR. Further for comparison, natural genome extracted from the yeast/E. coli and baculovirus (wild-type) would be used for characterization analysis. Once characterized, semi-synthetic genome transplantation strategy will be used wherein semi-synthetic genome from yeast clones would be transplanted into recipient cells as described before (Benders G A et al, Nucleic acids Res. 2010). Further the functionality of the hybrid virus with the synthetic gene segment would be validated based on its self-replicating properties and also by expressing test proteins and complexes.
In a fourth aspect the present invention provides a cell infected with a virus according to the third aspect of the present invention. A large number of suitable cells are publically available (see, e.g. Lynn, D. E. (2007) Methods in Molecular Biology. Vol 338, Chapter 6 Baculovirus and Insect Cell Expression Protocols, Editor: Murhammer D. W., Humanan Press Inc.). Preferably, the cell is selected from the group consisting of Ao/I, Hi5, Sf9, Sf21, Ao38, Drosophila $2, T.ni, FTRS-AoL1/-AoL2/-AfL, BCIRL/AMCY-AiOV-CLG, BCIRL/AMCY-AiTS-CLG, HCRL-ATO10/ATO20, BCIRL/AMCY-AfOV-CLG, BCIRL/AMCY-AfTS-CLG, RML-2, NISES-AnPe-426, NISES-Anya-0611, BCIRL/AMCY-AgE-CLG-1/2/3, UFL-AG-286, FTRS-AbL81, SES-Bma-O1A/R, Bm-N/-5/-21E-HNU5, NIV-BM-1296/-197, SES-Bm-130A/30R/e21A/c 21B/e 21R, SES-BoMo-15A/-C129/-JI25, SPC-Bm36/-Bm40, WIV-BS-481/484, FPMI-CF-1/2/3, FPMI-CF-203, FPMI-CF-50/60/70, IPRI-CF-1/10/12, IPRI-Cf124, IPRI-CF-16/-16T, IPRI-CF-5/-6/-8, CP-1268, CP-169, CpDW1-15, IZD-Cp 4/13, IZD-CP1508/-CP2202/-CP2507/-CP0508, SIE-EO-801/-803, IPLB-Ekx4T/-Ekx4V, EA1174A, EA1174H, IAFEs-1, BCIRL-HA-AM1, CSIRO-BCIRL-HA1-3, CSIRO-BCIRL-HP1-5, BCIRL/AMCY-HzE-CLG1-9, BCIRL-HZ-AM1-3, IMC-HZ-1, IPLB-HZ-1074-5, IPLB-HZ-1079, IPLB-HZ-110, IPLB-HZ-124Q, BCIRL/AMCY-HvE-CLG1-3, BCIRL/AMCY-HvOV-CLG, BCIRL/AMCY-Hv-TS-GES, BCIRL-HV-AM1-2, IPLB-HvE1a/-It, IPLB-HvE1s, IPLB-HvE6a/-It, IPLB-HvE6s/-It, IPLB-HvT1, FTRS-HmA45, FTRS-HIL1-2, NIAS-LeSe-11 IPLB-LD-64-67, IPLB-LdEG/-LdEI/-LdEIt/-LdEp/-LdFB, IZD-LD1307/-LD1407, UMN-MDH-1, HPB-MB, IZD-MB0503/MB0504/MB1203/MB2006/2007/2506, MB-H260, MbL-3, NIAS-MaBr-85192/93, NIAS-MaBr-92, NIAS-MB-19/25/32, SES-MaBr-1/2/3/4/5, FPMI-MS-12/4/5/7, MRRL-CH-1/2, BPMNU-MyCo-1, IPLB-OE505A/s, IPLB-O1E7, IPRI-OL-12/13/4/9, BCIRL/AMCY-OnFB-GES1/2, UMC-OnE, FTRS-PhL, Px-58/-64, ORS-Pop-93/-95, BTI-PR10B/-PR8A1/-PR8A2/-PR9A, NIAS-PRC-819A/-819B/-819C, NYAES-PR4A, IAL-PID2, IPLB-PiE, UMN-PIE-1181, BCIRL/AMCY-PxLP-CLG, IPLB-PxE1/-PxE2, PX-1187, BCIRL-PX2-HNU3, BTI-Pu-2, BTI-Pu-A7/-A7S, BTI-Pu-B9, BTI-Pu-M, BTI-Pu-MIB, FRI-SpIm-1229, BCIRL/AMCY-ScE-CLG1/-CLG4/-CLG5, Se3FH, Se4FH, Se5FH, Se6FHA, Se6FHB, SeHe920-1a, UCR-SE-1, BCIRL/AMCY-SfTS-GES, IAL-SFD1, Sf1254, IPLB-Sf21AE, HPB-SL, SPC-SI-48/-52, UIV-SL-373/-573/-673, IBL-SLIA, NIV-SU-893/-992, BCIRL-503-HNU1/504-HNU4, BCIRL/AMCY-TnE-CLG1/-TnE-CLG1 MK, BCIRL/AMCY-TnE-CLG2/-TnE-LG2MK/-TnE-CLG3/-TnTS-GES1/-TnTS-GES3, BTI-TN5B1-4/-TN5C1/-TN5F2/-TN5G2A1/BTI-TN5G3/-TN5G33, IAL-TND1, IPLB-TN-R, and TN-368.
In a fifth aspect the present invention provides a method for producing an NPV alpha baculovirus clade Ia genome according to the first or second aspect of the invention comprising the step of chemically synthesizing all or part of the genome.
In this it is preferred that the part of the genome that flanks the regions that are deleted from the genome is synthesized. These parts are preferably inserted into a part of a native NPV-alpha baculovirus clade Ia genome to reconstitute a genome that is capable of forming an infectious nucleopolyhedrovirus. This can be achieved by using advanced recombination technologies such as ET recombination (in vitro and in vivo) (Zhang, Y, et al., (1998) Nature Genet., 20, 123-128, Hill, F. et al., (2000) Genomics, 64, 111-113.) to assemble these synthetic DNAs into the part of a native NPV-alpha baculovirus genome to yield functional virus.
In a sixth aspect the present invention provides a method for producing a NPV alpha baculovirus clade Ia virus by introducing a genome of the first or second aspect or a genome producible according to the method of the fifth aspect of the invention into a cell, preferably one of the cells indicated above regarding the fourth aspect
The Examples are designed in order to further illustrate the present invention and serve the purpose to allow a better understanding of the invention. They are not to be construed as limiting the scope of the invention in any way.
The data used for creating the baculovirus genome map shown in
All available database annotations on gene product function were collected from NCBI Genebank, NCBI Protein, NCBI VOG clusters of related viral proteins and the UniProt database by Perl programs. Additionally 53 baculovirus genomes available in October 2011 in the NCBI ReSeq nucleotide database were analyzed for orthologous protein genes by clustering protein sequences downloaded from the NCBI protein database with a Perl program, which piped the clustering program blastclust, a part of the NCBI C-toolkit legacy blast package (Vijayachandran LS, Thimiri Govinda Raj D B, et al (2013) Bioengineered 4:5, 1-9). Categories for conservation/variability were assigned for following different lineages of baculoviruses: all baculoviruses with conserved synteny (core+), all baculoviruses (core), lepidopteran baculoviruses, NPV (alphabaculoviruses), NPV clade I, NPV clade Ia, variable for next neighbours of AcMNPV, unique for AcMNPV). For gene essentiality classification the grades of gene conservation were integrated with published information on mRNA expression (no mRNA observed), expression in different developmental stages (immediate early, early, late, very late), content of stage-specific promoter motifs in the upstream regions and gene product function and localization indicating non-essentiality for virus propagation in cell culture (e.g. host interaction factors, oral infectivity factor, occlusion-derived virus or occlusion-body localized, other auxiliary proteins, some genes acquired from the host genome). Protein genes, which were already published as non-essential, were categorized as type 1 deletion category (deletion is harmless), that having no known mRNA expression or have been proven as non-essential in the next relatives of AcMNPV (like BmNPV) as type 2 (deletion is likely harmless), that having presumably non-essential functions and are variable in alphabaculoviruses or such protein genes having unknown functions and are variable in next neighbours as type 3 (deletion perhaps harmless) and that with suspect non-essential functions and variability in other Alphabaculovirus genomes but conservation in next relatives as type 4 (deletion perhaps harmless). The genome map was drawn by designing a Perl program, which imports a table of the categorized gene data and exports an image made by the Perl packages GD and GD::Simple.
According to Example 1, the inventors identified, by comparative genomic analyses and data mining, regions in the baculovirus genome that can be rewired advantageously to generate an improved baculovirus for drug discovery purposes.
For a proof-of-concept of this approach, the hr4 and hr5 containing region was chosen. This region is located between bp 99.182 and 121.072 on the baculoviral genome, and contains in addition to the hr4, hr5 regions 22 genes of which, based on the inventors' studies, 12 are non-essential and 10 are essential. In terms of size, this 22 kb segment contains less than 10 kb of essential DNA material including genes, promoters and terminators, and around 20 kb of DNA with functions the deletions of which can be tolerated or even be enhancing the performance of the virus in cell culture. The hr4, hr5 regions and their essential and non-essential portions are shown in
The inventors designed a synthetic DNA corresponding to a rewired and minimized version of this segment (see
Next, they used homologous recombination techniques (see Zhang Y et al Nat. Biotechnology (2000) to graft this synthetic DNA into the wild-type baculoviral genome, replacing the original wild-type sequence in this segment with the designer DNA (see
Thus, the resulting hybrid (i.e. partly synthetic, partly wild-type) baculoviral genome (SynBac1.0) proved to be fully functional, infectious and able to produce heterologous protein. This shows that the approach of the invention can be reduced to practice.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2014/058885 | 2/10/2014 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
61762607 | Feb 2013 | US |