Claims
- 1. A nucleotide concatemer comprising in the 5′→3′ direction a cassette of nucleotide sequence of the general formula
- 2. The concatemer according to claim 1, wherein the nucleotide sequence comprises a DNA sequence selected from the group comprising cDNA, genomic DNA.
- 3. The concatemer according to claim 1, wherein the nucleotide sequence is single stranded, or partly single stranded.
- 4. The concatemer according to claim 1, wherein the nucleotide sequence is double stranded.
- 5. The concatemer according to any of the preceding claims 1 to 4, comprising nucleotide sequences from at least one expression state.
- 6. The concatemer according to any of the preceding claims 1 to 5, comprising nucleotide sequences from at least two expression states.
- 7. The concatemer according to any of the preceding claims 1 to 6, wherein the rs1-rs2 restriction site of at least two cassettes are recognised by the same restriction enzyme, more preferably are identical.
- 8. The concatemer according to claim 7, wherein the rs1-rs2 restriction site of essentially all cassettes are recognised by the same restriction enzyme, more preferably are identical.
- 9. The concatemer according to any of the preceding claims 1 to 8, wherein substantially all cassettes are different.
- 10. The concatemer according to any of claims 1 to 9, wherein at least one cassette comprises an intron between the promoter and the expressible nucleotide sequence, more preferably substantially all cassettes comprise an intron between the promoter and the expressible nucleotide sequence.
- 11. The concatemer according to any of the preceding claims 1 to 10, wherein the difference comprises different promoters, and/or different expressible nucleotide sequences, and/or different spacers and/or different terminators and/or different introns.
- 12. The concatemer according to any of the preceding claims 1 to 11, wherein n is at least 10, such as at least 15, for example at least 20, such as at least 25, for example at least 30, such as from 30 to 60 or more than 60, such as at least 75, for example at least 100, such as at least 200, for example at least 500, such as at least 750, for example at least 1000, such as at least 1500, for example at least 2000.
- 13. The concatemer according to any of the preceding claims 1 to 12, wherein at least one cassette comprise the cassette from a primary vector according to claims 63 to 98, more preferably substantially all cassettes comprise the cassette from a primary vector according to claims 63 to 98.
- 14. The concatemer according to any of the preceding claims 1 to 13, comprised in an artificial chromosome.
- 15. The concatemer according to claim 14, wherein the artificial chromosome is selected from the group comprising a Yeast Artificial Chromosome, a mega Yeast Artificial Chromosome, a Bacterial Artificial Chromosome, a mouse artificial chromosome, a Mammalian Artificial Chromosome, an Insect Artificial Chromosome, an Avian Artificial Chromosome, a Bacteriophage Artificial Chromosome, a Baculovirus Artificial Chromosome, or a Human Artificial Chromosome.
- 16. The concatemer according to any of the preceding claims 1 to 13, comprised in a plasmid or an insertion vector, such as for example yeast integrative plasmid (YIp), Yeast replicating plasmid (YRp), Yeast Episomal plasmid (YEp), Yeast centromeric plasmid (YCp), Yeast linear plasmid (YLp), Yeast expression plasmid (YXp), Yeast retrotransposons (Ty elements), Yeast killer plasmid, Yeast disintegration plasmid (YDp).
- 17. The concatemer according to any of claims 134 to 16, wherein the vector further comprises at least one selectable genetic marker, such as a repressive or a dominant marker.
- 18. The concatemer according to claim 17, comprising two selectable genetic markers.
- 19. The concatemer according to claim 17 or 18, wherein the marker comprises a marker selected from the group comprising LEU 2, TRP 1, HIS 3, LYS 2, URA 3, ADE 2, Amyloglucosidase, β-lactamase, CUP 1, G418R, TUNR, KiLk1, C230, SMR1, SFA, HygromycinR, methotrexateR, chloramphenicolR, DiuronR, ZeocinR, CanavanineR.
- 20. The concatemer according to any of claims 1 to 19, wherein different expressible nucleotide sequences come from the same or from different expression states.
- 21. The concatemer according to claim 20, wherein the different expression states represent at least two different tissues, such as at least two organs, such as at least two species, such as at least two genera.
- 22. The concatemer according to claim 21, wherein the different species are from at least two different phylae, such as from at least two different classes, such as from at least two different divisions, more preferably from at least two different sub-kingdoms, such as from at least two different kingdoms.
- 23. The concatemer according to claim 21, wherein one species is a eukaryot and another species is a prokaryot.
- 24. The concatemer according to any of the preceding claims 1 to 23, being designed to minimise the level of repeat sequences occurring in the concatemer.
- 25. A method for concatenation comprising the steps of concatenating at least two cassettes of nucleotide sequences each cassette comprising a first sticky end, a spacer sequence, a promoter, an expressible nucleotide sequence, a terminator, a spacer sequence, and a second sticky end.
- 26. The method according to claim 25, further comprising
starting from a primary vector [RS1-RS2-SP—PR—X-TR—SP—RS2′-RS1′], wherein X denotes an expressible nucleotide sequence, RS1 and RS1′ denote restriction sites, RS2 and RS2′ denote restriction sites different from RS1 and RS1′, SP individually denotes a spacer sequence of at least two nucleotides, PR denotes a promoter, TR denotes a terminator, iii) cutting the primary vector with the aid of at least one restriction enzyme specific for RS2 and RS2′ obtaining cassettes having the general formula [rs2-SP—PR—X-TR—SP-rs1] wherein rs1 and rs2 together denote a functional restriction site RS2 or RS2′, iv) assembling the cut out cassettes through interaction between rs1 and rs2.
- 27. The method according to claim 25 or 26, comprising concatenating at least at least 10 cassettes, such as at least 15, for example at least 20, such as at least 25, for example at least 30, such as from 30 to 60 or more than 60, such as at least 75, for example at least 100, such as at least 200, for example at least 500, such as at least 750, for example at least 1000, such as at least 1500, for example at least 2000.
- 28. The method according to claim 26, further comprising addition of vector arms each having a RS2 or RS2′ in one end and a non-complementary overhang or a blunt end in the other end.
- 29. The method according to claim 27, whereby the ratio of vector arms to cassettes determines the number of cassettes in the concatemer.
- 30. The method according to claim 27 or 29, wherein the vector arms are artificial chromosome vector arms.
- 31. The method according to claim 26, further comprising addition of stopper fragments, the stopper fragments each having a RS2 or RS2′ in one end and a non-complementary overhang or a blunt end in the other end.
- 32. The method according to claim 31, further comprising ligating vector arms to the stopper fragments.
- 33. The method according to claim 26, further comprising
iv) isolating mRNA from an expression state, v) obtaining substantially full length cDNA clones corresponding to the mRNA sequences, vi) inserting the substantially full length cDNA clones into a cloning site in a cassette in a primary vector, said cassette being of the general formula in 5′→3′ direction: [RS1-RS2-SP—PR—CS-TR—SP—RS2′-RS1′]wherein CS denotes a cloning site.
- 34. The method according to claim 26, wherein RS1 and RS1′ are restriction sites leaving blunt ends, and RS2 and RS2′ are restriction sites leaving compatible sticky ends.
- 35. The method according to claim 26, wherein RS1 and RS1′ are identical, and wherein RS2 and RS2′ are identical.
- 36. The method according to claim 26, wherein RS2 and RS2′ have palindromic overhangs.
- 37. The method according to claim 26, wherein RS2 and RS2′ have non-palindromic overhangs.
- 38. The method according to any of the preceding claims 25 to 377, further comprising selection of vectors having expressible nucleotide sequences from at least two different expression states, such as from two different species.
- 39. The method according the claim 388, whereby the two different species are from two different classes, such as from two different divisions, more preferably from two different sub-kingdoms, such as from two different kingdoms.
- 40. The method according to any of the claims 25 to 39, whereby the concatemer is ligated into an artificial chromosome selected from the group comprising yeast artificial chromosome, mega yeast artificial chromosome, bacterial artificial chromosome, mouse artificial chromosome, human artificial chromosome.
- 41. The method according to any of the preceding claims 26 to 400, whereby RS2. and RS2′ in at least two cassettes are cleaved by one restriction enzyme, preferably RS2 and RS2′ in substantially all cassettes are cleaved by one restriction enzyme.
- 42. A cell comprising at least one concatemer of individual oligonucleotide cassettes, each concatemer comprising oligonucleotide of the following formula in 5′→3′ direction:
- 43. A cell comprising at least one concatemer of individual oligonucleotide cassettes, each concatemer comprising oligonucleotide of the following formula in 5′→3′ direction:
- 44. The cell according to claim 422 or 433, wherein substantially all rs1-rs2 sequences are recognised by the same restriction enzyme, more preferably wherein substantially all rs1-rs2 sequences are substantially identical.
- 45. The cell according to any of claims 422 to 444, wherein n is at least 10, such as at least 15, for example at least 20, such as at least 25, for example at least 30, such as from 30 to 60 or more than 60, such as at least 75, for example at least 100, such as at least 200, for example at least 500, such as at least 750, for example at least 1000, such as at least 1500, for example at least 2000.
- 46. The cell according to any of claims 422 to 455, comprising 2 concatemers per cell, for example 3 per cell, such as at least 4 per cell.
- 47. The cell according to any of claims 422 to 466, wherein at least one cassette comprises an intron between the promoter and the expressible nucleotide sequence, more preferably substantially all cassettes comprise an intron between the promoter and the expressible nucleotide sequence.
- 48. The cell according to any of claims 433 to 477, comprising a eukaryotic cell selected from the group comprising: yeasts; filamentous ascomycetes such as Neurospora crassa and Aspergillus nidulans; plant cells such as those derived from Nicotiana and Arabidopsis; mammalian host cells such as those derived from humans, monkeys and rodents, such as chinese hamster ovary (CHO) cells, NIH/3T3, COS, 293, VERO, HeLa.
- 49. The cell according to claim 488, being a yeast cell selected from the group comprising baker's yeast, Kluyveromyces marxianus, K. lactis, Candida utilis, Phaffia rhodozyma, Saccharomyces boulardii, Pichia pastoris, Hansenula polymorpha, Yarrowia lipolytica, Candida paraffinica, Schwanniomyces castellii, Pichia stipitis, Candida shehatae, Rhodotorula glutinis, Lipomyces lipofer, Cryptococcos curvatus, Candida spp. (e.g. C. palmioleophila), Yarrowia lipolytica, Candida guilliermondii, Candida, Rhodotorula spp., Saccharomycopsis spp., Aureobasidium pullulans, Candida brumptii, Candida hydrocarbofumarica, Torulopsis, Candida tropicalis, Saccharomyces cerevisiae, Rhodotorula rubra, Candida flaveri, Eremothecium ashbyii, Pichia spp., Kluyveromyces, Hansenula, Kloeckera, Pichia, Pachysolen spp., or Torulopsis bornbicola.
- 50. The cell according to any of the preceding claims 433 to 49, having a mutation in a central biosynthetic pathway.
- 51. The cell according to claim 500, comprising an inserted selectable genetic marker complementing the mutation.
- 52. The cell according to any of the preceding claims 433 to 511, comprising a selectable genetic marker.
- 53. The cell according to any of claims 433 to 522, wherein the nucleotide sequence of at least one concatemer, preferably the nucleotide sequence from substantially all concatemers have been designed to minimise the level of repeat sequences in any one concatemer.
- 54. The cell according to claim 533, wherein recombination within the expressible nucleotide sequence has been minimised.
- 55. The cell according to any of the preceding claims 433 to 544, wherein at least one concatemer, preferably substantially all concatemers is/are concatemer/s according to claims 1 to 24.
- 56. A method for producing a transgenic cell comprising inserting into a host cell a concatemer comprising a heterologous nucleotide sequence comprising at least two genes each controlled by a promoter, wherein the two promoters are different.
- 57. The method according to claim 566, whereby the inserted genes come from at least two different expression states.
- 58. The method according to claim 57, whereby the expression states are comprised in different species.
- 59. The method according to claim 588, whereby the different species are comprised in different kingdoms.
- 60. The method according to any of the preceding claims 566 to 59, comprising insertion of a concatemer according to claims 1 to 24.
- 61. The method according to any of the preceding claims 566 to 600, further comprising selecting for cells comprising at least one stably maintained concatemer,
- 62. The method according to claim 611, whereby selection comprises selection of cells carrying at least one selectable genetic marker on an artificial chromosome, more preferably two selectable genetic markers on an artificial chromosome.
- 63. A primary vector comprising a nucleotide sequence cassette of the general formula in 5′→3′ direction:
- 64. The vector according to claim 633, wherein the nucleotide sequence is a DNA sequence.
- 65. The vector according to claim 633, wherein the nucleotide sequence is double stranded.
- 66. The vector according to any of the preceding claims 633 to 655, further comprising an intron sequence between the promoter and the cloning site and/or between the cloning site and the terminator.
- 67. The vector according to any of the preceding claims 633 to 666, wherein the cloning site comprises an expressible nucleotide sequence.
- 68. The vector according to claim 677, wherein in the expressible nucleotide sequence comprises substantially full length cDNA.
- 69. The vector according to claim 677, wherein the expressible nucleotide sequence comprises genomic DNA.
- 70. The vector according to any of the preceding claims 633 to 69, wherein any of RS1, RS1′, RS2, RS2′ comprise a rare restriction site selected from those of Example 6.
- 71. The vector according to claim 700, wherein the recognition sequence for RS1, RS1′, RS2 and/or RS2′ comprise at least 6 bases such as at least 8 bases, for example at least 10 bases.
- 72. The vector according to claim 711, wherein the recognition sequence comprises a bipartite sequence.
- 73. The vector according to claim 711, wherein the GC content of the recognition sequence is more than 40%, preferably more than 50%, more preferably equal to or more than 60%.
- 74. The vector according to any of the preceding claims 633 to 733, wherein the restriction enzyme recognising RS2 and RS2′ produces sticky ends upon cleavage of a double stranded nucleotide sequence, preferably wherein the sticky ends have a pre-determined nucleotide sequence.
- 75. The vector according to any of the preceding claims 633 to 744, wherein RS2 and RS2′ are identical.
- 76. The method according to claim 755, wherein the RS2 and/or RS2′ overhang is a palindromic sequence.
- 77. The method according to claim 755, wherein the RS2 and/or RS2′ overhang is a non-palindromic sequence.
- 78. The vector according to any of the preceding claims 633 to 755, wherein the restriction enzyme recognising RS1 and RS1′ produces blunt ends upon cleavage of a double stranded nucleotide sequence
- 79. The vector according to any of the preceding claims 633 to 755, wherein the restriction enzyme recognising RS1 and RS1′ produces sticky ends with a nucleotide sequence being non-compatible with the nucleotide sequence of sticky ends produced upon cleavage of RS2 and RS2′.
- 80. The vector according to any of the preceding claims 633 to 79, wherein RS1 and RS1′ are identical.
- 81. The vector according to any of the preceding claims 633 to 800, further comprising a spacer sequence between TR and RS2′.
- 82. The vector according to any of the preceding claims 633 to 811, wherein the spacer and the optional spacer sequence together comprise at least 100 bases, such as at least 250 bases, such as at least 500 bases, such as at least 750 bases, for example at least 1000 bases, such as at least 1100 bases, for example at least 1200 bases, such as at least 1300 bases, for example at least 1400 bases, such as at least 1500 bases, for example at least 1600 bases, such as at least 1700 bases, for example at least 1800 bases, such as at least 1900 bases, for example at least 2000 bases, such as at least 2100 bases, for example at least 2200 bases, such as at least 2300 bases, for example at least 2400 bases, such as at least 2500 bases, for example at least 2600 bases, such as at least 2700 bases, for example at least 2800 bases, such as at least 2900 bases, for example at least 3000 bases, such as at least 3200 bases, for example at least 3500 bases, such as at least 3800 bases, for example at least 4000 bases, such as at least 4500 bases, for example at least 5000 bases, such as at least 6000 bases.
- 83. The vector according to claims 811 or 822, wherein at least one of the spacer sequences comprises between 100 and 2500 bases, preferably between 200 and 2300 bases, more preferably between 300 and 2100 bases, such as between 400 and 1900 bases, more preferably between 500 and 1700 bases, such as between 600 and 1500 bases, more preferably between 700 and 1400 bases.
- 84. The vector according to any of the preceding claims 633 to 833, wherein the promoter is an externally controllable promoter.
- 85. The vector according to any of the preceding claims 633 to 844, wherein the promoter comprises an inducible promoter or wherein the promoter comprises a repressible promoter.
- 86. The vector according to any of the preceding claims 633 to 855, wherein the promoter comprises both repressible and inducible elements.
- 87. The vector according to any of the preceding claims 633 to 866, wherein the promoter is chemically inducible and/or repressible and/or inducible/repressible by temperature.
- 88. The vector according to any of the preceding claims 633 to 877, wherein the promoter is induced and/or repressed by any factor selected from the group comprising carbohydrates, e.g. galactose; low inorganic phosphase levels; temperature, e.g. low or high temperature shift; metals or metal ions, e.g. copper ions; hormones, e.g. dihydrotestosterone; deoxycorticosterone; heat shock (e.g. 39° C.); methanol; redox-status; growth stage, e.g. developmental stage; synthetic inducers, e.g. the gal inducer.
- 89. The vector according to any of the preceding claims 633 to 888, wherein the promoter comprises a promoter selected from the group comprising ADH 1, PGK 1, GAP 491, TPI, PYK, ENO, PMA 1, PHO5, GAL 1, GAL 2, GAL 10, MET25, ADH2, MEL 1, CUP 1, HSE, AOX, MOX, SV40, CaMV, Opaque-2, GRE, ARE, PGK/ARE hybrid, CYC/GRE hybrid, TPI/α2 operator, AOX 1, MOX A.
- 90. The vector according to claim 889, wherein the promoter is selected from hybrid promoters including PGK/ARE hybrid, CYC/GRE hybrid.
- 91. The vector according to any of the preceding claims 633 to 900, wherein the promoter is a synthetic promoter.
- 92. The vector according to any of the preceding claims 633 to 911, wherein the cloning site allows directional cloning.
- 93. The vector according to any of the preceding claims 633 to 922, wherein the cloning site comprises multiple coning sites, such as a polylinker site, the cloning site preferably encoding a series of restriction endonuclease recognition sites.
- 94. The vector according to any of the preceding claims 633 to 933, wherein the promoter and terminator are capable of functioning in an expression host cell, preferably in a yeast cell.
- 95. The vector according to any of the preceding claims 633 to 944, wherein the primary vector comprising the cassette is a plasmid vector having a high copy number, being capable of being propagated in E. coli, and having a selectable marker for maintenance in E. coli.
- 96. The vector according to claim 955, wherein the primary vector can be made single-stranded.
- 97. The vector according to claim 966, further comprising an origin of replication in the vector backbone, preferably an origin of replication for filamentous phages, more preferably the f1 origin of replication.
- 98. The vector according to claim 955, wherein the primary vector is selected from the group comprising pBR322, pUC18, pUC19, pUC118, pUC119, pEMBL, pRSA101, pBluescript.
- 99. The vector according to claim 633, as defined by any of the sequences SEQ ID NO 1 to 3.
- 100. A method of preparing a primary vector comprising
inserting an expressible nucleotide sequence into a cloning site in a primary vector comprising a cassette, the cassette comprising a nucleotide sequence of the general formula in 5′→3′ direction: [RS1-RS2-SP—PR—CS-TR—SP—RS2′-RS1′]wherein RS1 and RS1′ denote restriction sites, RS2 and RS2′ denotes restriction sites different from RS1 and RS1′, SP individually denotes a spacer sequence of at least two nucleotides, PR denotes a promoter, CS denotes a cloning site, TR denotes a terminator.
- 101. The method according to claim 1000, wherein the expressible nucleotide sequences comprise genomic DNA.
- 102. The method according to claim 1000, further comprising
isolating total mRNA from an expression state, and obtaining full length cDNA for insertion into the vector.
- 103. The method according to claim 1022, further comprising selection of cDNA to obtain substantially full length cDNA.
- 104. The method according to any of the preceding claims 1000 to 1033, whereby the insertion into the primary vector comprises directional cloning.
- 105. The method according to any of the preceding claims 1000 to 1044, whereby a substantially full length cDNA population comprises a normalised represenation of cDNA species.
- 106. The method according to any of the preceding claims 1000 to 1055, whereby a substantially full length cDNA population comprises an ormalised representation of cDNA species characteristic of a given expression state.
- 107. A nucleotide library comprising at least two primary vectors each vector comprising a nucleotide sequence cassette of the general formula in 5′→3′ direction:
- 108. The library according to claim 1077, wherein at least three primary vectors comprise an expressible nucleotide sequence coding for the same peptide under the control of three different promoters.
- 109. The library according to claim 1077, wherein at least four primary vectors comprise an expressible nucleotide sequence coding for the same peptide under the control of four different promoters.
- 110. The library according to claim 1077, wherein at least five primary vectors comprise an expressible nucleotide sequence coding for the same peptide under the control of five different promoters, such as wherein at least six primary vectors comprise an expressible nucleotide sequence coding for the same peptide under the control of six different promoters, for example wherein at least seven primary vectors comprise an expressible nucleotide sequence coding for the same peptide under the control of seven different promoters, for example wherein at least eight primary vectors comprise an expressible nucleotide sequence coding for the same peptide under the control of eight different promoters, such as wherein at least nine primary vectors comprise an expressible nucleotide sequence coding for the same peptide under the control of nine different promoters, for example wherein at least ten primary vectors comprise an expressible nucleotide sequence coding for the same peptide under the control of ten different promoters.
- 111. The library according to any of the preceding claims 1077 to 1100, wherein the expressible nucleotide sequence coding for the same peptide comprises essentially the same nucleotide sequence, more preferably the same nucleotide sequence.
- 112. The library according to any of the preceding claims 1077 to 1111, being maintained in a host cell capable of maintaining the vectors comprising the cassettes substantially unaltered.
- 113. The library according to claim 1122, wherein the host cell is selected from the group comprising bacteria such as E. coli or Bacillus subtilis, or fungi such as yeast.
- 114. The library according to any of claims 1077 to 1133, wherein the promoters are not functional in the library host.
- 115. The library according to any of the preceding claims 1077 to 1144, wherein RS2 and RS2′ are identical.
- 116. The library according to claim 1155, wherein at least two vectors comprise the same RS2 and RS2′ sequence.
- 117. The library according to claim 1166, wherein substantially all vectors comprise the same RS2 and RS2′ sequence.
- 118. The library according to any of the preceding claims 1077 to 1177, comprising at least one primary vector according to claims 633 to 988.
- 119. A method for preparing a nucleotide library comprising obtaining expressible nucleotide sequences, cloning the expressible nucleotide sequences into cloning sites of a mixture of primary vectors, the primary vectors comprising a cassette, the cassettes comprising a nucleotide sequence of the general formula in 5′→3′ direction:
- 120. The method according to claim 11919, whereby the expressible nucleotide sequences comprises cDNA, and/or genomic DNA.
- 121. The method according to claim 11919, whereby the expressible nucleotide sequences are obtained from a cDNA library.
- 122. The method according to any of the claims 11919 to 1211, wherein the expressible nucleotide sequences are representative of an expression state.
Priority Claims (1)
Number |
Date |
Country |
Kind |
PA 2001 00127 |
Jan 2001 |
DK |
|
Parent Case Info
[0001] This application is a nonprovisional of U.S. provisional application Serial No. 60/301,022 filed 27. Jun. 2001, which is hereby incorporated by reference in its entirety. The application claims priority from Danish patent application number PA 2001 00127 filed 25. Jan. 2001, which is hereby incorporated by reference in its entirety. All patent and nonpatent references cited in the application, or in the present application, are also hereby incorporated by reference in their entirety.
PCT Information
Filing Document |
Filing Date |
Country |
Kind |
PCT/DK02/00055 |
1/25/2002 |
WO |
|