NON-VIRAL DELIVERY COMPOSITIONS AND SCREENING METHODS

Abstract
The invention relates to barcoded nucleic acid nanostructure delivery compositions for in vivo screening for subsequent use in vivo therapeutic delivery, and methods therefor. More particularly, the invention relates to nucleic acid nanostructure delivery compositions, such as DNA origami structures, associated with barcodes for high throughput in vivo screening of the nucleic acid nanostructure delivery compositions for subsequent use in drug delivery, and methods therefor.
Description
INCORPORATION BY REFERENCES OF MATERIAL SUBMITTED ELECTRONICALLY

Incorporated by reference in its entirety is a computer-readable nucleotide/amino acid sequence listing submitted concurrently herewith and identified as follows: 1,055 kilobytes xml file named “920006-391462_SL.xml,” created on Sep. 7, 2023.


FIELD OF THE DISCLOSURE

The invention relates to barcoded nucleic acid nanostructure delivery compositions for in vivo screening for subsequent use in vivo therapeutic delivery, and methods therefor. More particularly, the invention relates to nucleic acid nanostructure delivery compositions, such as DNA origami structures, associated with barcodes for high throughput in vivo screening of the nucleic acid nanostructure delivery compositions for subsequent use in drug delivery, and methods therefor.


BACKGROUND AND SUMMARY

Genetic medicines (including gene therapy, gene silencing, splicing regulators, and nuclease based gene editors) are poised to produce revolutionary treatments, including vaccines, infectious disease treatments, antimicrobial treatments, antiviral treatments, and most notably, genetic disease treatments. However, the in vivo delivery of these genetic medicine payloads to the specific tissues and cells that need to be treated, while avoiding tissues and cells that can reduce the efficacy or safety of the genetic medicine, poses a significant challenge. Additional challenges include the ability to deliver large genetic payloads or multiple payloads. Adeno-associated viruses (AAVs) are the most widely used tool for genetic medicine delivery, but AAVs are not able to deliver large genetic payloads or multiple payloads (such as the clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 system), and they sometimes trigger unwanted immune responses, including the generation of anti-AAV antibodies, a cell mediated response. Some of the immune responses caused by AAV in patients are potentially fatal immune responses.


Therapeutics based on the CRISPR/Cas9 system have an exceptional potential to treat a number of genetic diseases due to the capability of this system for precise and programmable gene editing. Gene editing and repair using the CRISPR/Cas9 system has two main mechanisms, including non-homologous end joining (NHEJ) which repairs the site of cut by inducing random indel mutation, and homology-directed repair (HDR), which repairs the cut site based on a pre-existing template. Because a pre-designed template can be used for HDR-directed repair, therapies based on this mechanism can be tailored to cure a large number of different genetic diseases. However, the main challenge is that HDR repair requires the delivery of CRISPR/Cas9, small guide RNA (sgRNA) and a donor DNA strand at the same time to a particular location. This requirement becomes particularly limiting for in vivo applications because ensuring co-delivery of multiple large molecules to the same targeted location is currently not feasible. For example, the Cas9 enzyme sequence and guide RNA complex is too large to fit into AAVs.


There is a need for effective non-viral delivery systems, not only for genetic delivery systems, but also for delivery of small molecule therapeutics. The current state-of-the-art non-viral gene delivery systems, such as liposomes, have many drawbacks such as poor biocompatibility and the inability to easily engineer or functionalize them. Additional concerns are that such non-viral gene delivery systems are easily degraded by various enzymes as they pass through intracellular or intercellular compartments, and these systems have not been able to package multiple large payloads.


The inventors have designed nucleice acid nanostructure delivery compositions (e.g., DNA origami nanostructures). These compositions have the advantage of being biocompatible, non-toxic, and can be programmed in many ways. For example, the nucleice acid nanostructure delivery compositions can be programmed to have functional groups that enable them to evade early degradation, that enable them to evade immune responses, and that enable intracellular imaging and targeted and controlled delivery of therapeutic genes and small molecule therapeutics. Thus, these non-viral delivery compositions can enhance the stability, safety, and/or efficacy of payloads by providing immune evasion, tissue-directed intracellular delivery, and the ability to deliver large genetic payloads or multiple payloads, or other genetic medicine payloads, or small molecule therapeutics.


The rate limiting step for development of such drug delivery vehicles is in vivo testing of the drug delivery vehicles leading to slow or un-optimized final products and a lack of new drug delivery vehicle candidates. The inventors have demonstrated the utility of a nucleic acid nanostructure delivery composition (e.g., a DNA origami nanostructure composition) for in vivo delivery, and the inventors have also developed novel methods for labeling the nucleic acid nanostructure delivery compositions with unique barcodes, administering them to an animal, and then extracting them from animal tissues for detection. This method will allow for in vivo high throughput screening of a diverse set of drug delivery nanoparticles, including DNA origami structures, for use in the delivery of large genetic payloads, multiple payloads, other genetic medicine payloads, or small molecule therapeutics.


The following clauses, and combinations thereof, provide various additional illustrative aspects of the invention described herein. The various embodiments described in any other section of this patent application, including the section titled “DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS” and the “EXAMPLES” are applicable to any of the following embodiments of the invention described in the numbered clauses below.

    • 1. A composition comprising a non-viral delivery vehicle comprising a nucleic acid nanostructure delivery composition, and a nucleic acid barcode construct.
    • 2. The composition of clause 1, wherein the nucleic acid nanostructure delivery composition comprises a DNA origami composition.
    • 3. The composition of clause 1, wherein the nucleic acid nanostructure delivery composition comprises single-stranded or double-stranded DNA or RNA.
    • 4. The composition of any one of clauses 1 to 3, wherein the nucleic acid barcode construct is associated with the nucleic acid nanostructure delivery composition via base-pairing.
    • 5. The composition of clause 4, wherein the base-pairing occurs between a sequence of a single-stranded overhang on the nucleic acid nanostructure delivery composition and a complementary sequence appended to the nucleic acid barcode construct.
    • 6. The composition of any one of clauses 1 to 3, wherein the nucleic acid nanostructure delivery composition comprises staples that self-assemble to form the nucleic acid nanostructure delivery composition.
    • 7. The composition of clause 6, wherein the staples act as the nucleic acid barcode construct.
    • 8. The composition of any one of clauses 1 to 3, wherein the nucleic acid barcode construct is associated with the nucleic acid nanostructure delivery composition by a high affinity, non-covalent bond interaction between a biotin molecule on the 5′ and/or the 3′ end of the nucleic acid barcode construct and a molecule that binds to biotin on the nucleic acid nanostructure delivery composition.
    • 9. The composition of clause 8, wherein the molecule that binds to biotin is bound to the nucleic acid nanostructure delivery composition by a covalent phosphoramidate bond formed via an EDC-NHS coupling reaction between a terminal phosphate group of a 5′ end of an overhang on the nucleic acid nanostructure delivery composition and an amine group on the molecule that binds to biotin.
    • 10. The composition of clause 8 or 9, wherein the biotin is bound to the nucleic acid barcode construct by a covalent bond.
    • 11. The composition of any one of clauses 1 to 3, wherein the nucleic acid barcode construct is bound to the nucleic acid nanostructure delivery composition by a covalent bond.
    • 12. The composition of clause 11, wherein the covalent bond is formed via an EDC-NHS coupling reaction between a terminal phosphate group of the 5′ end of an overhang on the nucleic acid nanostructure delivery composition and an amine group on an amino terminal nucleotide of the nucleic acid barcode construct.
    • 13. The composition of clause 11, wherein the covalent bond is formed via a click chemistry coupling reaction between an azide group on the nucleic acid nanostructure delivery composition and an alkyne group on the nucleic acid barcode construct.
    • 14. The composition of clause 11, wherein the covalent bond is formed via a click chemistry coupling reaction between an azide group on the nucleic acid barcode construct and an alkyne group on the nucleic acid nanostructure delivery composition.
    • 15. The composition of any one of clauses 1 to 3, wherein the nucleic acid barcode construct is associated with the nucleic acid nanostructure delivery composition by a covalent bond between a carboxy terminated molecule on the nucleic acid nanostructure delivery composition and a primary amine on the nucleic acid barcode construct at the 5′ and/or the 3′ end.
    • 16. The composition of any one of the preceding clauses, wherein the nucleic acid barcode construct comprises two primer binding segments and one or more unique barcode sequences between the two primer binding segments.
    • 17. The composition of clause 16, wherein the primer binding segments range in length from about 15 base pairs to about 30 base pairs.
    • 18. The composition of clause 16 or 17, wherein the primer binding segments are a universal primer binding set.
    • 19. The composition of any one of clauses 16 to 18, wherein the one or more unique barcode sequences comprise unique sequences of about 6 to about 20 nucleotides in length.
    • 20. The composition of any one of clauses 16 to 19, wherein the length of the unique barcode sequences is two times or more greater than the length of the primer binding segments.
    • 21. The composition of any one of clauses 1 to 20, wherein the nucleic acid barcode construct comprises DNA or RNA.
    • 22. The composition of any one of clauses 16 to 21, wherein the unique barcode sequences further comprise a hamming distance of at least 2 to 6 bases between any two unique barcode sequences.
    • 23. The composition of any one of the preceding clauses, wherein the nucleic acid barcode construct further comprises from about 6 to about 12 random bases at the 3′ end of the unique barcode sequences.
    • 24. The composition of clause 23, wherein the about 6 to about 12 random bases at the 3′ end of the unique barcode sequences are for bioinformatic removal of PCR duplicates.
    • 25. The composition of any one of the preceding clauses, wherein the nucleic acid barcode construct ranges in length from about 42 nucleotides to about 210 nucleotides.
    • 26. A method of in vivo screening for a desired nucleic acid nanostructure delivery composition, the method comprising (a) preparing a library comprising two or more types of nucleic acid nanostructure delivery compositions, wherein each nucleic acid nanostructure delivery composition is associated with a nucleic acid barcode construct comprising a different unique barcode sequence, (b) administering the library to an animal, (c) removing cells or tissues from the animal, (d) isolating the nucleic acid barcode constructs from the cells or the tissues of the animal, (e) detecting the nucleic acid barcode constructs in the cells or the tissues of the animal, and (f) identifying the desired nucleic acid nanostructure delivery composition for use as a delivery vehicle.
    • 27. The method of clause 26, wherein the nucleic acid nanostructure delivery composition is associated with the nucleic acid barcode construct according to any one of clauses 4 to 15.
    • 28. The method of clause 26 or 27, wherein the nucleic acid barcode construct is detected by a method selected from the group consisting of the polymerase chain reaction (PCR), isothermal amplification, sequencing, or a combination thereof, to obtain nucleotide sequence data.
    • 29. The method of any one of clauses 26 to 28, wherein the nucleic acid nanostructure delivery composition is loaded with a payload.
    • 30. The method of clause 29, wherein the payload is a luminescent molecule.
    • 31. The method of clause 30, wherein the luminescence is used to track the biodistribution or cell uptake of the nucleic acid nanostructure delivery composition via imaging.
    • 32. The method of any one of clauses 26 to 31, wherein the administration to the animal is via an intramuscular, an intravenous, an intraperitoneal, an oral, or a pulmonary route.
    • 33. The method of any one of clauses 26 to 32, wherein the nucleic acid barcode construct is isolated from the cells and the tissues by mixing with a first organic compound and incubating the organic phase with an aqueous phase of the cell or tissue sample, separating the organic phase from the aqueous phase, mixing the organic phase with a second organic compound, incubating the mixture, precipitating the nucleic acid barcode construct from the mixture, removing the organic phase by evaporation, and resuspending the nucleic acid barcode construct in an aqueous composition.
    • 34. The method of clause 33, wherein the organic phase comprises phenol chloroform.
    • 35. The method of clause 26, wherein the nucleic acid barcode construct is separated from cationic material in the cells or tissues by titrating the aqueous composition of the nucleic acid barcode construct to a pH of greater than 7.4.
    • 36. The method of clause 26, wherein the nucleic acid barcode construct is separated from material in the cells or tissues by binding the nucleic acid barcode construct with a molecule with a binding affinity to the nucleic acid barcode construct greater than the binding affinity to the cell or tissue material.
    • 37. The method of clause 26, wherein the nucleic acid barcode construct is separated from material in the cells or tissues via size exclusion chromatography.
    • 38. The method of clause 26, wherein the nucleic acid barcode construct is separated from material in the cells or tissues via dialysis or diafiltration.
    • 39. The method of clause 26, wherein the nucleic acid barcode construct is separated from material in the cells or tissues via filtration.
    • 40. The method of clause 26, wherein the nucleic acid barcode construct is separated from material in the cells or tissues by digesting proteins using an enzyme.
    • 41. The method of clause 40, wherein the enzyme is Proteinase K.
    • 42. The method of clause 26, wherein the nucleic acid barcode constructs associated with the nucleic acid nanostructure delivery composition are detected by first diluting the isolated nucleic acid barcode constructs by a factor of at least 1000 times, and then amplifying the nucleic acid barcode constructs by PCR using primers.
    • 43. The method of clause 42, wherein the primers from the PCR step are enzymatically digested prior to detection of amplicons.
    • 44. The method of clause 28, wherein the nucleotide sequence data is converted to fast Q files, and the fast Q files are mapped to known unique polynucleotide sequences and the unique polynucleotide sequences are enumerated.
    • 45. The method of any one of clauses 26 to 44, wherein the nucleic acid barcode construct of any one clauses 16 to 25 is used.
    • 46. A composition comprising a non-viral delivery vehicle comprising a nucleic acid nanostructure delivery composition and a payload wherein the nucleic acid nanostructure delivery composition comprises single-stranded or double-stranded DNA or RNA.
    • 47. The composition of clause 46, wherein the nucleic acid nanostructure delivery composition comprises DNA.
    • 48. The composition of clause 46, wherein the nucleic acid nanostructure delivery composition comprises RNA.
    • 49. The composition of clause 46, wherein the nucleic acid nanostructure delivery composition is single-stranded.
    • 50. The composition of clause 46, wherein the nucleic acid nanostructure delivery composition is double-stranded.
    • 51. The composition of any one of clauses 46 to 50, wherein the payload comprises nucleic acids.
    • 52. The composition of clause 51, wherein the nucleic acids comprise DNA or RNA.
    • 53. The composition of clause 51, wherein the payload nucleic acids are used for homology directed repair or as transposable elements.
    • 54. The composition of clause 51, wherein the payload nucleic acids comprise a short guide RNA (sgRNA) and a donor DNA strand.
    • 55. The composition of clause 54, wherein the sgRNA is used for targeting an enzyme to a specific genomic sequence.
    • 56. The composition of any one of clauses 46 to 50, wherein the payload comprises a CRISPR associated enzyme.
    • 57. The composition of clause 55, wherein the targeted enzyme is a CRISPR associated enzyme.
    • 58. The composition of clause 51, wherein the payload comprises a CRISPR associated enzyme, an sgRNA, and a donor DNA strand.
    • 59. The composition of any one of clauses 46 to 50, wherein the payload comprises CRISPR/Cas9.
    • 60. The composition of clause 51, wherein the payloads comprise CRISPR/Cas9, an sgRNA, and a donor DNA strand.
    • 61. The composition of any one of clauses 46 to 50, wherein the payload comprises CRISPR/Cas9 and Cas9 is fused with a deaminase.
    • 62. The composition of clause 51, wherein the payloads comprise a coding sequence for Cas9, an sgRNA, and a donor DNA strand in the form of a plasmid.
    • 63. The composition of clause 51, wherein the payloads consist of one molecule each of CRISPR/Cas9, an sgRNA, and a donor DNA strand.
    • 64. The composition of clause 51, wherein the payloads comprise an antisense oligonucleotide.
    • 65. The composition of clause 51, wherein the payload is of a size selected from the group consisting of 3 kB or more, 3.5 kB or more, 4 kB or more, 4.5 kB or more, 5 kB or more, 5.5 kB or more, 6 kB or more, 6.5 kB or more, 7 kB or more, 7.5 kB or more, 8 kB or more, and 8.5 kB or more.
    • 66. The composition of any one of clauses 46 to 51, wherein the nucleic acid nanostrucuture delivery composition comprises one or more oligonucleotides with overhangs that bind through complementary base paring with the payload nucleic acids.
    • 67. The composition of any one of clauses 46 to 65, wherein the payload is associated with the nucleic acid nanostructure delivery composition by a high affinity, non-covalent bond interaction between a biotin molecule on the payload and a molecule that binds to biotin on the nucleic acid nanostructure delivery composition.
    • 68. The composition of clause 67, wherein molecule that binds to biotin is bound to the nucleic acid nanostructure delivery composition by a covalent phosphoramidate bond formed via an EDC-NHS coupling reaction between a terminal phosphate group of a 5′ end of an overhang on the nucleic acid nanostructure delivery composition and an amine group on the molecule that binds to biotin.
    • 69. The composition of clause 67 or 68, wherein the biotin is bound to the payload by a covalent bond.
    • 70. The composition of any one of clauses 46 to 65, wherein the payload is bound to the nucleic acid nanostructure delivery composition by a covalent bond.
    • 71. The composition of clause 70, wherein the covalent bond is formed via an EDC-NHS coupling reaction between a terminal phosphate group of the 5′ end of an overhang on the nucleic acid nanostructure delivery composition and an amine group on the payload.
    • 72. The composition of clause 70, wherein the covalent bond is formed via a click chemistry coupling reaction between an azide group on the nucleic acid nanostructure delivery composition and an alkyne group on the payload.
    • 73. The composition of clause 70, wherein the covalent bond is formed via a click chemistry coupling reaction between an azide group on the payload and an alkyne group on the nucleic acid nanostructure delivery composition.
    • 74. The composition of any one of clauses 46 to 65, wherein the payload is associated with the nucleic acid nanostructure delivery composition by a covalent bond between a carboxy terminated molecule on the nucleic acid nanostructure delivery composition and an amine on the payload.
    • 75. The composition of any one of clauses 46 to 74, wherein the nucleic acid nanostructure delivery composition has an aspect ratio of about 2.
    • 76. The composition of any one of clauses 1 to 25 or 46 to 75, wherein the nucleic acid nanostructure delivery composition is coated with one or more polymers.
    • 77. The composition of any one of clauses 1 to 25 or 46 to 76, wherein the nucleic acid nanostructure delivery composition further comprises a targeting component for targeting to cells.
    • 78. A method of treating a patient with a disease, the method comprising administering to the patient the nucleic acid nanostructure delivery composition identified in the in vivo screening method of any one of clauses 26 to 45 or the nucleic acid nanostructure delivery composition of any one of clauses 46 to 77, wherein the nucleic acid nanostructure delivery composition comprises a payload, and treating the disease in the patient.
    • 79. The method of clause 78, further comprising administering a pharmaceutically acceptable carrier to the patient.
    • 80. The method of clause 79, wherein the pharmaceutically acceptable carrier is for parenteral administration or topical administration.
    • 81. The method of clause 78, wherein the patient has a disease or a disorder selected from the group consisting of cancer, a muscular disorder, a pulmonary disorder, a skin disorder, a neurological disease, neurofibromatosis 1, and a hemoglobinopathy.
    • 82 The method of clause 81, wherein the cancer is selected from the group consisting of lung cancer, bone cancer, pancreatic cancer, skin cancer, uterine cancer, ovarian cancer, endometrial cancer, rectal cancer, stomach cancer, colon cancer, breast cancer, cancer of the esophagus, cancer of the endocrine system, prostate cancer, leukemia, lymphoma, mesothelioma, cancer of the bladder, cancer of the kidney, neoplasms of the central nervous system, brain cancer, and adenocarcinoma.
    • 83. The method of clause 81, wherein the skin disorder is a Staphlococcus aureus infection.
    • 84. The method of clause 81, wherein the muscular disorder is muscular dystrophy.
    • 85. The method of clause 78, wherein the nucleic acid nanostructure delivery composition is not cytotoxic to the cells of the patient.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 TEM image of a DNA barcoded origami structure with 1% PTA negative stain at 100,000 × magnification. The white rectangular structures are the DNA origami structures.



FIG. 2 shows the gel electrophoresis image of the PCR amplification products for the various test articles and controls. The presence of the white bands indicates the presence of PCR amplicons consistent with amplification of the DNA barcodes.



FIG. 3 shows PCR amplification of DNAO with and without barcodes at various transfection concentrations.





DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS

The invention relates to barcoded nucleic acid nanostructure delivery compositions for in vivo screening for subsequent use in vivo therapeutic delivery, and methods therefor. More particularly, the invention relates to nucleic acid nanostructure delivery compositions, such as DNA origami structures, associated with barcodes for high throughput in vivo screening of the nucleic acid nanostructure delivery compositions for subsequent use of the nucleic acid nanostructure delivery compositions in drug delivery, and methods therefor.


The invention also relates to nucleic acid nanostructure delivery compositions for non-viral delivery, and methods therefor. More particularly, the invention relates to single-stranded or double-stranded DNA or RNA nanostructure delivery compositions, such as DNA origami compositions, for the delivery of more than one payload, a nucleic acid construct payload of 3 kB or more, other genetic medicine payloads, or small molecule therapeutics.


The following clauses, and combinations thereof, provide various additional illustrative aspects of the invention described herein. The various embodiments described in any other section of this patent application, including the summary portion of the section titled “BACKGROUND AND SUMMARY”, the “EXAMPLES”, and this “DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS” section of the application are applicable to any of the following embodiments of the invention described in the numbered clauses below.

    • 1. A composition comprising a non-viral delivery vehicle comprising a nucleic acid nanostructure delivery composition, and a nucleic acid barcode construct.
    • 2. The composition of clause 1, wherein the nucleic acid nanostructure delivery composition comprises a DNA origami composition.
    • 3. The composition of clause 1, wherein the nucleic acid nanostructure delivery composition comprises single-stranded or double-stranded DNA or RNA.
    • 4. The composition of any one of clauses 1 to 3, wherein the nucleic acid barcode construct is associated with the nucleic acid nanostructure delivery composition via base-pairing.
    • 5. The composition of clause 4, wherein the base-pairing occurs between a sequence of a single-stranded overhang on the nucleic acid nanostructure delivery composition and a complementary sequence appended to the nucleic acid barcode construct.
    • 6. The composition of any one of clauses 1 to 3, wherein the nucleic acid nanostructure delivery composition comprises staples that self-assemble to form the nucleic acid nanostructure delivery composition.
    • 7. The composition of clause 6, wherein the staples act as the nucleic acid barcode construct.
    • 8. The composition of any one of clauses 1 to 3, wherein the nucleic acid barcode construct is associated with the nucleic acid nanostructure delivery composition by a high affinity, non-covalent bond interaction between a biotin molecule on the 5′ and/or the 3′ end of the nucleic acid barcode construct and a molecule that binds to biotin on the nucleic acid nanostructure delivery composition.
    • 9. The composition of clause 8, wherein the molecule that binds to biotin is bound to the nucleic acid nanostructure delivery composition by a covalent phosphoramidate bond formed via an EDC-NHS coupling reaction between a terminal phosphate group of a 5′ end of an overhang on the nucleic acid nanostructure delivery composition and an amine group on the molecule that binds to biotin.
    • 10. The composition of clause 8 or 9, wherein the biotin is bound to the nucleic acid barcode construct by a covalent bond.
    • 11. The composition of any one of clauses 1 to 3, wherein the nucleic acid barcode construct is bound to the nucleic acid nanostructure delivery composition by a covalent bond.
    • 12. The composition of clause 11, wherein the covalent bond is formed via an EDC-NHS coupling reaction between a terminal phosphate group of the 5′ end of an overhang on the nucleic acid nanostructure delivery composition and an amine group on an amino terminal nucleotide of the nucleic acid barcode construct.
    • 13. The composition of clause 11, wherein the covalent bond is formed via a click chemistry coupling reaction between an azide group on the nucleic acid nanostructure delivery composition and an alkyne group on the nucleic acid barcode construct.
    • 14. The composition of clause 11, wherein the covalent bond is formed via a click chemistry coupling reaction between an azide group on the nucleic acid barcode construct and an alkyne group on the nucleic acid nanostructure delivery composition.
    • 15. The composition of any one of clauses 1 to 3, wherein the nucleic acid barcode construct is associated with the nucleic acid nanostructure delivery composition by a covalent bond between a carboxy terminated molecule on the nucleic acid nanostructure delivery composition and a primary amine on the nucleic acid barcode construct at the 5′ and/or the 3′ end.
    • 16. The composition of any one of the preceding clauses, wherein the nucleic acid barcode construct comprises two primer binding segments and one or more unique barcode sequences between the two primer binding segments.
    • 17. The composition of clause 16, wherein the primer binding segments range in length from about 15 base pairs to about 30 base pairs.
    • 18. The composition of clause 16 or 17, wherein the primer binding segments are a universal primer binding set.
    • 19. The composition of any one of clauses 16 to 18, wherein the one or more unique barcode sequences comprise unique sequences of about 6 to about 20 nucleotides in length.
    • 20. The composition of any one of clauses 16 to 19, wherein the length of the unique barcode sequences is two times or more greater than the length of the primer binding segments.
    • 21. The composition of any one of clauses 1 to 20, wherein the nucleic acid barcode construct comprises DNA or RNA.
    • 22. The composition of any one of clauses 16 to 21, wherein the unique barcode sequences further comprise a hamming distance of at least 2 to 6 bases between any two unique barcode sequences.
    • 23. The composition of any one of the preceding clauses, wherein the nucleic acid barcode construct further comprises from about 6 to about 12 random bases at the 3′ end of the unique barcode sequences.
    • 24. The composition of clause 23, wherein the about 6 to about 12 random bases at the 3′ end of the unique barcode sequences are for bioinformatic removal of PCR duplicates.
    • 25. The composition of any one of the preceding clauses, wherein the nucleic acid barcode construct ranges in length from about 42 nucleotides to about 210 nucleotides.
    • 26. A method of in vivo screening for a desired nucleic acid nanostructure delivery composition, the method comprising (a) preparing a library comprising two or more types of nucleic acid nanostructure delivery compositions, wherein each nucleic acid nanostructure delivery composition is associated with a nucleic acid barcode construct comprising a different unique barcode sequence, (b) administering the library to an animal, (c) removing cells or tissues from the animal, (d) isolating the nucleic acid barcode constructs from the cells or the tissues of the animal, (e) detecting the nucleic acid barcode constructs in the cells or the tissues of the animal, and (f) identifying the desired nucleic acid nanostructure delivery composition for use as a delivery vehicle.
    • 27. The method of clause 26, wherein the nucleic acid nanostructure delivery composition is associated with the nucleic acid barcode construct according to any one of clauses 4 to 15.
    • 28. The method of clause 26 or 27, wherein the nucleic acid barcode construct is detected by a method selected from the group consisting of the polymerase chain reaction (PCR), isothermal amplification, sequencing, or a combination thereof, to obtain nucleotide sequence data.
    • 29. The method of any one of clauses 26 to 28, wherein the nucleic acid nanostructure delivery composition is loaded with a payload.
    • 30. The method of clause 29, wherein the payload is a luminescent molecule.
    • 31. The method of clause 30, wherein the luminescence is used to track the biodistribution or cell uptake of the nucleic acid nanostructure delivery composition via imaging.
    • 32. The method of any one of clauses 26 to 31, wherein the administration to the animal is via an intramuscular, an intravenous, an intraperitoneal, an oral, or a pulmonary route.
    • 33. The method of any one of clauses 26 to 32, wherein the nucleic acid barcode construct is isolated from the cells and the tissues by mixing with a first organic compound and incubating the organic phase with an aqueous phase of the cell or tissue sample, separating the organic phase from the aqueous phase, mixing the organic phase with a second organic compound, incubating the mixture, precipitating the nucleic acid barcode construct from the mixture, removing the organic phase by evaporation, and resuspending the nucleic acid barcode construct in an aqueous composition.
    • 34. The method of clause 33, wherein the organic phase comprises phenol chloroform.
    • 35. The method of clause 26, wherein the nucleic acid barcode construct is separated from cationic material in the cells or tissues by titrating the aqueous composition of the nucleic acid barcode construct to a pH of greater than 7.4.
    • 36. The method of clause 26, wherein the nucleic acid barcode construct is separated from material in the cells or tissues by binding the nucleic acid barcode construct with a molecule with a binding affinity to the nucleic acid barcode construct greater than the binding affinity to the cell or tissue material.
    • 37. The method of clause 26, wherein the nucleic acid barcode construct is separated from material in the cells or tissues via size exclusion chromatography.
    • 38. The method of clause 26, wherein the nucleic acid barcode construct is separated from material in the cells or tissues via dialysis or diafiltration.
    • 39. The method of clause 26, wherein the nucleic acid barcode construct is separated from material in the cells or tissues via filtration.
    • 40. The method of clause 26, wherein the nucleic acid barcode construct is separated from material in the cells or tissues by digesting proteins using an enzyme.
    • 41. The method of clause 40, wherein the enzyme is Proteinase K.
    • 42. The method of clause 26, wherein the nucleic acid barcode constructs associated with the nucleic acid nanostructure delivery composition are detected by first diluting the isolated nucleic acid barcode constructs by a factor of at least 1000 times, and then amplifying the nucleic acid barcode constructs by PCR using primers.
    • 43. The method of clause 42, wherein the primers from the PCR step are enzymatically digested prior to detection of amplicons.
    • 44. The method of clause 28, wherein the nucleotide sequence data is converted to fast Q files, and the fast Q files are mapped to known unique polynucleotide sequences and the unique polynucleotide sequences are enumerated.
    • 45. The method of any one of clauses 26 to 44, wherein the nucleic acid barcode construct of any one clauses 16 to 25 is used.
    • 46. A composition comprising a non-viral delivery vehicle comprising a nucleic acid nanostructure delivery composition and a payload wherein the nucleic acid nanostructure delivery composition comprises single-stranded or double-stranded DNA or RNA.
    • 47. The composition of clause 46, wherein the nucleic acid nanostructure delivery composition comprises DNA.
    • 48. The composition of clause 46, wherein the nucleic acid nanostructure delivery composition comprises RNA.
    • 49. The composition of clause 46, wherein the nucleic acid nanostructure delivery composition is single-stranded.
    • 50. The composition of clause 46, wherein the nucleic acid nanostructure delivery composition is double-stranded.
    • 51. The composition of any one of clauses 46 to 50, wherein the payload comprises nucleic acids.
    • 52. The composition of clause 51, wherein the nucleic acids comprise DNA or RNA.
    • 53. The composition of clause 51, wherein the payload nucleic acids are used for homology directed repair or as transposable elements.
    • 54. The composition of clause 51, wherein the payload nucleic acids comprise a short guide RNA (sgRNA) and a donor DNA strand.
    • 55. The composition of clause 54, wherein the sgRNA is used for targeting an enzyme to a specific genomic sequence.
    • 56. The composition of any one of clauses 46 to 50, wherein the payload comprises a CRISPR associated enzyme.
    • 57. The composition of clause 55, wherein the targeted enzyme is a CRISPR associated enzyme.
    • 58. The composition of clause 51, wherein the payload comprises a CRISPR associated enzyme, an sgRNA, and a donor DNA strand.
    • 59. The composition of any one of clauses 46 to 50, wherein the payload comprises CRISPR/Cas9.
    • 60. The composition of clause 51, wherein the payloads comprise CRISPR/Cas9, an sgRNA, and a donor DNA strand.
    • 61. The composition of any one of clauses 46 to 50, wherein the payload comprises CRISPR/Cas9 and Cas9 is fused with a deaminase.
    • 62. The composition of clause 51, wherein the payloads comprise a coding sequence for Cas9, an sgRNA, and a donor DNA strand in the form of a plasmid.
    • 63. The composition of clause 51, wherein the payloads consist of one molecule each of CRISPR/Cas9, an sgRNA, and a donor DNA strand.
    • 64. The composition of clause 51, wherein the payloads comprise an antisense oligonucleotide.
    • 65. The composition of clause 51, wherein the payload is of a size selected from the group consisting of 3 kB or more, 3.5 kB or more, 4 kB or more, 4.5 kB or more, 5 kB or more, 5.5 kB or more, 6 kB or more, 6.5 kB or more, 7 kB or more, 7.5 kB or more, 8 kB or more, and 8.5 kB or more.
    • 66. The composition of any one of clauses 46 to 51, wherein the nucleic acid nanostrucuture delivery composition comprises one or more oligonucleotides with overhangs that bind through complementary base paring with the payload nucleic acids.
    • 67. The composition of any one of clauses 46 to 65, wherein the payload is associated with the nucleic acid nanostructure delivery composition by a high affinity, non-covalent bond interaction between a biotin molecule on the payload and a molecule that binds to biotin on the nucleic acid nanostructure delivery composition.
    • 68. The composition of clause 67, wherein molecule that binds to biotin is bound to the nucleic acid nanostructure delivery composition by a covalent phosphoramidate bond formed via an EDC-NHS coupling reaction between a terminal phosphate group of a 5′ end of an overhang on the nucleic acid nanostructure delivery composition and an amine group on the molecule that binds to biotin.
    • 69. The composition of clause 67 or 68, wherein the biotin is bound to the payload by a covalent bond.
    • 70. The composition of any one of clauses 46 to 65, wherein the payload is bound to the nucleic acid nanostructure delivery composition by a covalent bond.
    • 71. The composition of clause 70, wherein the covalent bond is formed via an EDC-NHS coupling reaction between a terminal phosphate group of the 5′ end of an overhang on the nucleic acid nanostructure delivery composition and an amine group on the payload.
    • 72. The composition of clause 70, wherein the covalent bond is formed via a click chemistry coupling reaction between an azide group on the nucleic acid nanostructure delivery composition and an alkyne group on the payload.
    • 73. The composition of clause 70, wherein the covalent bond is formed via a click chemistry coupling reaction between an azide group on the payload and an alkyne group on the nucleic acid nanostructure delivery composition.
    • 74. The composition of any one of clauses 46 to 65, wherein the payload is associated with the nucleic acid nanostructure delivery composition by a covalent bond between a carboxy terminated molecule on the nucleic acid nanostructure delivery composition and an amine on the payload.
    • 75. The composition of any one of clauses 46 to 74, wherein the nucleic acid nanostructure delivery composition has an aspect ratio of about 2.
    • 76. The composition of any one of clauses 1 to 25 or 46 to 75, wherein the nucleic acid nanostructure delivery composition is coated with one or more polymers.
    • 77. The composition of any one of clauses 1 to 25 or 46 to 76, wherein the nucleic acid nanostructure delivery composition further comprises a targeting component for targeting to cells.
    • 78. A method of treating a patient with a disease, the method comprising administering to the patient the nucleic acid nanostructure delivery composition identified in the in vivo screening method of any one of clauses 26 to 45 or the nucleic acid nanostructure delivery composition of any one of clauses 46 to 77, wherein the nucleic acid nanostructure delivery composition comprises a payload, and treating the disease in the patient.
    • 79. The method of clause 78, further comprising administering a pharmaceutically acceptable carrier to the patient.
    • 80. The method of clause 79, wherein the pharmaceutically acceptable carrier is for parenteral administration or topical administration.
    • 81. The method of clause 78, wherein the patient has a disease or a disorder selected from the group consisting of cancer, a muscular disorder, a pulmonary disorder, a skin disorder, a neurological disease, neurofibromatosis 1, and a hemoglobinopathy.
    • 82 The method of clause 81, wherein the cancer is selected from the group consisting of lung cancer, bone cancer, pancreatic cancer, skin cancer, uterine cancer, ovarian cancer, endometrial cancer, rectal cancer, stomach cancer, colon cancer, breast cancer, cancer of the esophagus, cancer of the endocrine system, prostate cancer, leukemia, lymphoma, mesothelioma, cancer of the bladder, cancer of the kidney, neoplasms of the central nervous system, brain cancer, and adenocarcinoma.
    • 83. The method of clause 81, wherein the skin disorder is a Staphlococcus aureus infection.
    • 84. The method of clause 81, wherein the muscular disorder is muscular dystrophy.
    • 85. The method of clause 78, wherein the nucleic acid nanostructure delivery composition is not cytotoxic to the cells of the patient.


In various embodiments, the nucleic acid nanostructure delivery compositions described herein may comprise any non-viral composition for in vivo delivery of the payloads. By way of example, the nucleic acid nanostructure delivery compositions described herein may be selected from the group comprising synthetic virus-like particles, carbon nanotubes, emulsions, and any nucleic acid nanostructure delivery composition, such as DNA origami structures.


In these embodiments, the nucleic acid nanostructure delivery compositions have a high degree of tunability in structure and function, opportunities to protect payloads from adverse reactions or degradation by the immune system, and cell targeting via surface charge, particle size, or conjugation with various aptamers. These delivery systems also lend themselves to computer aided design, and they have suitable pathways to robust, commercial scale manufacturing processes with higher yields and fewer purification steps than viral manufacturing processes.


A nucleic acid nanostructure delivery composition (e.g., a DNA origami structure), as a delivery platform, is programmable and offers an opportunity for precise scale-up and manufacturing. In this embodiment, the biologic and non-viral nature of the nucleic acid nanostructure delivery composition reduces the chance of adverse immune reactions. In this embodiment, control of each nucleotide that forms a part of the nucleic acid nanostructure delivery composition (e.g., DNA origami nanostructure) allows for the precise design and modification of the structure, including suitable chemical moieties which can make in vivo delivery and endosomal escape possible. In other embodiments, the nucleic acid nanostructure delivery composition can comprise RNA. In various embodiments, the nucleic acid nanostructure delivery composition can be single-stranded or double-stranded, and can comprise DNA or RNA.


In this embodiment, the nucleic acid nanostructure delivery composition can undergo self-base pairing (i.e., a DNA origami structure) to fold into structures that can form the single-stranded or double-stranded scaffold that can encapsulate a payload.


In this embodiment, the nucleic acid nanostructure delivery composition can comprise overhangs that bind through complementary base paring with payload nucleic acids or with the nucleic acid barcode constructs described herein. In this embodiment, the overhangs can be located within a cavity within the nucleic acid nanostructure delivery composition scaffold, and the cavity can be covered by a lid and a hinge allowing the payloads or the nucleic acid barcode constructs to be completely enclosed within the cavity when the lid is shut. In this embodiment, the lid can further comprise oligonucleotide strands that bind through complementary base pairing with other oligonucleotide strands attached to the nucleic acid nanostructure delivery composition scaffold when the lid is in the closed position. DNA nanostructure delivery compositions (e.g., DNA origami structures) are described in U.S. Pat. No. 9,765,341, incorporated herein by reference.


As used herein, the term “complementary base pairing” refers to the ability of purine and pyrimidine nucleotide sequences to associate through hydrogen bonding to form double-stranded nucleic acid molecules. Guanine and cytosine, adenine and thymine, and adenine and uracil are complementary and can associate through hydrogen bonding resulting in the formation of double-stranded nucleic acid molecules when two nucleic acid molecules have “complementary” sequences. The complementary sequences can be DNA or RNA sequences. The complementary DNA or RNA sequences are referred to as a “complement.”


In one aspect, the nucleic acid nanostructure delivery composition of the invention can comprise more than one payload for delivery to target cells, or a nucleic acid payload of 3 kB or more, or another genetic payload, or a small molecule therapeutic for delivery to target cells. In these embodiments, the nucleic acid payload can have a size of 3 kB or more and can be DNA or RNA. In any of the nucleic acid nanostructure delivery composition embodiments described herein, the nucleic acid nanostructure can comprise M13 bacteriophage DNA.


In one illustrative embodiment, the nucleic acid nanostructure delivery composition further comprises a targeting component for targeting to cells. In one aspect, the targeting component can be a nucleotide that is an RNA that forms a ‘stem-and-loop’ structure. In this aspect, the nucleic acid nanostructure delivery composition can be designed so that the polynucleotide strands fold into three-dimensional structures via a series of highly tuned ‘stem-and-loop’ configurations. In this embodiment, the nucleic acid nanostructure delivery composition can have a high affinity for protein receptors expressed on specific cells resulting in targeting of the nucleic acid nanostructure delivery composition and the payload to the specific cells. In this embodiment, the polynucleotide that binds to the target cell receptor can bind in conjunction with a peptide aptamer. In another aspect, the nucleic acid nanostructure delivery composition can be folded so that, in the presence of certain biomarkers such as cell receptors, microRNA, DNA, RNA or an antigen, the self-base pairs are disrupted and the nucleic acid nanostructure delivery composition can unfold, resulting in the triggered release of the payload only in the presence of the specific biomarker. For example, a lock-and-key mechanism for triggered opening of a nucleic acid nanostructure delivery composition (e.g., a DNA origami construct) has been demonstrated previously (Andersen, et al., Nature, Vol. 459, pages 73-76(2009), incorporated by reference herein). In these embodiments, the use of the nucleic acid nanostructure delivery composition to create three-dimensional structures that target cells and tissues allows for more efficient delivery of payloads with fewer side effects, since the nucleic acid nanostructure delivery composition can have low immunogenicity, and the payload will be released only in the presence of RNA or peptide biomarkers, for example, that exist in the cytosol of target cells and tissues.


In another embodiment, a cell-targeting peptide can be conjugated to a charge neutral peptide nucleic acid, PNA, oligonucleotide instead of a DNA oligonucleotide. PNAs are synthetic polymers of repeating peptide-like amide units (N-(2-aminoethyl) glycine) that mimic nucleic acids in their hybridization affinity and specificity via base-pairing. Their uncharged backbones lead to higher binding affinity with DNA than DNA:DNA so these molecules are suitable for binding to proteins and peptides.


In the embodiment where a nucleic acid nanostructure delivery composition is used, computer aided design tools can predict the nucleotide sequence necessary to produce highly engineered nucleic acid nanostructure delivery compositions. For gene delivery, these nucleic acid nanostructure delivery compositions offer the advantages of encapsulation efficiency, as the size and shape of the structure can be tailored to fit the cargo. In another aspect, loading efficiency can be increased by incorporating payloads into the encapsulating nucleic acid nanostructure delivery composition itself.


In another illustrative embodiment, any of the nucleic acid nanostructure delivery compositions described herein can be coated with one or more polymers to protect the compositions from immune responses or to enhance endosomal escape. In one embodiment, the one or more polymers comprise polyethylene glycol. In another embodiment, the one or more polymers comprise polyethylene glycol poly-L-lysine. In yet another embodiment, the one or more polymers comprise polyethylenimine. In an additional embodiment, the one or more polymers comprise polyethylene glycol poly-L-lysine and polyethylenimine.


In various embodiments, payloads may be combined with the nucleic acid nanostructure delivery compositions using any or all of covalent bonds, electrostatic interactions, and ligand affinity interactions. In one aspect, covalent bonding methods include the use of EDC/NHS to form stable amide bonds between the payload and the nucleic acid nanostructure delivery compositions for improved stability (both “on the shelf” and in vivo), ease of separation and extraction, and sensitive detection. In another illustrative aspect, electrostatic bonding methods include the use of cationic nucleic acid nanostructure delivery compositions that electrostatically complex with the payload. In another embodiment, ligand affinity bonding includes the use of ligands such as avidin and biotin, both covalently bonded to the nucleic acid nanostructure delivery compositions and the payload via EDC/NHS chemistry to yield the stable combination of the payload and the nucleic acid nanostructure delivery compositions. In another embodiment, methods for bonding the payload, including the nucleic acid barcode construct, to the nucleic acid nanostructure delivery composition are provided, including the use of cleavable linkers that can reverse the bond with high specificity, such as the inclusion of nuclease specific oligonucleic acid sequences, allowing the payload, including the nucleic acid barcode construct, to be cleaved and extracted as desired. In another embodiment, cleavable linker and enzyme pairs include amide bonds and amidase enzymes.


In one embodiment, a composition comprising a non-viral delivery vehicle comprising a nucleic acid nanostructure delivery composition, and a nucleic acid barcode construct is provided. In embodiments where the nucleic acid nanostructure delivery composition is barcoded, the nucleic acid barcode construct can be associated with the nucleic acid nanostructure delivery composition via base-pairing. In this embodiment, the base-pairing can occur between a sequence of a single-stranded overhang on the nucleic acid nanostructure delivery composition and a complementary sequence appended to the nucleic acid barcode construct. In other embodiments, the nucleic acid nanostructure delivery composition can comprise staples that self-assemble to form the nucleic acid nanostructure delivery composition, and exemplary stapes are described in Example 1.


In other embodiments, the nucleic acid barcode construct can be associated with the nucleic acid nanostructure delivery composition by a high affinity, non-covalent bond interaction between a biotin molecule on the 5′ and/or the 3′ end of the nucleic acid barcode construct and a molecule that binds to biotin on the nucleic acid nanostructure delivery composition. In this embodiment, the molecule that binds to biotin can be bound to the nucleic acid nanostructure delivery composition by a covalent phosphoramidate bond formed via an EDC-NHS coupling reaction between a terminal phosphate group of a 5′ end of an overhang on the nucleic acid nanostructure delivery composition and an amine group on the molecule that binds to biotin. In this embodiment, the biotin can be bound to the nucleic acid barcode construct by a covalent bond.


In another illustrative embodiment, the nucleic acid barcode construct can be bound to the nucleic acid nanostructure delivery composition by a covalent bond. In this embodiment, the covalent bond can be formed via an EDC-NHS coupling reaction between a terminal phosphate group of the 5′ end of an overhang on the nucleic acid nanostructure delivery composition and an amine group on an amino terminal nucleotide of the nucleic acid barcode construct. In another embodiment, the covalent bond can be formed via a click chemistry coupling reaction between an azide group on the nucleic acid nanostructure delivery composition and an alkyne group on the nucleic acid barcode construct. In yet another embodiment, the covalent bond can be formed via a click chemistry coupling reaction between an azide group on the nucleic acid barcode construct and an alkyne group on the nucleic acid nanostructure delivery composition. In still another embodiment, the nucleic acid barcode construct can be associated with the nucleic acid nanostructure delivery composition by a covalent bond between a carboxy terminated molecule on the nucleic acid nanostructure delivery composition and a primary amine on the nucleic acid barcode construct at the 5′ and/or the 3′ end.


In one aspect, the nucleic acid barcode construct can comprise a polynucleotide barcode and the barcode comprises a unique sequence not present in any known genome for identification of the polynucleotide barcode. In another embodiment, a set of different nucleic acid barcode constructs with different polynucleotide barcodes (e.g., 88 or 96 different polynucleotide barcodes) can be used to allow for multiplexing of samples on one sequencing run.


In various embodiments, the barcodes can be from about 5 to about 100 base pairs in length, from about 5 to about 90 base pairs in length, from about 5 to about 80 base pairs in length, from about 5 to about 70 base pairs in length, from about 5 to about 60 base pairs in length, from about 5 to about 50 base pairs in length, from about 5 to about 40 base pairs in length, from about 5 to about 35 base pairs in length, about 5 to about 34 base pairs in length, about 5 to about 33 base pairs in length, about 5 to about 32 base pairs in length, about 5 to about 31 base pairs in length, about 5 to about 30 base pairs in length, about 5 to about 29 base pairs in length, about 5 to about 28 base pairs in length, about 5 to about 27 base pairs in length, about 5 to about 26 base pairs in length, about 5 to about 25 base pairs in length, about 5 to about 24 base pairs in length, about 5 to about 23 base pairs in length, about 5 to about 22 base pairs in length, about 5 to about 21 base pairs in length, about 5 to about 20 base pairs in length, about 5 to about 19 base pairs in length, about 5 to about 18 base pairs in length, about 5 to about 17 base pairs in length, about 5 to about 16 base pairs in length, about 5 to about 15 base pairs in length, about 5 to 14 base pairs in length, about 5 to 13 base pairs in length, about 5 to 12 base pairs in length, about 5 to 11 base pairs in length, about 5 to 10 base pairs in length, about 5 to 9 base pairs in length, about 5 to 8 base pairs in length, about 6 to 10 base pairs in length, about 7 to 10 base pairs in length, about 8 to 10 base pairs in length, or about 6 to about 20 base pairs in length.


Various embodiments of barcodes are shown below in Table 1 (labeled “Polynucleotide Barcodes”). These barcodes can be used in the nucleic acid barcode constructs alone or in combinations of, for example, two or more barcodes, three or more barcodes, four or more barcodes, etc. In the embodiment where more than one barcode is used, the hamming distance between the barcodes can be about 2 to about 6 nucleotides, or any suitable number of nucleotides can form a hamming distance, or no nucleotides are present between the polynucleotide barcodes.














TABLE 1






SEQ

SEQ

SEQ


Polynucleotide
ID
Polynucleotide
ID
Polynucleotide
ID


Barcodes
NO:
Barcodes
NO:
Barcodes
NO:




















GCTACATAAT
1
AGCAGTCCCG
342
CAAAATAGCG
683





ATGTTACACA
2
TTTGGGCTGT
343
GAAGAAGAAG
684





TGGGGCCCAA
3
CTCACGATCT
344
CACCCGCACG
685





TAGTTTATCC
4
TGGCGCATAC
345
ACGATGCCCG
686





ACCCCGTCTT
5
GCAATTGAAA
346
CCTACTACAC
687





CCGGCCATCA
6
TCGGGAGACG
347
ATTGAAACAA
688





GAGCTTGCTC
7
CCCGGCGAAA
348
GACCGAAGAT
689





ACGTTCTATA
8
TGATGCGGAA
349
ACGGCCTGAA
690





TACAGCAAAA
9
AACTGAGGCG
350
AGGGGAGGTC
691





GTTAGGTGGT
10
CATATTATTT
351
CAATCAACTT
692





GGAGACCGAC
11
AAAAGTCATT
352
GGACAACCGA
693





TGGCCCCTTG
12
AAGCGGTGAG
353
TCCCTAAGGC
694





TGGCCGTAAG
13
AAGGTAATCA
354
GTTCTACACG
695





CGTTCGTCAA
14
CTGACACTTA
355
ACTAACCAGT
696





CGGACGTGGA
15
CTGTTTTCTA
356
GAAGCTGGAT
697





AGAGGGGGCA
16
CACATGGCAG
357
GGAACCATGG
698





GTTCAGGTCG
17
TTCAATCCGG
358
CTCTACCTGG
699





CTCGCAAGAG
18
TGTCCGGCAT
359
TAATGCCTGC
700





GCAACGACTT
19
TGGTACCGTG
360
TAAAGGCAAT
701





GCCATCCATC
20
AAGAGATATT
361
CGCCTGGGAA
702





TTCCGAGCAG
21
GATGTACTAC
362
TCTTGGGGAA
703





CTTCTGGACA
22
GAAATGGAAT
363
AGAGAGAGAG
704





AACATTAGAC
23
TTAAAATACT
364
GCGTTGGCGC
705





AAGCAATAGT
24
TGACCGGAAC
365
TTACGACAGA
706





AGGGTAAGAC
25
GTCGCCGCAA
366
GGAACTCTTA
707





CGTTGTCTTG
26
TAGGATACCG
367
GATTGTGGAG
708





TTTCCCCGCC
27
AGTCCAATTG
368
GGGCACTGAT
709





CGAATGGATC
28
GGGGGCTATA
369
AGACGCACCA
710





CATCACTTGC
29
ACCTTCAGTT
370
CCAATTATAA
711





CTCTCGCACT
30
ATGGCAAGTA
371
TAGAGACGCA
712





GTTCACGTGC
31
AGAATGTTTT
372
CCTCTTGTCG
713





AATAAGCCTG
32
AGTTCGTTTG
373
GAGGAAGCTC
714





GTTAACAATT
33
CACTACTGAC
374
AGTCCCGAGT
715





ATTCAGATCC
34
GATCAAGAGC
375
TGCTTGCAGT
716





CCTGCTGATT
35
ATTTATCGAG
376
CCCACTTCCC
717





CTTGGTCATA
36
CCTTTTTCCA
377
CGTTGCCGCG
718





TCTTCCTGTT
37
GCACAGAGGT
378
CCCCTGGTTC
719





ACTGCCATGG
38
TGATCTGAAT
379
ACGACCAATA
720





CATGTATAGT
39
GTTGGAGGGA
380
CTTAGGGTTC
721





GGTAGCGGCA
40
TTTTGAAGGT
381
AAACATATCA
722





TCACTCTAAC
41
TAAGTCCTAA
382
GGGTCGTAGA
723





AAGGTGCACC
42
GGTGTTAGGG
383
CTCCGTAGCG
724





AATGCTCGTT
43
TGTATGCACC
384
CTGGTCATAA
725





TGTCTAGAAA
44
CCGTGCCATT
385
TTGACAGATC
726





CTGCCTGCCT
45
GAAATCACCC
386
GAGTAAAGTC
727





ACTATAAAAG
46
TTTGCACGTG
387
ATATGGGCTT
728





TAGTATCGAG
47
CGTCTGTTTT
388
TACAACTACT
729





ATCGCAGTCC
48
CTACACCACA
389
AATTCAGCCG
730





TCATCAGAAC
49
TGCTACAGGG
390
GATTGTACTA
731





TCCTAGACGC
50
GGGAATATAT
391
TCGTAATGCG
732





GCCGGGCGGG
51
TCATGTATTT
392
CGATAACTGC
733





GCCCAGAAGA
52
TCTCCGTTTA
393
AACTTGGCGG
734





CTTAGAGCTG
53
TACCTCTCGC
394
CGTGGATGTA
735





GTCTGCGCTT
54
GCTTCAACCG
395
CCTTCCCGAA
736





CGCCGTCCTT
55
ATGAAGCTAC
396
CTAAACCCGT
737





TTTATCTGCT
56
CGGTACAACT
397
CAACATTCCC
738





TGCTTCGGAG
57
GTGTGGTCGT
398
CTTACCCTCT
739





GGGGAGAATG
58
GGGGTCATGT
399
GGAAAGTTCT
740





GTGGTAAGTG
59
AGGCAGCCCA
400
CGGATTGGCT
741





GAAATTAGTA
60
CAAGCACGAT
401
AATGTAGGGC
742





GCTATCCTAA
61
TCAAATGGAT
402
AATGAATCGC
743





ATCTGTACGA
62
GGACTGAATA
403
ATCATACACC
744





AGTTCGGGGC
63
CCGTAGACGT
404
AGTTGGGCAG
745





CGAGTCTGTC
64
CGGCGTACCG
405
AGAAGAAGGG
746





ATCCTACGCA
65
GGCGGCGCCC
406
GCGTGCGCTA
747





ATGGTGGATA
66
AGACTTGATC
407
CCCCGATAAA
748





CCTCTAACTA
67
ACCTTGCACA
408
TACCAAGTGC
749





ATAGCTGCAC
68
TAAGGTGAGT
409
TGTGTTTTCG
750





GACAGAATTT
69
TTGTTGTTTC
410
CCCAGATGTC
751





CAATTGGCAT
70
GAGGGAATAC
411
GCGAGCTTCC
752





TCTAGTAGAC
71
CTCGTACGCG
412
GTGTCACGTA
753





TTATTCATGG
72
CCGCGGTTTA
413
ATAGGCCGAG
754





TTGGCAACCG
73
TTAAAGTTAA
414
GAGCTACCAG
755





CATAATACAT
74
GCATATGGGT
415
CGCGGCGGAG
756





ACAGACTCAC
75
AGTCTGAGCC
416
TCTTGCACGA
757





GCGATGCTGC
76
TGTCGGTTCG
417
TGCCCTAAAG
758





CATCTTTGCC
77
GGTCTCAACC
418
TTGCGCTTTG
759





GTGACTCCAG
78
GTAACGGCAT
419
CATATAAAGG
760





GGACGAGTCT
79
ACACTGAGAA
420
AATAGCGAAT
761





TAGTGGCGTG
80
CCCAACGTCG
421
TACGCTAAGG
762





AACGCAGCTT
81
AAGAAACTGC
422
ACTTAGTTCG
763





AGAACAGGTG
82
ACCAGCCCAC
423
CGTGCGGAAC
764





AGGCTATGTT
83
TGTAGTTACT
424
ACCCGATTCG
765





CCTGGATCTT
84
GGCTAGAGGC
425
TGCAGAGTTT
766





CTAGCCGGCC
85
GTTCGGCAGA
426
GAATCATTAG
767





ACCAGTTATC
86
CCAAAATAGA
427
AGTACACTGG
768





ACGTTATAGC
87
CCCATATAAC
428
TTGTGCGGTT
769





TCGAGTTTGA
88
GTCACTACCG
429
ATGACATGCA
770





TGAAGCGAGC
89
GTAGTGTGGC
430
TTCTCGGACG
771





GACTGGCGAA
90
CAATCTCATA
431
AGATTGAAGA
772





GATGGACCTA
91
CCATGTTATA
432
GGCGGACTGT
773





GTCCACAACG
92
TAAGCAGTGG
433
TTTATGGTAA
774





CCTCCCCAGA
93
TCGGCGGCTA
434
CAGTAGGGTG
775





TTATGACGCC
94
TATTAAATGC
435
GACAGGCAAG
776





CTTGATCCGT
95
GTCGCCATTA
436
GATGTGTCGT
777





AATGCGCAAT
96
GGCGTCGTTC
437
ACTTGACGGA
778





GTACCCCTCA
97
CTAGTAGATA
438
AAGTCCGAAA
779





CGACAGCTCG
98
TCGTCAGTAT
439
TGGGTGTAGG
780





TGACCTGGCT
99
GGGGTATCGG
440
ACTTACCGCG
781





TTCATAGCCC
100
TGCTCTGCCA
441
CTGTGCACCC
782





CCCAAGAGAA
101
TGCCGTAACT
442
ATTGCTCTCT
783





AAACGAAGTA
102
CGGTACAGGC
443
CAGAAGACAA
784





GACGTTTACA
103
TCCTAATTTG
444
TTACGCTATA
785





GATCGATTTG
104
TCTTTCTGGA
445
ACGTGGAAAT
786





CACTGTCACC
105
CCGCGACTTG
446
TGAGGCTGGT
787





TGTGAGAGTT
106
ACCTATAGCG
447
ATTATGAGAT
788





GACGTAACCT
107
GCCGGCACCT
448
GACTTGTAGT
789





CAGACTCTGC
108
TTTGATAGGC
449
TCGCTGAGGA
790





TATGCCAATA
109
ACTGTGAGCT
450
CCCAACTCTA
791





ACAGGTGATG
110
TTATCGTTCA
451
GATAGGGAGG
792





GTCATCGCGT
111
ACTAGTGGCC
452
TAGAAATCAG
793





TCTTATAAAC
112
CCTCCGTGGT
453
GTCGCTAGAA
794





GTGTAGACTG
113
TTAGGGTATG
454
AAAATAGAAA
795





AAACAACCGG
114
GAATCAGGCG
455
GCTCCTGGGT
796





ATCCTGTACC
115
GGCTGACCAA
456
CGCGCTCGCG
797





TTATAAGAAT
116
TGCCAGACCG
457
GGCAAACGCA
798





ATAAGTAGGC
117
TCCCTACGCG
458
TTTACTACCT
799





TCTCGTAAGG
118
TCCGCTGGAG
459
ATCCTAAACT
800





GATCCGCCGC
119
GGATCAAAAC
460
CTCCGTATGT
801





TGTCAGGTTT
120
TTCACCTCAC
461
TATCGTCCAG
802





TCCGAAGCCC
121
GACACACGGC
462
GCCGGCGGTA
803





TCCATGTCCA
122
TGGGCGATTA
463
TGCTCCATTT
804





GTGATGGTAC
123
TAAGATCTTC
464
TGGCTGTTGT
805





CTCCACATAC
124
CTCCGACTAC
465
TACTGCGCAA
806





TTCGGATGAG
125
GGGCCATCAT
466
TATACGGCTT
807





ACGACATCGC
126
TCAGGCCAGA
467
GGTTATTACC
808





GAGATGCACA
127
CTTGTGGGGC
468
ATCAGGAGGA
809





TTTGTATGGC
128
AGATAGTCTG
469
CTATTGCCAG
810





CTTTTCTAGA
129
GCGTCAAAGT
470
ACGTACACAC
811





AGTCTAATCA
130
ACGAAAATTT
471
CAGCCTAGCT
812





GACTTAGCCA
131
GAGTCTGGTG
472
GAAAAACAAC
813





TATCACAGTA
132
ATCGAGCGAC
473
CGTTCAGTTA
814





AAGCTCGAGT
133
GGTCCTCAGA
474
CAATCAGAAT
815





TGTTACGACA
134
TGATTTTGTC
475
GGGCTACTCT
816





AAGGATAGTC
135
GCATTTCTCA
476
CCCCATTGGG
817





GCACTTAGCC
136
GCATGCCAGT
477
TAGGGAACGG
818





GAGGGATCCG
137
ATTAGACGAC
478
CAGCTGATAC
819





ATTCTAGAAG
138
AAAGCCCATA
479
ATTCCTGTGA
820





GATAACTGAT
139
CACTACATTC
480
TCAGAGCCGT
821





ATCTGACTGT
140
CACGGTTTCT
481
CATGAAAAGC
822





CAAAGCGAAC
141
CCCACCAGTG
482
TGACCTGTGA
823





GAAATTGCGA
142
CTCACTTGTC
483
GCATTAGCAG
824





GGGTCCAGTC
143
GATAGACTCT
484
GACAGAACCA
825





ATCAGGTAGC
144
ATTTCCATTT
485
TCCAGTATAT
826





GAAAGGTCCT
145
ATATGTGGCC
486
TGTTCCGCTA
827





GGCTACCACA
146
CGGGACGAAC
487
GATATCCATT
828





TTATTGCTGA
147
AGAACCGTGA
488
CATATGGACC
829





CGCCGCGTTT
148
TAGTGTACTG
489
GATATAGTAA
830





TTTTCAAAAG
149
AACTAATCGA
490
CACCTTTTTT
831





CTGGGCTAAA
150
CGAAGTGACG
491
AGCTTGCGGG
832





CCCGATGAGA
151
CGGAGCCTCG
492
CGCACAGGGA
833





TGGGAAATAT
152
ATCACACGAG
493
TCTGGGTGCT
834





GTACGAGCGG
153
CGACGAGTTC
494
TGAGTCGTTT
835





GCGTGCAGCT
154
GCTTCCCGTG
495
TTACAATGTG
836





AGTCTGCGGA
155
GATTCATACC
496
CTTGCAAACA
837





TAACTATTTA
156
GAGAGAAGCG
497
TGTCGAGCTG
838





GAGTTGCCGG
157
GAAGTGGCCT
498
ACTTTAACCT
839





CAGCCCGGCG
158
GGACGACGCC
499
ATATAAGTGC
840





TCACCTACAT
159
TAGGGTCTCA
500
GGAAGGGCGT
841





AGTGGCTAAC
160
AACTACAGGT
501
TTTGACTTGA
842





AGAATGTGAG
161
GTGGCCTGTG
502
GTATAAACGG
843





TAGTTTCGCA
162
CTTTACCAGC
503
TAACCGGATG
844





CTTCATTTCT
163
CGCGTTACTG
504
TTCTCATCAG
845





GCCATGATAT
164
TTGCTCCCGT
505
CTCGGTTACG
846





ACGGCAAATC
165
CATCAAACAA
506
ATATGGTTCT
847





ATCGATAGTA
166
GCTTTATGAT
507
CGCCCCCGAA
848





CCTAAAGGCA
167
CTGCATACTG
508
ACCTCGATCG
849





TACGAGCGGT
168
GGTGGCTCAG
509
CTCGAATAAT
850





TTTGTCGTCG
169
GGACGATCAA
510
GCCCGAGCTT
851





TACAAGCTTG
170
CCGACTGGTG
511
AACAGTCAAC
852





GACCAACACG
171
GGAACAACCG
512
CTGGAACCTC
853





GAACGACGAA
172
GAACGAGACC
513
AATAACGGGG
854





TCGGAACGCA
173
CACCAAGAAA
514
ACGCCCCACT
855





ATCCGGTGGT
174
ATGCATTACC
515
GGCAACATGA
856





TAAAACGTAG
175
GTATCATGCC
516
GCTATTTCGC
857





TATGTGAGCC
176
AGTAGATGTT
517
TTCCACTTTA
858





GAGGCATCGA
177
CTCTAGATGT
518
GCCGATGGAT
859





GAATGGGTGG
178
GCTACTTGTG
519
AAGTTGGTAA
860





AACGACACAA
179
TATGAAACGT
520
CACTAGCTAG
861





GTACGATGCA
180
CCTCGTTGAT
521
ACATGCCCCT
862





AGAAGGCGCC
181
CTAGAGCCAT
522
TTCATTACTC
863





CCGCAATGGA
182
TAGAGTTATA
523
GGTTTAATAT
864





TACGGATTTT
183
AACGAGAGGC
524
CCTGCAGTGA
865





GTCGTTAGCT
184
GGTCTACCGT
525
TCTTTAAGTT
866





GGACTAGGGC
185
GCCCCCTCAC
526
TGGCGATCGA
867





ATTGGTATTC
186
CATAGGAATT
527
CTTTTTAGCT
868





ATCCCAGAGA
187
TCCGGCTCGT
528
CCCAGTCTCT
869





GTCCCAGCTC
188
TGAGAGTCGG
529
AAATGTTTCG
870





CACGAGGAAT
189
CGTAGAAATA
530
ATATAAGACG
871





TACAATTGCA
190
CTTTACATGA
531
TCACTTTACA
872





ATTCCTGAAT
191
GAGCGCCGTC
532
CCTGGCGCCC
873





TAGCGAGGCG
192
GGCTCTCGGC
533
GGATTACTGG
874





CTGGATGGGC
193
AGAGCTTGTT
534
GAATGATCTT
875





GCGACGGCCA
194
AATCAGCCAC
535
GCTCGGATCG
876





ACCTGCACAA
195
AGAAGAGCCA
536
CAGCTGCGAG
877





CATGACAGAC
196
TCGTATGAGT
537
ACCCTTACTA
878





TTACCAACGT
197
TTCTTCCTCG
538
AGGTGAAACT
879





CAGGTGTGTG
198
ACACAAAAGC
539
CGAATTTGAT
880





CGAGGGACGG
199
CGCGGGACCC
540
CGCTGTGCGG
881





CGTCTCGGTA
200
GTCGCGACAC
541
TTACCGCACC
882





TAAGCTATCT
201
CCGGAGGAAA
542
GGAATCTTAA
883





TACTCCCCTA
202
CGGCGTATGA
543
CTCAACACCC
884





TTATATTCAT
203
TAGGCATTCT
544
CGTGCCCTTG
885





AGCGATCTGC
204
AAAGGAGGGA
545
GCAGGCTCGA
886





TCTTCTGATC
205
ACCTTTACGG
546
ACCAACGAAG
887





ATAGTTCCCA
206
CTACCGTTAA
547
CCTGTAATTT
888





TTTACGGGTG
207
GAGCTTCGCC
548
GGGTGGGATG
889





GTGTCCCCTG
208
GCCATAGAAG
549
TTGCTCACCG
890





GCGGGGGTCG
209
TTTAGCGTAT
550
TTACGACCAC
891





CATTGATCTA
210
GCAAACAGAT
551
TTTTCTAACC
892





AGGGACGGTG
211
TAGGTCATGG
552
GCTTTAGATA
893





CAGTTACTTT
212
CTCTAACAGA
553
CACGTATTGG
894





CCATACTTCC
213
GGCTCATGAA
554
AAATATCTCC
895





ATCAGAATTA
214
CAATGTCTCA
555
GCTGGAAAAC
896





AAACTAGGCA
215
TGATCGTATT
556
GAGCGCATTA
897





AATGTCGTTG
216
GCGCTTTTCA
557
GTGGAGGGGT
898





CACATGGGTC
217
AAGATTATAT
558
TCCACTGGGA
899





GGTCGCTGGT
218
ACTAGCTGAC
559
CAATAGCGGA
900





ACTGTATTAC
219
GGTGAGCTCA
560
CATCTAGTTT
901





CCGAGACGCG
220
CGCTTTCGCT
561
GAAGTTCCGG
902





ACTCCAACCC
221
TGATTCAAAA
562
AGCGAGATTC
903





ATATTACAAG
222
ACTGAACAGG
563
TTAAGGTCGG
904





CCATGGATAG
223
ATTCGAGCTA
564
AATGGTTAGG
905





CCGTCTCAAT
224
TGTAGGCTAA
565
CGTTATTATA
906





GATCGTCGGG
225
ACAAAGCTTT
566
ACGGAAAGGA
907





TCTTGTTTTG
226
GCCCGAGGGA
567
CCTTGTCCCG
908





AATATTGCTC
227
GCCCGCTGGG
568
ATACTTTTTT
909





AACGTCGTCT
228
ACCCCGCTGA
569
CTGGGTCTGG
910





AATATTTTTG
229
CTTATGCCCT
570
AACCATTGCG
911





CGTAACGTGC
230
CCGCCATAGC
571
AGACCGGGCC
912





GCGTGGTTAT
231
CTTAATGATT
572
TGGGACACAC
913





CAAAACATTA
232
CAGTCCACAA
573
TGCGCAGTTG
914





CGTATCCTGA
233
ATGGACGGAC
574
CGTTCGCCTT
915





TCGCTTACAA
234
CGGCCTCTCG
575
TCTCACTCGT
916





TCCATTGTGT
235
TAGTCGCCAT
576
ACACCGACGT
917





GCCCCCATTC
236
GTTGATCTTC
577
TTCAGCCCCT
918





TGACGTCTAT
237
ACTTGCCAAG
578
AGGCGACTAA
919





TGGGCCGAGG
238
ATGACTGGTT
579
TGCTATCAAG
920





AAGTGTCAAG
239
TGTCGTAGGA
580
GTCCAGTAGC
921





GACAGTAGAG
240
AGCAAACACG
581
CGTGTGGGCG
922





CGCAGCCATC
241
TACTGATGAA
582
GTGGTTCTCC
923





GAGGCAGAAC
242
GTATCCCATA
583
GCAGCCGACG
924





GTTGAAATTG
243
TAGCCAGGTT
584
GCTGTCCACG
925





ATCTGATAAA
244
CGTGTGGCGA
585
CGACACTCAT
926





AGCTGTCTCT
245
ATCGAATTGC
586
CATGGCACCT
927





TTTTAGGTTA
246
CCCCAATATT
587
TGTGACGTGT
928





TATCTGTCCG
247
CCCGTTTCTC
588
TTTGGACTAA
929





AAAACATATG
248
TCCGCATCTA
589
TTCATGCCCG
930





GTAAAGAAGA
249
CAAGCCTCAT
590
TTGATCGTGG
931





TCGACGTGCA
250
TTTCAATCCC
591
TAGCATAGGA
932





TAGATCTTAA
251
CCTTCCCATC
592
GTAGTTGCAA
933





CACTGGTCAC
252
AGGTACAAGA
593
GGGACAGCTA
934





ATTCTGATGT
253
GTGTAATGGA
594
AAACCCCCAA
935





ATGGCCCTGA
254
AAACTGAGCT
595
ACTCTCACAA
936





GGTGATGAGA
255
ATCTCTGCCC
596
ATCATTGCCA
937





CACCGTGGGG
256
CGACATTTGC
597
CCAGTTTGCG
938





GCTTGCTCGG
257
TGTGAACCCG
598
ACATTAGTCA
939





CCAGTTGAAC
258
TGACACCCCA
599
CTCCAGGGTA
940





CGTCTGTACC
259
TAGGCCAAAG
600
GAAGGGCCAA
941





CCAACGCGGC
260
GAAATTGTAG
601
CAGTCTCCCC
942





ACGTGATCGA
261
GCGTCTGATT
602
GAGACATTCC
943





CCATCGAATC
262
TCTCATTGTT
603
AACGGTGTTG
944





CGGTGTCTGC
263
CTGACATCTC
604
AGCATTATCA
945





AAACCACCTC
264
GTATCCAGTG
605
CTATACCGAG
946





TCAATGTTCC
265
GATGGCCGTT
606
AACTGGATCA
947





TTCGACATGT
266
TCACCCTCTC
607
GTCTTGTCGG
948





AGGCACGATA
267
GGCACTATTC
608
GACGAGCCGC
949





CACGAGATCA
268
AAATAACTGT
609
GGAACACTGT
950





CATGCTGGGG
269
CAGCTCCATT
610
TAAATGCGTT
951





TACCATGGTT
270
CTCTTGACTC
611
GCGAACACAG
952





TTGCCCATAT
271
TTTCCTATAC
612
TTCTCTCAAC
953





TGCACATTCG
272
CCATACCCGA
613
GTCGTACTGA
954





GTTATGTTGG
273
TCGCCGAGCG
614
TGTGGCGTAA
955





TGAGTTATGA
274
CGCTGAAGCC
615
TGAGCGGCGT
956





GATGGCCCCC
275
TCTGGCCCCA
616
CCTCGTGAAC
957





GATGGGTTAC
276
GCTACATTGA
617
GAGCAATGAA
958





AGCTACGTTG
277
CGCATCATAA
618
CGAGACCTAA
959





ACCCCATGCA
278
GCAAAGGGCC
619
AACTGAGCGC
960





TACTACCGTT
279
AACGGCGCAG
620
TAAAGCTCGT
961





TCGCTTCTAC
280
CGACTGACAT
621
CTCTTTACGT
962





CTGGCAGTGC
281
ATGACAGGGC
622
CCCCGTGGAA
963





TCTATATATA
282
CAAGTTCTCC
623
TCGGTTCGTC
964





GGATTAGTTC
283
TCGCCGCTTT
624
CTGCTTACAC
965





GTGTTACGCT
284
ATGCCGGAAA
625
ACACCGTAAT
966





TCGACTCCGT
285
GCGGTTACTA
626
CCTGGTCGGC
967





GGTAGCAGGC
286
GACATTACAA
627
GGTTATTTGG
968





TATTGGATTC
287
CAGAGAGGGC
628
GCAACTGAGT
969





GTTCGATCGA
288
GCACCGCCTC
629
ATAAGGCCTC
970





ATATTAATAT
289
CGGTCCGAGC
630
CGTGCGAAGG
971





AGAACGATTG
290
TGTCCGGTGC
631
GTCACACACT
972





GTAAAGTGTA
291
GGTCGGTTGC
632
CATACGGCAA
973





CCCATGTGCC
292
GCTCAGCTAA
633
GAACTGCCCA
974





GTGGCCTCGC
293
AGCAGTTCGT
634
AATATGTGAA
975





GACACTAGGA
294
AAATCGATGA
635
CCGATCCTGT
976





ATATTCTGAC
295
GCTCGGTATG
636
CAAAGAGCCT
977





TAAGTAGACG
296
CCCGCCGCGG
637
TAACTTAGAG
978





TAACGGTCTA
297
GTGTGATAGG
638
CAGCATGTAG
979





TAGTTTCATT
298
TTGGACTCCA
639
CCCCATGCAG
980





TTGGATCCGA
299
TGCTTATCTA
640
TCTGAACCAC
981





CGTGACAACC
300
CAAAAGGCGT
641
GCGTGCAAAA
982





CGCGCTCAGA
301
TAGGGGGCCT
642
GCTAGTACCG
983





CGTTCTTAAT
302
AAGTATTAAT
643
TTTCCCGCGC
984





ACAAGAGTTT
303
GTTTAGCCCG
644
CCTTAGTAGG
985





AGGGTTATAG
304
CGCTAATATG
645
TTGTGTCTTG
986





ACCACGACTC
305
ACAACACGTT
646
GCAACGAAGC
987





GTACTCGGGG
306
AGAGATGCTC
647
TGAAACCCTT
988





ACAAATATCT
307
TGCCTGATAT
648
TTCTACGATC
989





GATCGGGGTG
308
CTTGTAAGTA
649
ATTAAAGGTG
990





ATGTAACTCC
309
CATATTGCCG
650
TATCTAACGG
991





ATGAAGAAGC
310
CTTAGAAAGT
651
AGTGCTCCTG
992





ATGTATTGTC
311
ATGTTGTATT
652
CCGTCCCTCT
993





TGCATTGGAA
312
CGCATTGAAG
653
CTAACGAGCG
994





GCGGACGATC
313
TTATGTTGGT
654
AAGTCCGGCT
995





CCGTACTTGA
314
TCGCCTCAGA
655
GGCGTATAAG
996





TTTGCCCCCG
315
TTCGTTGAGG
656
AGATATTAGG
997





ACCTCACGCG
316
GGTGCCGGGC
657
TCCTAACAGC
998





ATTAAGGGGC
317
ACCATTGTAA
658
GAGGATACGC
999





CGTGGACATG
318
TTGATTGTCA
659
CGCTCTTTAA
1000





TTAGCCCTTC
319
CGGCTCACCT
660
ACCGGCAGGC
328





CGAGAGTTTG
320
CTATCACATG
661
GCTAAAATCT
329





TGCATCCTCT
321
GTAGACAGAA
662
GCCGTTGACG
330





TGCGATTCCG
322
CCTTTACCAA
663
GGAGTTGTTG
331





TTATTACGTT
323
GCACATCGAC
664
TACTTGAGAA
332





TGATGTGGTT
324
TCTCACTTTC
665
CGGGTGCGCT
333





GGGCGTCAAT
325
TTCGAGTACT
666
AAAAGCGTCT
334





CCCTTGAAAT
326
TAGAAGAGCA
667
GTAAAGATAG
335





TCTTTGGGGC
327
AACCCCACCA
668
GCCTGGTCAG
336





ACCGGCAGGC
328
CTGTATCAGT
669
GGCAAAAAGG
337





GCTAAAATCT
329
ACATAATGAG
670
ACCCTTCTCT
338





GCCGTTGACG
330
AGCCTTCCGC
671
TCACATAGTG
339





GGAGTTGTTG
331
CAGTGCTTTT
672
TCGTCTGTGC
340





TACTTGAGAA
332
TAGTCCGTGT
673
TGCTCGGATC
341





CGGGTGCGCT
333
CGGAATCGGT
674
GGCGTATAAG
996





AAAAGCGTCT
334
CTTGCGGAGA
675
AGATATTAGG
997





GTAAAGATAG
335
AAAAATTTGG
676
TCCTAACAGC
998





GCCTGGTCAG
336
TGTTTTCCGC
677
GAGGATACGC
999





GGCAAAAAGG
337
ATGCTAGGCG
678
CGCTCTTTAA
1000





ACCCTTCTCT
338
GACTAATTTC
679
GGCGTATAAG
996





TCACATAGTG
339
CTGTAGTAAC
680
AGATATTAGG
997





TCGTCTGTGC
340
CGGATGACTT
681
TCCTAACAGC
998





TGCTCGGATC
341
TCAGAGTGGA
682
GAGGATACGC
999









In another embodiment, a random sequence fragment can be linked to the 5′ and/or the 3′ end of the barcode and the random sequence fragment can, for example, be used for bioinformatic removal of PCR duplicates. The random sequence fragment can also be used to add length to the nucleic acid construct and can serve as a marker for bioinformatic analysis to identify the beginning or the end of the barcode after sequencing. In another embodiment, the nucleic acid barcode construct comprises at least a first and a second random sequence fragment, and the first random sequence fragment can be linked to the 5′ end of the barcode and the second random sequence fragment can be linked to the 3′ end of the barcode. In another embodiment, one or at least one random sequence fragment is linked to the 5′ and/or the 3′ end of the barcode. In one aspect, the random sequence fragments can be extended as needed to make the nucleic acid barcode construct longer for different applications such as whole genome sequencing where short inserts may be lost.


In various embodiments, the random sequence fragments can be from about 5 to about 20 base pairs in length, about 5 to about 19 base pairs in length, about 5 to about 18 base pairs in length, about 5 to about 17 base pairs in length, about 5 to about 16 base pairs in length, about 5 to about 15 base pairs in length, about 5 to about 14 base pairs in length, about 5 to about 13 base pairs in length, about 5 to about 12 base pairs in length, about 5 to about 11 base pairs in length, about 5 to about 10 base pairs in length, about 5 to about 9 base pairs in length, about 5 to about 8 base pairs in length, about 6 to about 10 base pairs in length, about 7 to about 10 base pairs in length, or about 8 to about 10 base pairs in length.


In another illustrative aspect, the barcode may be flanked by primer binding segments (i.e., directly or indirectly linked to the barcode) so that the nucleic acid barcode construct comprising the barcode can be amplified during a polymerase chain reaction (PCR) and/or sequencing protocol. In one aspect, the primer binding segments can be useful for binding to one or more universal primers or a universal primer set. In one illustrative embodiment, the universal primers can contain overhang sequences that enable attachment of index adapters for sequencing. In one embodiment, the adapters can be NGS adapters (e.g. Illumina) positioned internally but towards the end of either the 5′ or the 3′ primer, not as the terminating structure, to avoid the formation of primer dimers. In this aspect, the primers can be any primers of interest. In this embodiment, the first primer binding segment can be linked at its 3′ end to the 5′ end of a first random sequence fragment and the second primer binding segment can be linked at its 5′ end to the 3′ end of a second random sequence fragment with the barcode between the random sequence fragments. In another embodiment, the first primer binding segment can be linked at its 3′ end to the 5′ end of the barcode and the second primer binding segment can be linked at its 5′ end to the 3′ end of a random sequence fragment linked to the 3′ end of the barcode. In another embodiment, the first primer binding segment can be linked at its 3′ end to the 5′ end of a random sequence fragment and the second primer binding segment can be linked at its 5′ end to the 3′ end of the barcode where the barcode is linked at its 5′ end to the 3′ end of the random sequence fragment. In yet another embodiment, the first primer binding segment can be linked at its 3′ end to the 5′ end of the barcode and the second primer binding segment can be linked at its 5′ end to the 3′ end of the barcode.


In embodiments where primer binding segments are included in the nucleic acid barcode construct, the primer binding segments can range in length from about 15 base pairs to about 30, from about 15 base pairs to about 29 base pairs, from about 15 base pairs to about 28 base pairs, from about 15 base pairs to about 26 base pairs, from about 15 base pairs to about 24 base pairs, from about 15 base pairs to about 22 base pairs, from about 15 base pairs to about 20 base pairs, 16 base pairs to about 28 base pairs, from about 16 base pairs to about 26 base pairs, from about 16 base pairs to about 24 base pairs, from about 16 base pairs to about 22 base pairs, from about 16 base pairs to about 20 base pairs, 17 base pairs to about 28 base pairs, from about 17 base pairs to about 26 base pairs, from about 17 base pairs to about 24 base pairs, from about 17 base pairs to about 22 base pairs, from about 17 base pairs to about 20 base pairs, 18 base pairs to about 28 base pairs, from about 18 base pairs to about 26 base pairs, from about 18 base pairs to about 24 base pairs, from about 18 base pairs to about 22 base pairs, or from about 18 base pairs to about 20 base pairs.


An exemplary sequence of a nucleic acid barcode construct is shown below. The /5AmMC6/s a 5′ amine modification for attachment to the nucleic acid nanostructure delivery composition. The *'s are phosphorothioate bond modifications for stability. The A*G*A*CGTGTGCTCTTCCGATCT sequence (SEQ ID NO: 1001) is the 5′ primer binding segment sequence. The GCTACATAAT (SEQ ID NO: 1) is an exemplary barcode sequence. The N's represent the random sequence fragment. The AGATCGGAAGAGCGTCG*T*G*T (SEQ ID NO: 1002) is the 3′ primer binding segment sequence.









(SEQ ID NO: 1003)


/5AmMC6/A*G*A*CGTGTGCTCTTCCGATCTGCTACATAATNNNNNNNN


NNAGATCGGAAGAGCGTCG*T*G*T






In all of the various embodiments described above, the entire nucleic acid barcode construct can range in length from about 30 base pairs to about 350 base pairs, from about 30 base pairs to about 300 base pairs, from about 30 base pairs to about 270 base pairs, about 30 base pairs to about 240 base pairs, about 30 base pairs to about 230 base pairs, about 30 base pairs to about 220 base pairs, about 30 base pairs to about 210 base pairs, about 30 base pairs to about 200 base pairs, about 30 base pairs to about 190 base pairs, about 30 base pairs to about 180 base pairs, about 30 base pairs to about 170 base pairs, about 30 base pairs to about 160 base pairs, about 30 base pairs to about 150 base pairs, about 30 base pairs to about 140 base pairs, about 30 base pairs to about 130 base pairs, about 30 base pairs to about 120 base pairs, from about 30 base pairs to about 110 base pairs, from about 30 base pairs to about 100 base pairs, from about 30 base pairs to about 90 base pairs, from about 30 base pairs to about 80 base pairs, from about 30 base pairs to about 70 base pairs, from about 30 base pairs to about 60 base pairs, from about 30 base pairs to about 50 base pairs, from about 30 base pairs to about 40 base pairs, 40 base pairs to about 120 base pairs, from about 40 base pairs to about 110 base pairs, from about 40 base pairs to about 100 base pairs, from about 40 base pairs to about 90 base pairs, from about 40 base pairs to about 80 base pairs, from about 40 base pairs to about 70 base pairs, from about 40 base pairs to about 60 base pairs, from about 40 base pairs to about 50 base pairs, 50 base pairs to about 120 base pairs, from about 50 base pairs to about 110 base pairs, from about 50 base pairs to about 100 base pairs, from about 50 base pairs to about 90 base pairs, from about 50 base pairs to about 80 base pairs, from about 50 base pairs to about 70 base pairs, from about 50 base pairs to about 60 base pairs, or about 42 base pairs to about 210 base pairs.


In another embodiment, a method of in vivo screening for a desired nucleic acid nanostructure delivery composition is provided. The method comprises (a) preparing a library comprising two or more types of nucleic acid nanostructure delivery compositions, wherein each nucleic acid nanostructure delivery composition is associated with a nucleic acid barcode construct comprising a different unique barcode sequence, (b) administering the library to an animal, (c) removing cells or tissues from the animal, (d) isolating the nucleic acid barcode constructs from the cells or the tissues of the animal, (e) detecting the nucleic acid barcode constructs in the cells or the tissues of the animal, and (f) identifying the desired nucleic acid nanostructure delivery composition for use as a delivery vehicle. In this embodiment, any of the nucleic acid nanostructure delivery compositions can be used and any of the nucleic acid barcode constructs described herein can be used.


In various embodiments, any suitable route for administration of the library of nucleic acid nanostructure delivery compositions associated with nucleic acid barcode constructs for the method of in vivo screening for the nucleic acid nanostructure delivery compositions associated with a nucleic acid barcode construct, or for the method of treatment described below can be used including parenteral administration. Suitable routes for such parenteral administration include intravenous, intraarterial, intraperitoneal, intrathecal, epidural, intracerebroventricular, intraurethral, intrasternal, intracranial, intratumoral, intramuscular and subcutaneous delivery. In one embodiment, means for parenteral administration include needle (including microneedle) injectors, needle-free injectors and infusion techniques. In other embodiments, oral or pulmonary routes of administration can be used.


In one aspect, libraries of nucleic acid nanostructure delivery compositions can be pooled and concentrated before administration to the animal of the nucleic acid barcode constructs associated with the nucleic acid nanostructure delivery compositions. Methods for library preparation and for sequencing are described in Green and Sambrook, “Molecular Cloning: A Laboratory Manual”, 4th Edition, Cold Spring Harbor Laboratory Press, (2012), incorporated herein by reference.


In various embodiments, cell or tissue samples may be analyzed for the presence of the nucleic acid nanostructure delivery compositions associated with the nucleic acid barcode constructs described herein. The samples can be any tissue, cell, or fluid sample from an animal, for example, selected from the group consisting of urine, nasal secretions, nasal washes, inner ear fluids, bronchial lavages, bronchial washes, alveolar lavages, spinal fluid, bone marrow aspirates, sputum, pleural fluids, synovial fluids, pericardial fluids, peritoneal fluids, saliva, tears, gastric secretions, stool, reproductive tract secretions, lymph fluid, whole blood, serum, plasma, or any tissue or cell sample from an animal. Exemplary tissue or cell samples include brain tissue or cells, muscle tissue or cells, skin tissue or cells, heart tissue or cells, kidney tissue or cells, stomach tissue or cells, liver tissue or cells, urinary tract tissue or cells, gastrointestinal tract tissue or cells, head or neck tissue or cells, lung tissue or cells, reproductive tract tissue or cells, pancreatic tissue or cells, or any other tissue or cell type from an animal.


In one illustrative aspect for removing cells or tissues from the animal and isolating the nucleic acid barcode constructs from the cells or tissues of the animal, the nucleic acid barcode constructs are removed from cells or tissues of the animal. In various embodiments, nucleic acid barcode constructs (e.g., DNA or RNA) obtained from the tissues or cells of the animal can be removed by rupturing the cells and isolating the nucleic acid barcode constructs from the lysate. Techniques for rupturing cells and for isolation of nucleic acids are well-known in the art, and removal techniques include homogenization, such as by using a bead-beating technique. In other embodiments, the nucleic acid barcode constructs may be isolated by rupturing cells using a detergent or a solvent, such as phenol-chloroform. In another aspect, the nucleic acid barcode constructs may be separated from the lysate by physical methods including, but not limited to, centrifugation, dialysis, diafiltration, filtration, size exclusion, pressure techniques, digestion of proteins with Proteinase K, or by using a substance with an affinity for nucleic acids such as, for example, beads that bind nucleic acids.


In one illustrative embodiment, the nucleic acid barcode constructs are removed from cells or tissues by treating with a mixture of an organic phase (e.g., phenol chloroform) and an aqueous phase (e.g., water). The organic phase (e.g., phenol chloroform) is isolated and the nucleic acid barcode construct can be precipitated by raising the pH, for example, to pH 7.4. The organic phase (e.g., phenol chloroform) can be evaporated and the nucleic acid barcode constructs can be suspended in water and diluted to appropriate concentrations for PCR and/or sequencing. In one embodiment, the isolated nucleic acid barcode constructs are suspended in either water or a buffer after sufficient washing.


In other embodiments, commercial kits are available for isolation of the nucleic acid barcode constructs, such as Qiagen™, Nuclisensm™, Wizard™ (Promega), QiaAmp 96 DNA Extraction Kit™ and a Qiacube HT™ instrument, and Promegam™. Methods for preparing nucleic acids for PCR and/or sequencing are also described in Green and Sambrook, “Molecular Cloning: A Laboratory Manual”, 4th Edition, Cold Spring Harbor Laboratory Press, (2012), incorporated herein by reference.


The nucleic acid barcode constructs can be detected by using, for example, the polymerase chain reaction (PCR), isothermic amplification, sequencing, and/or imaging. The polymerase chain reaction (PCR) has been developed to analyze nucleic acids in a laboratory. PCR evolved over the last decade into a new generation of devices and methods known as Next Generation Sequencing (NGS). NGS provides faster detection and amplification of nucleic acids at a cheaper price. The NGS devices and methods allow for rapid sequencing as the nucleic acids are amplified in massively parallel, high-throughput platforms.


In one illustrative aspect, the nucleic acid barcode constructs can be sequenced, to detect the polynucleotide barcodes using any suitable sequencing method including Next Generation Sequencing (e.g., using Illumina, ThermoFisher, or PacBio or Oxford Nanopore Technologies sequencing platforms), sequencing by synthesis, pyrosequencing, nanopore sequencing, or modifications or combinations thereof can be used. In one embodiment, the sequencing can be amplicon sequencing. In another embodiment, the sequencing can be whole genome sequencing. In another embodiment, the sequencing can be exome/targeted hybridization sequencing. Methods for sequencing nucleic acids are also well-known in the art and are described in Sambrook et al., “Molecular Cloning: A Laboratory Manual”, Cold Spring Harbor Laboratory Press, incorporated herein by reference.


In another embodiment, a method of treating a patient with a disease is provided. The method comprises administering to the patient the nucleic acid nanostructure delivery composition identified in the in vivo screening method described herein, or administering to the patient any of the nucleic acid nanostructure delivery compositions described herein, wherein the nucleic acid nanostructure delivery compositions comprises a payload, and treating the disease in the patient.


Illustrative payloads for the nucleic acid nanostructure delivery compositions described herein can include any one or a combination of compositions selected from the group comprising: nucleic acids (e.g., DNA or RNA), pDNA, oligodeoxyribonucleic acids (ODNs), dsDNA, ssDNA, antisense oligonucleotides, antisense RNA, siRNA, messenger RNA, guide RNA (e.g., small guide RNA), ribonucleoproteins, donor DNA strands used in the CRISPR/Cas9 system, and enzymes, such as CRISPR-associated enzymes, e.g., Cas9, enzymes used in other gene editing systems, such as ZFNs, custom designed homing endonucleases, TALENS systems, other gene editing endonucleases, and reverse transcriptase.


Other illustrative payloads include DNA constructs such as chimeric antigen receptor (CAR) constructs. CAR-T cells are T cells expressing chimeric antigen receptors (CARs). The CAR is a genetically engineered receptor that is designed to target a specific antigen, for example, a tumor antigen. This targeting can result in cytotoxicity against the tumor, for example, such that CAR-T cells expressing CARs can target and kill tumors via the specific tumor antigens. CARs can comprise a recognition region, e.g., a single chain fragment variable (scFv) region derived from an antibody for recognition and binding to the antigen expressed by the tumor, an activation signaling domain, e.g., the CD3 chain of T cells can serve as a T cell activation signal in CARs, and a co-stimulation domain (e.g., CD137, CD28 or CD134) to achieve prolonged activation of T cells in vivo. In some aspects, CARs are large DNA constructs.


In another embodiment, the payload can be a nucleic acid (e.g., DNA or RNA) with a size selected from the group consisting of 3 kB or more, 3.1 kB or more, 3.2 kB or more, 3.3 kB or more, 3.4 kB or more, 3.5 kB or more, 3.6 kB or more, 3.7 kB or more, 3.8 kB or more, 3.9 kB or more, 4 kB or more, 4.1 kB or more, 4.2 kB or more, 4.3 kB or more, 4.4 kB or more, 4.5 kB or more, 4.6 kB or more, 4.7 kB or more, 4.8 kB or more, 4.9 kB or more, 5 kB or more, 5.1 kB or more, 5.2 kB or more, 5.3 kB or more, 5.4 kB or more, 5.5 kB or more, 5.6 kB or more, 5.7 kB or more, 5.8 kB or more, 5.9 kB or more, 6 kB or more, 6.1 kB or more, 6.2 kB or more, 6.3 kB or more, 6.4 kB or more, 6.5 kB or more, 6.6 kB or more, 6.7 kB or more, 6.8 kB or more, 6.9 kB or more, 7 kB or more, 7.1 kB or more, 7.2 kB or more, 7.3 kB or more, 7.4 kB or more, 7.5 kB or more, 7.6 kB or more, 7.7 kB or more, 7.8 kB or more, 7.9 kB or more, 8 kB or more, 8.1 kB or more, 8.2 kB or more, 8.3 kB or more, 8.4 kB or more, and 8.5 kB or more.


In various embodiments, the payload can be any one or more of the components of the CRISPR RNP system including a CRISPR-associated enzyme (e.g., Cas9), a short guide RNA (sgRNA), and a donor DNA strand. In an embodiment where the payload comprises Cas9, Cas9 can be fused to a deaminase. In yet another embodiment, the payload can comprise an sgRNA used for targeting an enzyme to a specific genomic sequence. In another aspect, the targeted enzyme can be a CRISPR-associated enzyme. In another illustrative aspect, the payload can comprise one molecule each of CRISPR/Cas9, an sgRNA, and a donor DNA strand in the nucleic acid nanostructure delivery compositions described herein. In another embodiment, the payloads can be nucleic acids used for homology directed repair or as transposable elements. In yet another embodiment, the payloads can be any of the payloads described herein in the form of a plasmid construct.


In one aspect, the nucleic acid nanostructure delivery composition described herein can encapsulate a payload that is used for gene editing. In one aspect, the CRISPR/Cas9 system can be the payload and can be used for gene editing. In another embodiment, another gene editing system can be the payload, such as ZFNs, custom designed homing endonucleases, and TALENS systems. In the embodiment where the CRISPR/Cas9 system is the payload, the Cas9 endonuclease is capable of introducing a double strand break into a DNA target sequence. In this aspect, the Cas9 endonuclease is guided by the guide polynucleotide (e.g., sgRNA) to recognize and optionally introduce a double strand break at a specific target site into the genome of a cell. In this illustrative embodiment, the Cas9 endonuclease can unwind the DNA duplex in close proximity to the genomic target site and can cleave both target DNA strands upon recognition of a target sequence by a guide polynucleotide (e.g., sgRNA), but only if the correct protospacer-adjacent motif (PAM) is approximately oriented at the 3′ end of the target. In this embodiment, the donor DNA strand can then be incorporated into the genomic target site. The CRISPR/Cas9 system for gene editing is well-known in the art.


In another illustrative embodiment, the payload may include DNA segments that serve as nuclear localization signals, enhancing nuclear delivery of the nucleic acid nanostructure delivery compositions upon endosomal escape. In another aspect, the payload may include a nucleotide sequence designed to bind as an aptamer to endosomal receptors, enhancing intracellular trafficking of the nucleic acid nanostructure delivery compositions.


In one illustrative aspect, a nucleic acid nanostructure delivery composition (e.g., DNA origami) is provided to package the Cas9 protein, the sgRNA and the single stranded donor DNA strand together in one nanostructure to ensure co-delivery of all the components to a particular location at the same time. In this embodiment, the single stranded nature of the sgRNA and the donor DNA strand can be used to convert these components into constitutive parts of the nucleic acid nanostructure delivery composition (e.g., the DNA origami structure) such that they get delivered together and dissociate at the same time from the DNA nanostructure delivery composition upon reaching the target site (e.g., a target cell). In this embodiment, the DNA nanostructure delivery composition can deliver either a plasmid or the ribonucleoprotein (RNP) form of CRISPR/Cas 9.


In one embodiment, a method for gene therapy is provided. In one aspect, the method comprises administering to a patient a nucleic acid nanostructure delivery composition described herein.


In one embodiment, the nucleic acid nanostructure delivery compositions described herein may be formulated as pharmaceutical compositions for parenteral or topical administration. Such pharmaceutical compositions and processes for making the same are known in the art for both humans and non-human mammals. See, e.g., REMINGTON: THE SCIENCE AND PRACTICE OF PHARMACY, (1995) A. Gennaro, et al., eds., 19th ed., Mack Publishing Co. Additional active ingredients may be included in the compositions.


In one aspect, the nucleic acid nanostructure delivery composition may be administered, for example, directly into the blood stream of a patient, into muscle, into an internal organ, or can be administered in a topical formulation. In various embodiments, suitable routes for such parenteral administration include intravenous, intraarterial, intraperitoneal, intrathecal, epidural, intracerebroventricular, intraurethral, intrasternal, intracranial, intratumoral, intramuscular and subcutaneous delivery. In one embodiment, means for parenteral administration include needle (including microneedle) injectors, needle-free injectors and infusion techniques.


In one illustrative aspect, parenteral formulations are typically aqueous solutions which may contain carriers or excipients such as salts, carbohydrates and buffering agents (preferably at a pH of from 3 to 9), but they may be more suitably formulated as a sterile non-aqueous solution or as a dried form to be used in conjunction with a suitable vehicle such as sterile, pyrogen-free water or sterile saline. The preparation under sterile conditions, by lyophilization to produce a sterile lyophilized powder for a parenteral formulation, may readily be accomplished using standard pharmaceutical techniques well-known to those skilled in the art. In one embodiment, the solubility of the composition used in the preparation of a parenteral formulation may be increased by the use of appropriate formulation techniques, such as the incorporation of solubility-enhancing agents.


In one illustrative embodiment, pharmaceutical compositions for parenteral administration comprise: a) a pharmaceutically active amount of the nucleic acid nanostructure delivery composition; b) a pharmaceutically acceptable pH buffering agent to provide a pH in the range of about pH 4.5 to about pH 9; c) an ionic strength modifying agent in the concentration range of about 0 to about 300 millimolar; and d) water soluble viscosity modifying agent in the concentration range of about 0.25% to about 10% total formula weight or any combinations of a), b), c) and d) are provided.


In various illustrative embodiments, the pH buffering agents for use in the compositions and methods described herein are those agents known to the skilled artisan and include, for example, acetate, borate, carbonate, citrate, and phosphate buffers, as well as hydrochloric acid, sodium hydroxide, magnesium oxide, monopotassium phosphate, bicarbonate, ammonia, carbonic acid, hydrochloric acid, sodium citrate, citric acid, acetic acid, disodium hydrogen phosphate, borax, boric acid, sodium hydroxide, diethyl barbituric acid, and proteins, as well as various biological buffers, for example, TAPS, Bicine, Tris, Tricine, HEPES, TES, MOPS, PIPES, cacodylate, or MES.


In another illustrative embodiment, the ionic strength modulating agents include those agents known in the art, for example, glycerin, propylene glycol, mannitol, glucose, dextrose, sorbitol, sodium chloride, potassium chloride, and other electrolytes.


Useful viscosity modulating agents include but are not limited to, ionic and nonionic water soluble polymers; crosslinked acrylic acid polymers such as the “carbomer” family of polymers, e.g., carboxypolyalkylenes that may be obtained commercially under the Carbopol® trademark; hydrophilic polymers such as polyethylene oxides, polyoxyethylene-polyoxypropylene copolymers, and polyvinylalcohol; cellulosic polymers and cellulosic polymer derivatives such as hydroxypropyl cellulose, hydroxyethyl cellulose, hydroxypropyl methylcellulose, hydroxypropyl methylcellulose phthalate, methyl cellulose, carboxymethyl cellulose, and etherified cellulose; gums such as tragacanth and xanthan gum; sodium alginate; gelatin, hyaluronic acid and salts thereof, chitosans, gellans or any combination thereof. Typically, non-acidic viscosity enhancing agents, such as a neutral or a basic agent are employed in order to facilitate achieving the desired pH of the formulation.


In one embodiment, the solubility of the compositions described herein used in the preparation of a parenteral formulation may be increased by the use of appropriate formulation techniques, such as the incorporation of solubility-enhancing agents.


In other embodiments, the compositions described herein may be administered topically. A variety of dose forms and bases can be applied to the topical preparations, such as an ointment, cream, gel, gel ointment, plaster (e.g. cataplasm, poultice), solution, powders, and the like. These preparations may be prepared by any conventional method with conventional pharmaceutically acceptable carriers or diluents as described below.


For example, vaseline, higher alcohols, beeswax, vegetable oils, polyethylene glycol, etc. can be used. In the preparation of a cream formulation, fats and oils, waxes, higher fatty acids, higher alcohols, fatty acid esters, purified water, emulsifying agents etc. can be used. In the preparation of gel formulations, conventional gelling materials such as polyacrylates (e.g. sodium polyacrylate), hydroxypropyl cellulose, hydroxypropyl methyl cellulose, polyvinyl alcohol, polyvinylpyrrolidone, purified water, lower alcohols, polyhydric alcohols, polyethylene glycol, and the like are used. In the preparation of a gel ointment, an emulsifying agent (preferably nonionic surfactants), an oily substance (e.g. liquid paraffin, triglycerides, and the like), etc. are used in addition to the gelling materials as mentioned above. A plaster such as cataplasm or poultice can be prepared by spreading a gel preparation as mentioned above onto a support (e.g. fabrics, non-woven fabrics). In addition to the above-mentioned ingredients, paraffins, squalane, lanolin, cholesterol esters, higher fatty acid esters, and the like may optionally be used. Moreover, antioxidants such as BHA, BHT, propyl gallate, pyrogallol, tocopherol, etc. may also be incorporated. In addition to the above-mentioned preparations and components, there may optionally be used any other conventional formulations for incorporation with any other additives.


In various embodiments, the dosage of the nucleic acid nanostructure delivery composition can vary significantly depending on the patient condition, or the disease state being treated, the route of administration and tissue distribution, and the possibility of co-usage of other therapeutic treatments. The effective amount to be administered to a patient is based on body surface area, patient weight or mass, and physician assessment of patient condition. In various embodiments, the nucleic acid nanostructure delivery composition can be administered to a patient with a disease or a disorder selected from the group consisting of cancer, a muscular disorder, a pulmonary disorder, a skin disorder, a neurological disease, neurofibromatosis 1 (NF1), and a hemoglobinopathy. In one embodiment, the cancer is selected from the group consisting of lung cancer, bone cancer, pancreatic cancer, skin cancer, uterine cancer, ovarian cancer, endometrial cancer, rectal cancer, stomach cancer, colon cancer, breast cancer, cancer of the esophagus, cancer of the endocrine system, prostate cancer, leukemia, lymphoma, mesothelioma, cancer of the bladder, cancer of the kidney, neoplasms of the central nervous system, brain cancer, and adenocarcinoma. In another embodiment, the skin disorder is a Staphlococcus aureus infection. In yet another embodiment, the muscular disorder is muscular dystrophy (e.g., Duchenne Muscular Dystrophy). In still another embodiment, the nucleic acid nanostructure delivery composition are not cytotoxic to the cells of the patient. In another embodiment, the gene therapy may result in the inactivation of a pathogen (i.e., a microorganism) rather than altering the genome of the patient.


In another embodiment, a method is provided comprising synthesizing a diverse set of non-viral gene delivery compositions, wherein each non-viral gene delivery composition differs from each other non-viral gene delivery composition of the diverse set with respect to at least one of a set of composition characteristics, simultaneously testing one or more quality attributes of each of the non-viral gene delivery composition of the diverse set, and creating from results of the testing, a predictive model that correlates the composition characteristics with the quality attributes. In this embodiment, the composition characteristics can comprise one or more of molecular weight, degree of branching, number of ionizable groups, core-to-corona molecular weight ratio, hydrophilicity, hydrophobicity, propensity for aggregation, size, pKa, logP, and surface charge. In this embodiment, the quality attributes can comprise one or more of cytotoxicity, immunogenicity, transfection efficiency, zeta potential, size, pKa, logP, and loading efficiency. In this embodiment, the diverse set can comprises hundreds or thousands of non-viral gene delivery compositions. In this embodiment, each of the non-viral gene delivery compositions of the diverse set can be a nucleic acid nanostructure delivery composition according to any one of clauses described above. In one embodiment of this method, high-throughput testing, and machine learning data analysis can accelerate the design-build-test-learn (DBTL) cycle for development of CRISPR-based therapeutics.


In some embodiments, the nucleic acid nanostructure delivery composition may be labelled to enhance downstream separation. For example, this may include covalently bonding the nucleic acid nanostructure delivery composition to a magnetic nanoparticle (e.g., superparamagnetic iron oxide), to polyhistidine tags for metal ion chromatography, and/or to fluorescent labels for fluorescent assisted separation (such as with FACS). The labels may be used to track the nucleic acid delivery composition in vivo. Possible “endpoints” include, but are not limited to, quantitative presence in various physiological tissue, post administration, measured via, for example, fluorescence.


In the embodiment where labels are used, the labels allow for rapid in vivo screening of many non-viral delivery vehicle variants with parallel determination of quantitative bio-distribution, and rapid in vitro screening of many variants for stability, cytotoxicity, immunogenicity, and efficacy. This embodiment allows for the construction of a large library of non-viral delivery vehicles that can be drawn from for use as delivery vehicles for genetic medicines, including gene therapies, genetic vaccines, gene editing, gene regulators, and small molecule therapeutics.


In some illustrative embodiments, a large library of similar, but unique nucleic acid nanostructure delivery compositions may be constructed for use in a high-throughput screening process to identify targeting components that bind to specific targets. This rapid screening platform can quickly determine an effective targeting molecule that can be used for targeted delivery to a specific cell or tissue, or for use as a neutralizing molecule for a pathogen.


References in the specification to “one embodiment,” “an embodiment,” “an illustrative embodiment,” etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may or may not necessarily include that particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to effect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly described. Additionally, it should be appreciated that items included in a list in the form of “at least one A, B, and C” can mean (A); (B); (C); (A and B); (A and C); (B and C); or (A, B, and C). Similarly, items listed in the form of “at least one of A, B, or C” can mean (A); (B); (C); (A and B); (A and C); (B and C); or (A, B, and C).


In the drawings, some structural or method features may be shown in specific arrangements and/or orderings. However, it should be appreciated that such specific arrangements and/or orderings may not be required. Rather, in some embodiments, such features may be arranged in a different manner and/or order than shown in the illustrative figures. Additionally, the inclusion of a structural or method feature in a particular figure is not meant to imply that such feature is required in all embodiments and, in some embodiments, may not be included or may be combined with other features.


While certain illustrative embodiments have been described in detail in the drawings and the foregoing description, such an illustration and description is to be considered as exemplary and not restrictive in character, it being understood that only illustrative embodiments have been shown and described and that all changes and modifications that come within the spirit of the disclosure are desired to be protected. There exist a plurality of advantages of the present disclosure arising from the various features of the apparatus, systems, and methods described herein. It will be noted that alternative embodiments of the apparatus, systems, and methods of the present disclosure may not include all of the features described, yet still benefit from at least some of the advantages of such features. Those of ordinary skill in the art may readily devise their own implementations of the apparatus, systems, and methods that incorporate one or more of the features of the present disclosure.


While the concepts of the present disclosure are susceptible to various modifications and alternative forms, specific embodiments thereof have been shown by way of example in the appended drawings and will be described herein in detail. It should be understood, however, that there is no intent to limit the concepts of the present disclosure to the particular forms disclosed, but on the contrary, the intention is to cover all modifications, equivalents, and alternatives consistent with the present disclosure.


EXAMPLE 1
Nucleic Acid Nanostructure Delivery Compositions

The materials and methods below describe an example in which a DNA origami (DNAO) nanostructure is a cuboid structure and the nucleic acid barcode constructs are attached to the DNAO via complementary base pairing of the barcodes with one of the oligonucleotide staples within the DNAO.


Barcode Design

The barcodes used in this example comprise a unique portion comprising 8 to 10 nucleotides in the center of the polynucleotide, the unique portion further characterized by a hamming distance of at least 3 bases from any other barcodes to be pooled. Directly on the 3′ end of the barcode, 7 to 10 random bases are included for bioinformatic removal of PCR duplicates. This central sequence is flanked by universal primer annealing sites containing overhangs for the addition of index adapters during sequencing library preparation. The polynucleotide barcodes in this example were designed with a biotin functional group on the 5′ end.









(SEQ ID NO: 1004)


/5BiosG/A*G*A*CGTGTGCTCTTCCGATCTGAGGGTACTTNNNNNNN


NNNAGATCGGAAGAGCGTCG*T*G*T.






Materials for Nucleic Acid Nanostructure Delivery Composition Synthesis (DNAO)

The DNA origami scaffold is a single stranded DNA (ssDNA) isolated from the M13 bacteriophage. The oligonucleotide staples are short single stranded DNA with sequences described in Table 2. The barcode is a single stranded DNA segment as described under barcode design above.


Preparation of the DNA Barcoded DNAO Structures

A reaction mixture was prepared comprising the DNA scaffold, the oligonucleotide staples and magnesium together in TE buffer in a reaction vessel in the following amounts: 160 uL all oligonucleotide staples (described in Table 2) pooled at a total concentration of 500 nM, 80 uL scaffold (100 nM), 80 uL water, 40 uL of TE buffer (1.46 g EDTA, 3.03 g Tris Base, 1.46 g NaCl, 500 mL water), 40 uL of 200 mM MgCl2. The reaction vessel was placed in a thermocycler with thermal ramp starting at ˜65° C. and descending to 24° C. over the course of ˜67 hours. The product was purified using a precipitation with a PEG purification protocol using a PEG solution made with the following recipe: 75 g PEG8000, 50 mL of the TE buffer described above, and 62.5 mL of 4 M NaCl, brought up to 500 mL with water, yielding a DNAO nanostructure in water. The concentration was measured via nanodrop and then the barcodes were added to the product at a molar ratio of 4:1 (polynucleotide barcodes to DNAO nanostructure). This mixture was incubated at 37° C. for 2 to 3 hours. The product was purified with another PEG purification process as described above, yielding a final product of DNA barcoded DNAO in water. Transmission electron microscopy images were captured on formvar/carbon coated nickel grids with a negative stain with 1% phophotungstic acid (PTA) using an FEI Tecnai G2 Bio Twin TEM (see FIG. 1).


Barcode Amplification

Dilutions of the DNA barcoded DNAO nanostructure described above were prepared at the following concentrations: 13.5, 1.35, 0.135 and 0.0135 nM. A Master Mix was created with: Kapa HiFi 2×Master mix, Reverse barcode primer, forward barcode primer, DMSO, and nuclease free water. Master Mix (15 μL) was loaded into each well. Each of the dilutions of DNA barcoded DNAO was loaded (5 μL) into each well of a 96 well plate. Nuclease free water (5 μL) was loaded into the designated NTC wells. Positive control (5 μL) was loaded into the designated positive control wells where the positive control comprised a solution of polymer nanoparticles in phosphate buffered saline consisting of dimethylaminoethylmethacrylate, polyacylate, and butyl methacrylate, labelled with the same barcode as that used to label the DNAO, as described in US patent application 17/715784. Each well was covered, either with a strip cap or adhesive seal, and centrifuged for approx. 1 min at 1,000×g. Amplification of the barcodes was conducted by incubating in a thermocycler under typical PCR conditions.


Analysis of PCR Amplification Via Gel Electrophoresis

The gel electrophoresis was done on a 4% 12-well Ethidium Bromide gel, using 15 μL of 1 kb E-gel Ladder in the first well. DNA barcoded DNAO dilutions (10 μL) from the multiwell plate above were added to each well of the 12-well gel. E-gel buffer (100 μL) was added to each well. Nuclease free water (15 μL) was added to any remaining empty wells as the no test control (NTC). The gel doe was powered on to run current through the gel for about 20 to 25 mins. or until the sample buffer line reached the end of the gel. The gel was removed from the base and analyzes in a Gel Imager (see FIG. 2).


Results


FIG. 1 shows a transmission electron micrograph of the DNA barcoded DNAO nanostructures. The image shows evidence that the nanostructures were successfully folded into cuboid nanostructures, showing that the DNA origami folding process was successful. The TEM does not offer the resolution to discern the polynucleotide barcodes on the structure, therefore, PCR amplification was used prove their presence as shown in FIG. 2.











TABLE 2







Cuboid Body-1
ACTGAGTGACTTCACATGGAGCGGTATGGTT
(SEQ ID




NO: 1005)





Cuboid Body-2
TACGGCCTCTAAACGACAATCGGCACTCCAA
(SEQ ID




NO: 1006)





Cuboid Body-3
GACCATACCGGAAGCAAACTATAATACTGATATTC
(SEQ ID




NO: 1007)





Cuboid Body-4
CCGTGACCAGGTCATTACCATTGAACGAGAAT
(SEQ ID




NO: 1008)





Cuboid Body-5
ACACCTACGAAAAAACAAAATTAAT
(SEQ ID




NO: 1009)





Cuboid Body-6
ATTCAGAGCATAAATAGCTAT
(SEQ ID




NO: 1010)





Cuboid Body-7
TAACCACAGAAACTTTCATCAGTATCTTTTC
(SEQ ID




NO: 1011)





Cuboid Body-8
GACGGACTCCTATAACCCACAAGAAACAAAGTCCTAATT
(SEQ ID




NO: 1012)





Cuboid Body-9
AGCGAAAAATGCCATCATCTTTGATTGTAAAAGTT
(SEQ ID




NO: 1013)





Cuboid Body-10
GAACCCTAAGGATATTTAATTCCTGACTATAGCGT
(SEQ ID




NO: 1014)





Cuboid Body-11
CCGCGATTAGGACGCAGTTTGACGG
(SEQ ID




NO: 1015)





Cuboid Body-12
TAAGGTTTAGTCATAGTTCATGTAGATACCGAAGGCTTG
(SEQ ID




NO: 1016)





Cuboid Body-13
AATATTTGACCCTGAATCATACAGAAAATCCCCCT
(SEQ ID




NO: 1017)





Cuboid Body-14
CCAATACGCAGACGACGATAAACCAACTAGAAACCGATTATC
(SEQ ID




NO: 1018)





Cuboid Body-15
CAGGCTGAAAGTACAAATTCGTGAAAGC
(SEQ ID




NO: 1019)





Cuboid Body-16
TACATCCAAGACGCGCCCCCTCAGC
(SEQ ID




NO: 1020)





Cuboid Body-17
AATACAGTAGGGCAAGCCGAAACATGGCAACAAAAAGGG
(SEQ ID




NO: 1021)





Cuboid Body-18
ATTAGCGACCAGAGCCACCCT
(SEQ ID




NO: 1022)





Cuboid Body-19
AAGAAATTTTAATTTATCGCGCATT
(SEQ ID




NO: 1023)





Cuboid Body-20
CAGGGAGGAGAAACACAGTAACCTACCAGACAACT
(SEQ ID




NO: 1024)





Cuboid Body-21
CCCACCTCCTTACCGAGATTTTTT
(SEQ ID




NO: 1025)





Cuboid Body-22
CACCAGAGCCGCAGCAGAGGG
(SEQ ID




NO: 1026)





Cuboid Body-23
GGCCAAATCCCAGCGATCGGGTAAAACAATT
(SEQ ID




NO: 1027)





Cuboid Body-24
TGCGGGAACCTAAAACGAATTGATGATG
(SEQ ID




NO: 1028)





Cuboid Body-25
ACCCTGAGTTTCACAAACAAAAGAAAGCCCA
(SEQ ID




NO: 1029)





Cuboid Body-26
GCATTGACAGGACAAATTGTA
(SEQ ID




NO: 1030)





Cuboid Body-27
GAGCGGGATCCTGAAAAGTGT
(SEQ ID




NO: 1031)





Cuboid Body-28
CATTAGTTGATGCCAGCTGTAATGATTGCCCTAGAATCA
(SEQ ID




NO: 1032)





Cuboid Body-29
ATATCGCATAGTAATTTTTTGGATT
(SEQ ID




NO: 1033)





Cuboid Body-30
GCGTAAAATTTAAGGGTGAGAAAGG
(SEQ ID




NO: 1034)





Cuboid Body-31
TAATGTTAGCACTAGAAGGTTCATCACCCAACAGT
(SEQ ID




NO: 1035)





Cuboid Body-32
AGAGCCATTCGGATGTTTTTGCCCAATCAAG
(SEQ ID




NO: 1036)





Cuboid Body-33
GCTTTGAAAGAGTCTTATCCG
(SEQ ID




NO: 1037)





Cuboid Body-34
CAGTCGCGTCGCTTGTTGGGGTGCAACGGTA
(SEQ ID




NO: 1038)





Cuboid Body-35
GTTATACAAAACATAACGCCAAAAG
(SEQ ID




NO: 1039)





Cuboid Body-36
GAACTGGCTCAACAAAAGGTAAAGTAA
(SEQ ID




NO: 1040)





Cuboid Body-37
AAGCGGGCGATCACTCCATCCCAATGTATAAGGCTATTA
(SEQ ID




NO: 1041)





Cuboid Body-38
AGATGATAGCGCGATGTAAATTCGAGAAAGGAACCTTA
(SEQ ID




NO: 1042)





Cuboid Body-39
CGTCAAATCATTAAGAAACCATCGATTTATTAGCGGCC
(SEQ ID




NO: 1043)





Cuboid Body-40
CAGAGGCTTTGAGGACCTTTAGATTAGCGAACTGCTATTAAACGA
(SEQ ID



TGAATAACA
NO: 1044)





Cuboid Body-41
TTACCTCGAGATTTCGCTCGCTGAATTCAGAAAGCGGTGCCGTA
(SEQ ID




NO: 1045)





Cuboid Body-42
AAAAGAAACATCGGTTAAAGGTCTTAAAAGCCTTTGTTTCAGGCC
(SEQ ID




NO: 1046)





Cuboid Body-43
ATACTTCATACACTAGAAAACTTATCATAACTACATCA
(SEQ ID




NO: 1047)





Cuboid Body-44
CGACATTGGGTCGAGAGGTCCACGCTGGCTTCTAATTA
(SEQ ID




NO: 1048)





Cuboid Body-45
TAAGTTGCACCGAAAGCGTGTACTGGTAATATCAAGAGT
(SEQ ID




NO: 1049)





Cuboid Body-46
AAAAAAAGGAAAGAGCCACCACCACCGGAACCGTCATAGTTA
(SEQ ID




NO: 1050)





Cuboid Body-47
TACATTTAATACGTGACAGCAGTAGGAATATAGAAAACGCTATAC
(SEQ ID



AAAAACTGAACA
NO: 1051)





Cuboid Body-48
AAATTATGGGCGAAGATGGTGCAGCTGAGTAAACATAAAGACCC
(SEQ ID



ATCAAAA
NO: 1052)





Cuboid Body-49
TGCCAGTACGAGCGTTGATATTCGTCACAGAGCCATTT
(SEQ ID




NO: 1053)





Cuboid Body-50
TACGAGCCGCCCCGGGGGTTGGTTTCCGGCGCAACTGAAC
(SEQ ID




NO: 1054)





Cuboid Body-51
GTCTTTACGGTTGTAATTAGCTGAGAGCCAGCATAAACAGTTCAG
(SEQ ID



GCAATAG
NO: 1055)





Cuboid Body-52
TTTAAAGGTGGAGCAATAAAGCCTACCGCCTGTTGCTG
(SEQ ID




NO: 1056)





Cuboid Body-53
CCCTGATTGAGTTACTGGCACAAACGTCTCCTCACCAGGGTAGGCT
(SEQ ID



GCACC
NO: 1057)





Cuboid Body-54
CCGGTATTTTTTATGCACTCAGCTGATGGTTAAATAGGACAGCAAA
(SEQ ID



GCTTGCGGAAAATC
NO: 1058)





Cuboid Body-55
AGTTCTGGTGCAACTCTGTGAGAGAGTATAACGTGCTTTATCAGTG
(SEQ ID



CAACA
NO: 1059)





Cuboid Body-56
AATAAGAGGCACTGACCTAATATATTTAGAGGCAAAAGATGAATA
(SEQ ID



ATGCCCGAACG
NO: 1060)





Cuboid Body-57
AAAGCCTACCTCGAGGGCTTAAAGATCGCGGTGCGGTT
(SEQ ID




NO: 1061)





Cuboid Body-58
CTGGACCTACATTTTGACGCTCATTATAGCTGTTTCCCCGAC
(SEQ ID



GCATTCT
NO: 1062)





Cuboid Body-59
GCGTCCATTACGAGAGGGCCAGGGTAAAGGGGCAG
(SEQ ID




NO: 1063)





Cuboid Body-60
CCGGAGATCAACGCTCTGACCCAATCGTCTGAAATTTAAATATGCT
(SEQ ID



CCTCCCTCGT
NO: 1064)





Cuboid Body-61
CGCCAGAAGCTAAACGGGCAAGTTCCGAGGCCAGTGGGTITTAAC
(SEQ ID



CGCCTAG
NO: 1065)





Cuboid Body-62
AATAACAATAAAGAGCCGCAACAGTATATCAAGGT
(SEQ ID




NO: 1066)





Cuboid Body-63
GTAAGAAGAGAAGCAACCAGAAATCAAATCGTCAT
(SEQ ID




NO: 1067)





Cuboid Body-64
ATTCTATCAAAATTATTTCGTCAGAGTTTATCTTACGAGAGC
(SEQ ID




NO: 1068)





Cuboid Body-65
TTCGAGTCTTTCCTTTTTCAAAATTTAGACCTTCATCAAATAATCCTT
(SEQ ID



TGGAAGGG
NO: 1069)





Cuboid Body-66
GTACGCGCTAGGAAGGGAATGCGCATTATTCTCAAAGTITTAACG
(SEQ ID



GAAG
NO: 1070)





Cuboid Body-67
ACCATCAAAATCACCGGATTTGCCAGCGACAGGAAACGTACCAGG
(SEQ ID



CCGTTGTATCG
NO: 1071)





Cuboid Body-68
AGGGATTAACCAACTATCGGTGTGTGAAATTGTGTCCATCACGCA
(SEQ ID



ACGC
NO: 1072)





Cuboid Body-69
AAATAGAGGTCTAGCTCAAGTGCGGGCCTTGAGCAAGTGGCGCC
(SEQ ID



GCTAC
NO: 1073)





Cuboid Body-70
AGAGCATTTTCCGTAACAAAGTACCTTTCATCTCGGAACGA
(SEQ ID




NO: 1074)





Cuboid Body-71
CTACACGCGAGGCGTTTTAAGACGCTGAGAAGCTA
(SEQ ID




NO: 1075)





Cuboid Body-72
AACCTCAATGAAAGGCTTTAATAGTCAGGGCTGGCGTTTCCCAATC
(SEQ ID



AAAAGTC
NO: 1076)





Cuboid Body-73
CGTATTACTTTAGGTCATCTTTTTTCGACCCTCAGGCTCAGTCACCA
(SEQ ID



ATAGGTGAA
NO: 1077)





Cuboid Body-74
TGAGTAAGTGTACAGTGATAACGCCATACCACCCTGCTGAGAAGA
(SEQ ID



AAATTTGAGG
NO: 1078)





Cuboid Body-75
CACTATTAAAGCGTCACCAAGGCCGAATCAAGGGATTTTAGAATA
(SEQ ID



GAAGGCTCCAA
NO: 1079)





Cuboid Body-76
AAATTAATATCGAACTCAGTTGTAGCAATACTGCTTAATTAGCGGT
(SEQ ID



CCGGCGATTA
NO: 1080)





Cuboid Body-77
CCTGTAGTCAGAAATTTGACCAGCTGAAATTTCAAAATTGAGACA
(SEQ ID



ACTAACA
NO: 1081)





Cuboid Body-78
ATAATATCAGAGAGTAAGCAGCACCGTAAAGGTTGAGGCAGGAC
(SEQ ID



GCCTGACC
NO: 1082)





Cuboid Body-79
AACCAATTTAAATTATATAACAACATCCGAAACACCGG
(SEQ ID




NO: 1083)





Cuboid Body-80
AAACAAACATCAAGGGCACCATCGTCACAACTTGCT
(SEQ ID




NO: 1084)





Cuboid Body-81
TCATTTGAATTACCAAGTTTCGCAACGGCAAGCCGTCT
(SEQ ID




NO: 1085)





Cuboid Body-82
GAGACAGGAGGCCGATTAAAGGGCTTTTTTCTCTTATAAGTCAC
(SEQ ID



GAGGA
NO: 1086)





Cuboid Body-83
GATAGCTTAATATCAGTTATTCAGTAGAGTACCAAAGCGCTTTAT
(SEQ ID



TCAGTCAAATCACCA
NO: 1087)





Cuboid Body-84
TAAAAACAATAGCAGGAAACGAAACGCGAACCTACGACTTATT
(SEQ ID



CTAATTAC
NO: 1088)





Cuboid Body-85
CTATGTATCGGGATTTAGAGCCCCAAACTGATAGCCCTA
(SEQ ID




NO: 1089)





Cuboid Body-86
ATAATGAAGATTTCTGCGATCTACTAAATTGGGATGAGTAATCT
(SEQ ID



TGATTGCAAATGAA
NO: 1090)





Cuboid Body-87
TTATCACAACGTGGAAAATCCTTTCACCTGGACCGTAATGGGGA
(SEQ ID



CGACATAC
NO: 1091)





Cuboid Body-88
AAACACGTTGATAACCAGCTTTGGGAACAACTCACTCGT
(SEQ ID




NO: 1092)





Cuboid Body-89
AAGCACAAAGACTATAAAACGAGGAACAGAATGCAG
(SEQ ID




NO: 1093)





Cuboid Body-90
GGGTACATTAAATATACCAGGCAATTATTITGCGGAACAATTG
(SEQ ID



AAAGAAG
NO: 1094)





Cuboid Body-91
GTGTTITTATACCTCGTTTCACCGCCGAAAATGGA
(SEQ ID




NO: 1095)





Cuboid Body-92
AACATTATTACCTGACGAAATCCAACAGGTCAGGA
(SEQ ID




NO: 1096)





Cuboid Body-93
CAGCGCCTAAATCGAGAAAGGGCGCTGATCGGCTTTG
(SEQ ID




NO: 1097)





Cuboid Body-94
AGACGGGGAGCAAGAATAACGTAAAGGTGAAAGTAGTGGTTG
(SEQ ID



CCAAAGCGC
NO: 1098)





Cuboid Body-95
CAAATGCCAAGAACCAGAACGCTAACGGTTGGGAATTTCTCACC
(SEQ ID



TGTTTAAACCGTCT
NO: 1099)





Cuboid Body-96
ATCAAGTAAATAATGTTAGTGATTAAATTGGCCTCTTTCCGCAA
(SEQ ID



GCAAAT
NO: 1100)





Cuboid Body-97
CAATAGTGTTATATGAATCATCGAACTGAAACCAAAGACTGGAT
(SEQ ID



TATAGTCAGAAGTTC
NO: 1101)





Cuboid Body-98
ATAATAAAGAATTATAAACAGCCATATTTCCTGAAATCCTCACA
(SEQ ID



TGTACAGG
NO: 1102)





Cuboid Body-99
TTAAATTTGTTAGGTGTCTGGATTAGGAATAAGAACGTAAATGA
(SEQ ID



ACGCATTATCCATC
NO: 1103)





Cuboid Body-100
CAGAGCCCGGAGTGGCTAAACCAGCCCTACCGCCAGCCAGTAG
(SEQ ID



AAGAAATGG
NO: 1104)





Cuboid Body-101
GCCACGCAAAATTACATCAATACGAGTACCTCAGGAGCTACGAGT
(SEQ ID




NO: 1105)





Cuboid Body-102
AAGCACGCAATAAACAATGAAATAGCAGGGAACCAATCCAAA
(SEQ ID




NO: 1106)





Cuboid Body-103
CTCATTTAGGCAAACGAAAGTAGTAATAGTAGGGCAAAGACCAA
(SEQ ID



AAATAAATTAATGCCG
NO: 1107)





Cuboid Body-104
CAGTATTAACAAAAATACGGGCGCGATTAGATAGGGGACATAGG
(SEQ ID



TCAGGGGGGCA
NO: 1108)





Cuboid Body-105
CGAGTAACGAGCACGTTGCAGCAAGCAAAAATCATCATGGT
(SEQ ID



CAAATTTTTG
NO: 1109)





Cuboid Body-106
AAATATTCCAAATCGCTTGCCAGGT
(SEQ ID




NO: 1110)





Cuboid Body-107
TTATGCTGGCTATGGTTTCATGTAAGAA
(SEQ ID




NO: 1111)





Cuboid Body-108
GAGGGAAGTCACCCACAGCAGGCTGG
(SEQ ID




NO: 1112)





Cuboid Body-109
AATATAATCCTGACCCCAATCGCATTAA
(SEQ ID




NO: 1113)





Cuboid Body-110
GCTTTCATTCCGTAAACGCACAGAC
(SEQ ID




NO: 1114)





Cuboid Body-111
CCAACCACATTATCATAATTTGATAATC
(SEQ ID




NO: 1115)





Cuboid Body-112
AACAACGCCAAGAAATACGAACGCGAAA
(SEQ ID




NO: 1116)





Cuboid Body-113
GAGAGGGGCTAAATATGCGCGAAAA
(SEQ ID




NO: 1117)





Cuboid Body-114
TCAATATTTTGCGGTACGTGGTTAA
(SEQ ID




NO: 1118)





Cuboid Body-115
CATTCAAGATTCTGAGAATATAA
(SEQ ID




NO: 1119)





Cuboid Body-116
GTAACTGTATGTTTGCCTCCCCCTT
(SEQ ID




NO: 1120)





Cuboid Body-117
TTAGAACAGTACCTTTAGTTTCAA
(SEQ ID




NO: 1121)





Cuboid Body-118
AAATCTAAAGATCTAAACTTGAGAAATC
(SEQ ID




NO: 1122)





Cuboid Body-119
CGACACATGTTGGATTATAGAGATA
(SEQ ID




NO: 1123)





Cuboid Body-120
GCCAACGTTGGATAAGAGGTTCAGC
(SEQ ID




NO: 1124)





Cuboid Body-121
TCGGTTTATCAGTAAGCGCCA
(SEQ ID




NO: 1125)





Cuboid Body-122
CAGATCATTACACGGGTAAGACAAACGA
(SEQ ID




NO: 1126)





Cuboid Body-123
CATTTCGGCTGGTGAATTCCGCTTT
(SEQ ID




NO: 1127)





Cuboid Body-124
AAGGCAGCTTGAAACCAAAACAGAA
(SEQ ID




NO: 1128)





Cuboid Body-125
GAAGAGAGAAGCACCCTCCAG
(SEQ ID




NO: 1129)





Cuboid Body-126
TAATTGAGCGCTAAGAACCAC
(SEQ ID




NO: 1130)





Cuboid Body-127
TTTTGAGAGATCTAATTCGCC
(SEQ ID




NO: 1131)





Cuboid Body-128
GCCAGATTTTAGACAGGCTAATGA
(SEQ ID




NO: 1132)





Cuboid Body-129
AACCGTTCTAGCTGACATTATTTGAATGCAA
(SEQ ID




NO: 1133)





Cuboid Body-130
ATATAGGAACGGGAGGATGAAGCATGAA
(SEQ ID




NO: 1134)





Cuboid Body-131
AGGTCTTTACCGAGCTTCTTTAATTGCA
(SEQ ID




NO: 1135)





Cuboid Body-132
CTCACAATCGTAATTTCTCCGCGGAAAC
(SEQ ID




NO: 1136)





Cuboid Body-133
TTTTTTGCAACCGAACATACAGAAT
(SEQ ID




NO: 1137)





Right End-PolyT 5-1
TTTTTGTAACCGTGCATCTGCGATGTGCTGCAAGGCGTTTTT
(SEQ ID




NO: 1138)





Right End-PolyT 5-2
TTTTTCCCACGCATAACCGATATATTCGCTG
(SEQ ID




NO: 1139)





Right End-PolyT 5-3
TTTTTAGGCGGTTTGCGTATTCCGAGATAGGGTTGAGTTTTT
(SEQ ID




NO: 1140)





Right End-PolyT 5-4
TTTTTTGTTGTTCCAGTTTGGGGGAATTAGAGCCAGCTTTTT
(SEQ ID




NO: 1141)





Right End-PolyT 5-5
TTTTTTTCTGTCCAGACGACGGATAAGTCCTGAACAATTTTT
(SEQ ID




NO: 1142)





Right End-PolyT 5-6
TTTTTCATGTCAATCATATGTGCAAATGGTCAATAACTTTTT
(SEQ ID




NO: 1143)





Right End-PolyT 5-7
GAGACAAAGGCTATCAGGTCATTGCCTTTTTT
(SEQ ID




NO: 1144)





Right End-PolyT 5-8
TTTTTCTGTTTAGCTATATTTAATTACCTTATGCGATTTTTT
(SEQ ID




NO: 1145)





Right End-PolyT 5-9
GCCATTTAACAAGAGAATAGCGGGCGCCACGT
(SEQ ID




NO: 1146)





Right End-PolyT 5-
TTTTTACAAAATCGCGCAGAGGCGAATT
(SEQ ID


10

NO: 1147)





Right End-PolyT 5-
TGGTTCTCCGTCATCAACATTAAATGTGAGCGATTTTT
(SEQ ID


11

NO: 1148)





Right End-PolyT 5-
TTTTTATTGCGTAGATTTTCACTTTGAATACCAAGTTTTTTT
(SEQ ID


12

NO: 1149)





Right End-PolyT 5-
TTTTTCCAGACGTTAGTAAATAGGAATTGCGAATAATTTTTT
(SEQ ID


13

NO: 1150)





Right End-PolyT 5-
TTTTTTTTAAGAACTGGCTCAACAAAAGGTAAAGTAATTTTT
(SEQ ID


14

NO: 1151)





Right End-PolyT 5-
TTTTTTTCATCGGCATTTTCGGCCTCCCTCAGAGCCGTTTTT
(SEQ ID


15

NO: 1152)





Right End-PolyT 5-
TTTGACATTTCACCCCGGATGAACGGTAATCGTAAAACTAGTTTTT
(SEQ ID


16

NO: 1153)





Right End-PolyT 5-
TACCCATTGTGTCATTTGCGAACGACAA
(SEQ ID


17

NO: 1154)





Right End-PolyT 5-
TTTTTGTAACAACCCGTCGGATGTAGATGGGCGCATCTTTTT
(SEQ ID


18

NO: 1155)





Right End-PolyT 5-
TTTTTGAAAAATAATATCCCAATGACAACAACCATCGTTTTT
(SEQ ID


19

NO: 1156)





Right End-PolyT 5-
TTTTTATTAAGTTGGGTAACGTTGATATAAGTATAGCTTTTT
(SEQ ID


20

NO: 1157)





Right End-PolyT 5-
TTAATGAATCGAAACCTGATTAATTGCGTTGCGCTCACTGCTTTTT
(SEQ ID


21

NO: 1158)





Right End-PolyT 5-
TTTTTCCGGAATAGGTGTATCAAGTTTTGTCGTCTTTTTTTT
(SEQ ID


22

NO: 1159)





Right End-PolyT 5-
ATTGGGTTTAAGCACGTAATTAGAC
(SEQ ID


23

NO: 1160)





Right End-PolyT 5-
TTTTTCCGCTTTCCAGTCGGGGCCAACGCGCGGGGAGTTTTT
(SEQ ID


24

NO: 1161)





Right End-PolyT 5-
TTTTTAAAATCACCAGTAGCACAGACTGTAGCGCGTTTTTTT
(SEQ ID


25

NO: 1162)





Right End-PolyT 5-
TTTTTTGGTCAGTTGGCAAATTCAATAGATAATACATTTTTT
(SEQ ID


26

NO: 1163)





Right End-PolyT 5-
TTTTTTTGAGGATTTAGAAGTAAACAGAAATAAAGAATTTTT
(SEQ ID


27

NO: 1164)





Right End-PolyT 5-
ATTCATTGATTCGCGTCGCTGTAGTTGCCAA
(SEQ ID


28

NO: 1165)





Right End-PolyT 5-
TTTTTAATTTTTTCACGTTGAAAATCTCGCCGACATCCTAATAAC
(SEQ ID


29

NO: 1166)





Right End-PolyT 5-
TTTTTGAGAGTCTGGAGCAAAACCACCAGCAGAAGATTTTTT
(SEQ ID


30

NO: 1167)





Right End-PolyT 5-
TTTTCGATCTAACCGTACAGTACCGTTA
(SEQ ID


31

NO: 1168)





Right End-PolyT 5-
TTTTTAAAACAGAGGTGAGGCACCCTCAATCAATATCTTTTT
(SEQ ID


32

NO: 1169)





Right End-PolyT 5-
TTTTTCCACCCTCAGAACCGCCACCCTCCAACTAAGAA
(SEQ ID


33

NO: 1170)





Left End-PolyT 5-1
TTTTTTTAATTGCTGAATATTCACCAGTCACACGATTTTT
(SEQ ID




NO: 1171)





Left End-PolyT 5-2
TTTTTTATGCGTTATACAAAACATAACGCCAAAAGTTTTT
(SEQ ID




NO: 1172)





Left End-PolyT 5-3
TTTTTGAAGCCTTAAATCAAACCGTTCCAGTAAGCTTTTT
(SEQ ID




NO: 1173)





Left End-PolyT 5-4
TTTTTGTCGGTGGGCACGAAATATTACCGCCAGCCTTTTT
(SEQ ID




NO: 1174)





Left End-PolyT 5-5
TTTTTTAACCTTGCTTCTGTCGCCTGATAAATTGTTTTTT
(SEQ ID




NO: 1175)





Left End-PolyT 5-6
AGAGAGAGGCTAATCATAAACCATGTTACTTAGCCGGAACTTTTT
(SEQ ID




NO: 1176)





Left End-PolyT 5-7
CCGAAGCACAGAGATTTTTGTTTAACGTCAAAAATGTTTTT
(SEQ ID




NO: 1177)





Left End-PolyT 5-8
TTTGATTAGTCTCCCGACTTGCGGGAGGTTTTTTTTT
(SEQ ID




NO: 1178)





Left End-PolyT 5-9
GAACGGCCAACTTACATTTGG
(SEQ ID




NO: 1179)





Left End-PolyT 5-10
TTTTTGTCATACATGGCTTTGTAACAGTGCCCGTATTTTT
(SEQ ID




NO: 1180)





Left End-PolyT 5-11
TAGCATTAATTAATTTTCCCTTAGAATTTTT
(SEQ ID




NO: 1181)





Left End-PolyT 5-12
CTTGTATCATAAATCGTTAAATCAATATATGTGAGTGAATTTTT
(SEQ ID




NO: 1182)





Left End-PolyT 5-13
TTTTTATTAAGAGGAAGCCCTATTTTAAATGCAATTTTTT
(SEQ ID




NO: 1183)





Left End-PolyT 5-14
TTTTTGAGGCGCAGACGGTCTTTGCAAAAGAAGTTTTTTT
(SEQ ID




NO: 1184)





Left End-PolyT 5-15
TTTTTAAAATAGCAGCCTTTCCTTITTAAGAAAAGTTTTT
(SEQ ID




NO: 1185)





Left End-PolyT 5-16
TTTTTCTACCTTTTTAACCTGCCTGTTTAGTATCATTTTT
(SEQ ID




NO: 1186)





Left End-PolyT 5-17
TTTTTTAAGCAGATAGCCGAATAAGTTTATTTTGTTTTTT
(SEQ ID




NO: 1187)





Left End-PolyT 5-18
TTTAGGTTGGGAATTTAACAGGAGCAGTCTCTACCAGATATCTTA
(SEQ ID




NO: 1188)





Left End-PolyT 5-19
TTTTTGAATTACGAGGCATACGGATGGCTTAGAGCTTTTT
(SEQ ID




NO: 1189)





Left End-PolyT 5-20
TTTTTATTGCAACAGGAAAAACGCTCAGGCAGATAATGCTGATTTT
(SEQ ID



TGGTA
NO: 1190)





Left End-PolyT 5-21
GTGCCCCTGCCGCCTTGATGATGATTCAAAATCGA
(SEQ ID




NO: 1191)





Left End-PolyT 5-22
TTTTTGCCTGAGTAATGTGTAGGTAAAGATTCAATTA
(SEQ ID




NO: 1192)





Left End-PolyT 5-23
TGCCCAAAATATGCAGATTTCTTACAGAAAAACCGG
(SEQ ID




NO: 1193)





Left End-PolyT 5-24
ACTCAACTAAACCCCGCAGTGTCCACGTGGCGGAACCC
(SEQ ID




NO: 1194)





Left End-PolyT 5-25
TTTTTCCAGTAATAAAAGGGACATTCTCCTCATAGAAA
(SEQ ID




NO: 1195)





Left End-PolyT 5-26
TAAAGGGATTCATACCACGGAACAAAGTTGAA
(SEQ ID




NO: 1196)





Left End-PolyT 5-27
TTTTTCACCACACCCGCCGCTCTTTGATTAGTAATTTTTT
(SEQ ID




NO: 1197)





Left End-PolyT 5-28
TTTTTAACATCACTTGCCTGAGTAGAACAGAACATATAGGGCCC
(SEQ ID




NO: 1198)





Left End-PolyT 5-29
TTTTTTTGCCAGAGGGGGTAGATTGCATCAAAAAGTTTTT
(SEQ ID




NO: 1199)





Left End-PolyT 5-30
TTTTTATGACAATGTCCCGCATCTGTAAGCAACTCTTTTT
(SEQ ID




NO: 1200)





Left End-PolyT 5-31
TTTTTTAAACAGTTAATGCCTGAATTGTCAACCTTTTTTT
(SEQ ID




NO: 1201)





Left End-PolyT 5-32
TTTTTCACAATCAATAGAAAAGCCCCCGATTTAGATTTTT
(SEQ ID




NO: 1202)





Left End-PolyT 5-33
TTTTTTCCTTGAAAACATAGCATAGGTCTGAGAGATTTTT
(SEQ ID




NO: 1203)





Left End-PolyT 5-34
GACCAAAGCGATAGTAAAATGTTTAATAGCGCAAC
(SEQ ID




NO: 1204)





Left End-PolyT 5-35
TTTTTGCTTGACGGGGAAAGCACGCTGCGCGTAACTTTTT
(SEQ ID




NO: 1205)





Left End-PolyT 5-36
TTTTTGTCGAAATCCGCGACCTGCTTTATCATCAGAGA
(SEQ ID




NO: 1206)





Barcode connection 1
ACCCAAATAATCTTACCGGCTTATACACGACGCTCTTCCGATCT
(SEQ ID




NO: 1207)





Barcode connection 2
TATTCAGCTCCTCGAATTTCCACAAGGCCACACACGACGCTCTTCC
(SEQ ID



GATCT
NO: 1208)





Barcode connection 3
CGCGCCTTGAATATAATAACGTCAATTACCTGAGCACACGACGCTC
(SEQ ID



TTCCGATCT
NO: 1209)





Barcode connection 4
GTGAGCTAAACGGCGGATTGTTGGCCTTACACGACGCTCTTCCGA
(SEQ ID



TCT
NO: 1210)









EXAMPLE 2
Nucleic Acid Nanostructure Delivery Compositions

The materials and methods below describe an example in which a DNA origami (DNAO) nanostructure is a cuboid structure and the nucleic acid barcode constructs are attached to the DNAO via complementary base pairing of the barcodes with one of the oligonucleotide staples within the DNAO.


Barcode Design

The barcodes used in this example comprise a unique portion comprising 8 to 10 nucleotides in the center of the polynucleotide, the unique portion further characterized by a hamming distance of at least 3 bases from any other barcodes to be pooled. Directly on the 3′ end of the barcode, 7 to 10 random bases are included for bioinformatic removal of PCR duplicates. This central sequence is flanked by universal primer annealing sites containing overhangs for the addition of index adapters during sequencing library preparation. The polynucleotide barcodes in this example were designed with a biotin functional group on the 5′ end.









(SEQ ID NO: 1004)


/5BiosG/A*G*A*CGTGTGCTCTTCCGATCTGAGGGTACTTNNNNNNN


NNNAGATCGGAAGAGCGTCG*T*G*T.






Materials for Nucleic Acid Nanostructure Delivery Composition Synthesis (DNAO)

The DNA origami scaffold is a single stranded DNA (ssDNA) isolated from the M13 bacteriophage. The oligonucleotide staples are short single stranded DNA with sequences described in Table 2. The barcode is a single stranded DNA segment as described under barcode design above.


Preparation of the DNA Barcoded DNAO Structures

A reaction mixture was prepared comprising the DNA scaffold, the oligonucleotide staples and magnesium together in TE buffer in a reaction vessel in the following amounts: 160 uL all oligonucleotide staples (described in Table 2) pooled at a total concentration of 500 nM, 80 uL scaffold (100 nM), 80 uL water, 40 uL of TE buffer (1.46 g EDTA, 3.03 g Tris Base, 1.46 g NaCl, 500 mL water), 40 uL of 200 mM MgCl2. The reaction vessel was placed in a thermocycler with thermal ramp starting at ˜65∇C and descending to 24∇C over the course of ˜67 hours. The product was purified using a precipitation with a PEG purification protocol using a PEG solution made with the following recipe: 75 g PEG8000, 50 mL of the TE buffer described above, and 62.5 mL of 4 M NaCl, brought up to 500 mL with water, yielding a DNAO nanostructure in water. The concentration was measured via nanodrop and then the barcodes were added to the product at a molar ratio of 4:1 (polynucleotide barcodes to DNAO nanostructure). This mixture was incubated at 37∇C for 2 to 3 hours. The product was purified with another PEG purification process as described above, yielding a final product of DNA barcoded DNAO in water. Transmission electron microscopy images were captured on formvar/carbon coated nickel grids with a negative stain with 1% phophotungstic acid (PTA) using an FEI Tecnai G2 Bio Twin TEM (see FIG. 1).


In Vitro Transfection

HEK 293 cells were plated in a 48 well plate at 75,000 cells/well with 200 uL of complete growth media 24 hours prior to transfection. Complete growth media consisted of DMEM supplemented with 10% FBS and 1% Pen-Strep. Cells were dosed with DNAO at a final concentration of 10 nM, 5 nM and 2.5 nM DNAO with and without barcode in triplicate for a total of 16 wells. Three (3) wells were used for controls. After 16 hours the cells were trypsinized and replicate wells pooled together. DNA from the cells was extracted using a Qiagen DNA Extraction Kit. These cell extracts were used for PCR amplification.


Barcode PCR Amplification

A Master Mix was created with: Kapa HiFi 2×Master mix, Reverse barcode primer, forward barcode primer, DMSO, and nuclease free water. Master Mix (15 μL) was loaded into each well. Each of the cell extracts was loaded (5 μL) in duplicate into each well of a 96 well plate. Nuclease free water (5 μL) was loaded into the designated NTC wells. Positive control (5 μL) was loaded into the designated positive control wells. The positive control comprised a solution of polymer nanoparticles in phosphate buffered saline consisting of dimethylaminoethylmethacrylate, polyacylate, and butyl methacrylate, chemically conjugated with the same barcode as that used to label the DNAO, as described here before. Each well was covered, either with a strip cap or adhesive seal, and centrifuged for approx. 1 min at 1,000×g. Amplification of the barcodes was conducted by incubating in a thermocycler under typical PCR conditions.


Analysis of PCR Amplification Via Gel Electrophoresis

The gel electrophoresis was done on a 4% 48-well Ethidium Bromide gel, using 15 μL of 1 kb E-gel Ladder in the first well. DNAO and DNA barcoded. DNAO cell extracts and controls (10 μL) were added to each well of the gel. E-gel buffer (5 μL) was added to each well. Negative controls from cells were extracts without any barcoded DNAO that underwent amplification. Negative controls in the last 2 wells were water. Nuclease free water (15 μL) was added to any remaining empty wells. The gel doe was powered on to run current through the gel for about 20 to 25 mins, or until the sample buffer line reached the end of the gel. The gel was removed from the base and analyzed in a Gel Imager (see FIG. 3).


Results


FIG. 3 shows PCR amplification of DNAO with and without barcodes at various transfection concentrations. Positive amplification is denoted by a distinct band above the primer bands (bright hands extending all the way across the gel towards the bottom of the gel) and a likeness to the positive control banding pattern. There is clear amplification of barcode from the cell extract of the 10 nM DNAO dose (two lanes above the primer bands) and sonic visible amplification of the 5 nM DNAO dose (two lanes above the primer bands). This confirms that the barcodes attached to the DNA origami structure were successfully delivered in the cells and can he read back at the right concentration. Note that the diffused bands near the wells at the top of the lane in the “DNAO 10 nM” and “Barcoded DNAO 10 nM” lanes are the actual DNA origami nanostructures.

Claims
  • 1. A composition comprising a non-viral delivery vehicle comprising a nucleic acid nanostructure delivery composition, and a nucleic acid barcode construct.
  • 2. The composition of claim 1, wherein the nucleic acid nanostructure delivery composition comprises a DNA origami composition.
  • 3. The composition of claim 1, wherein the nucleic acid nanostructure delivery composition comprises single-stranded or double-stranded DNA or RNA.
  • 4. The composition of claim 1, wherein the nucleic acid barcode construct is associated with the nucleic acid nanostructure delivery composition via base-pairing.
  • 5. The composition of claim 4, wherein the base-pairing occurs between a sequence of a single-stranded overhang on the nucleic acid nanostructure delivery composition and a complementary sequence appended to the nucleic acid barcode construct.
  • 6. The composition of claim 1, wherein the nucleic acid nanostructure delivery composition comprises staples that self-assemble to form the nucleic acid nanostructure delivery composition.
  • 7. The composition of claim 6, wherein the staples act as the nucleic acid barcode construct.
  • 8. The composition of claim 1, wherein the nucleic acid barcode construct is bound to the nucleic acid nanostructure delivery composition by a covalent bond.
  • 9. The composition of claim 8, wherein the covalent bond is formed via an EDC-NHS coupling reaction between a terminal phosphate group of the 5′ end of an overhang on the nucleic acid nanostructure delivery composition and an amine group on an amino terminal nucleotide of the nucleic acid barcode construct.
  • 10. The composition of claim 8, wherein the covalent bond is formed via a click chemistry coupling reaction between an azide group on the nucleic acid nanostructure delivery composition and an alkyne group on the nucleic acid barcode construct.
  • 11. The composition of claim 8, wherein the covalent bond is formed via a click chemistry coupling reaction between an azide group on the nucleic acid barcode construct and an alkyne group on the nucleic acid nanostructure delivery composition.
  • 12. The composition of claim 1, wherein the nucleic acid barcode construct is associated with the nucleic acid nanostructure delivery composition by a covalent bond between a carboxy terminated molecule on the nucleic acid nanostructure delivery composition and a primary amine on the nucleic acid barcode construct at the 5′ and/or the 3′ end.
  • 13. The composition of claim 1, wherein the nucleic acid barcode construct comprises two primer binding segments and one or more unique barcode sequences between the two primer binding segments.
  • 14. The composition of claim 13, wherein the length of the unique barcode sequences is two times or more greater than the length of the primer binding segments.
  • 15. The composition of claim 13, wherein the unique barcode sequences further comprise a hamming distance of at least 2 to 6 bases between any two unique barcode sequences.
  • 16. The composition of claim 13, wherein the nucleic acid barcode construct further comprises from about 6 to about 12 random bases at the 3′ end of the unique barcode sequences.
  • 17. The composition of claim 16, wherein the about 6 to about 12 random bases at the 3′ end of the unique barcode sequences are for bioinformatic removal of PCR duplicates.
  • 18. A method of in vivo screening for a desired nucleic acid nanostructure delivery composition, the method comprising (a) preparing a library comprising two or more types of nucleic acid nanostructure delivery compositions, wherein each nucleic acid nanostructure delivery composition is associated with a nucleic acid barcode construct comprising a different unique barcode sequence, (b) administering the library to an animal, (c) removing cells or tissues from the animal, (d) isolating the nucleic acid barcode constructs from the cells or the tissues of the animal, (e) detecting the nucleic acid barcode constructs in the cells or the tissues of the animal, and (f) identifying the desired nucleic acid nanostructure delivery composition for use as a delivery vehicle.
  • 19. The method of claim 18, wherein the nucleic acid barcode construct is detected by a method selected from the group consisting of the polymerase chain reaction (PCR), isothermal amplification, sequencing, or a combination thereof, to obtain nucleotide sequence data.
  • 20. The method of claim 18, wherein the nucleic acid nanostructure delivery composition is loaded with a payload.
  • 21. The method of claim 20, wherein the payload is a luminescent molecule.
  • 22. The method of claim 21, wherein the luminescence is used to track the biodistribution or cell uptake of the nucleic acid nanostructure delivery composition via imaging.
  • 23. The method of claim 18, wherein the nucleic acid barcode construct is isolated from the cells and the tissues by mixing with a first organic compound and incubating the organic phase with an aqueous phase of the cell or tissue sample, separating the organic phase from the aqueous phase, mixing the organic phase with a second organic compound, incubating the mixture, precipitating the nucleic acid barcode construct from the mixture, removing the organic phase by evaporation, and resuspending the nucleic acid barcode construct in an aqueous composition.
  • 24. The method of claim 23, wherein the organic phase comprises phenol chloroform.
  • 25. The method of claim 18, wherein the nucleic acid barcode construct is separated from cationic material in the cells or tissues by titrating the aqueous composition of the nucleic acid barcode construct to a pH of greater than 7.4.
  • 26. The method of claim 18, wherein the nucleic acid barcode construct is separated from material in the cells or tissues by binding the nucleic acid barcode construct with a molecule with a binding affinity to the nucleic acid barcode construct greater than the binding affinity to the cell or tissue material.
  • 27. The method of claim 18, wherein the nucleic acid barcode construct is separated from material in the cells or tissues via a method selected from the group consisting of size exclusion chromatography, dialysis, diafiltration, and filtration.
  • 28. The method of claim 18, wherein the nucleic acid barcode construct is separated from material in the cells or tissues by digesting proteins using an enzyme wherein the enzyme is Proteinase K.
  • 29. The method of claim 18, wherein the nucleic acid barcode constructs associated with the nucleic acid nanostructure delivery composition are detected by first diluting the isolated nucleic acid barcode constructs by a factor of at least 1000 times, and then amplifying the nucleic acid barcode constructs by PCR using primers.
  • 30. The method of claim 29, wherein the primers from the PCR step are enzymatically digested prior to detection of amplicons.
CROSS-REFERENCE TO RELATED APPLICATIONS

This non-provisional application claims the benefit and priority, under 35 U.S.C. § 119(e) and any other applicable laws and statutes, to U.S. Provisional Application Ser. No. 63/350,688 filed on Jun. 9, 2022, the entire disclosure of which is incorporated herein by reference.

Provisional Applications (1)
Number Date Country
63350688 Jun 2022 US