Applicant asserts that the information recorded in the form of an Annex C/ST.26 file submitted under Rule 13ter.1 (a), entitled >>>UCI 21.14 PCT<<<, is identical to that forming part of the international application as filed. The content of the sequence listing is incorporated herein by reference in its entirety
The present invention features compositions and methods for improving the activity and affinity of in vitro selected XNA aptamers as affinity reagents to protein targets.
The effectiveness of antibodies in diagnostic and therapeutic applications have inspired efforts to explore the chemical space of evolvable non-natural genetic systems in search of sequence-defined macromolecules that can recapitulate antibody binding by folding into shapes that recognize disease-associated proteins with high affinity and high specificity. Toward this goal, much attention has been given to the establishment of Darwinian evolution systems that allow for the isolation of artificial genetic polymers with nucleobase or sugar-phosphate backbone modifications that function with enhanced target binding affinity or elevated biological stability. However, despite many notable accomplishments, including the evolution of novel synthetic genetic polymers (XNAs) with backbone chemistries not found in nature and the creation of highly functionalized nucleic acid polymers (HFNAPs), the ability to generate affinity reagents that are both recalcitrant to biological nucleases and capable of recognizing their cognate protein target with low dissociation rate constants has proven elusive. Since slow-off rate binding is a hallmark of a high quality antibody, renewed efforts are needed to establish examples of biologically stable affinity reagents that mirror the binding properties of the best monoclonal antibodies.
It is an objective of the present invention to provide compositions and methods that allow for improvements in the activity and affinity of in vitro selected XNA (e.g., TNA) aptamers (e.g., threomers) as affinity reagents to protein targets, as specified in the independent claims. Embodiments of the invention are given in the dependent claims. Embodiments of the present invention can be freely combined with each other if they are not mutually exclusive.
Synthetic genetic polymers (XNAs) have the potential to transition aptamers from laboratory tools to therapeutic agents, but additional functionality is needed to compete with antibodies. Described herein is the evolution of a biologically stable artificial genetic system comprised of α-L-threofuranosyl nucleic acid (TNA) that facilitates the production of backbone- and base-modified aptamers termed ‘threomers’ that function as high quality protein capture reagents. Threomers were discovered against two prototypical protein targets implicated in human diseases through a combination of in vitro selection and next-generation sequencing using uracil nucleotides that are uniformly equipped with aromatic side chains commonly found in the paratope of antibody-antigen crystal structures. Kinetic measurements reveal that the side chain modifications are critical for generating threomers with slow off-rate binding kinetics. These findings expand the chemical space of evolvable non-natural genetic systems to include functional groups that enhance protein target binding by mimicking the structural properties of traditional antibodies.
In some embodiments, the present invention features base-modified xeno nucleic acid (XNA) nucleoside monomers. The base-modified XNA nucleoside monomers may comprise a synthetic, non-natural sugar (e.g., a threose or a hexose sugar), a pyrimidine nucleotide base comprising a chemical modification (e.g., at position C-5 of the nucleobase) bound to the sugar moiety (i.e., the synthetic non-natural sugar), and a phosphorus group also bound to the sugar moiety.
In other embodiments, the present invention features a based-modified threose nucleic acid (TNA) nucleoside monomer. The base-modified TNA nucleoside monomer may comprise a threose sugar, a pyrimidine nucleotide base comprising a chemical modification (e.g., at position C-5 of the nucleobase) bound to the sugar moiety (i.e., the threose sugar), and a phosphorus group also bound to the sugar moiety.
Additionally, the present invention may feature single-stranded oligonucleotide aptamer (e.g., threomers) for binding a target protein. The aptamer comprise one or more of the base-modified nucleoside monomers described herein. For example, the present invention may feature a single-stranded threose nucleic acid (TNA) aptamer (i.e., a threomer) for binding a target protein. The aptamer may comprise one or more of the base-modified TNA nucleoside monomers as described herein.
One of the unique and inventive technical features of the present invention is the use of a modified pyrimidine nucleotide base (e.g., a pyrimidine nucleotide base comprising a chemical modification, e.g., at position C-5 of the nucleobase). Without wishing to limit the invention to any theory or mechanism, it is believed that the technical feature of the present invention advantageously provides for an XNA aptamer (e.g., a threomer) that mimics the amino acid residues found at the interface of antibody-antigen interactions. None of the presently known prior references or work has the unique inventive technical feature of the present invention.
Furthermore, the prior references teach away from the present invention. For example, prior references utilize XNA nucleosides monomers that are pre-made. However, the base-modified XNA nucleosides monomers (e.g., TNA nucleosides monomers) of the present invention require XNA nucleosides monomers (e.g., TNA nucleosides monomers) that have to be prepared in-house because they are not commercially available. This in-house preparation requires about 8 synthetic steps, starting from L-ascorbic acid (vitamin C).
Furthermore, the inventive technical features of the present invention contributed to a surprising result. For example, the impact that the modified sidechain had on the ability for TNA aptamers to bind their protein targets. The base-modified aptamers significantly outperformed the standard base aptamers in both quantity and quality.
Any feature or combination of features described herein are included within the scope of the present invention provided that the features included in any such combination are not mutually inconsistent as will be apparent from the context, this specification, and the knowledge of one of ordinary skill in the art. Additional advantages and aspects of the present invention are apparent in the following detailed description and claims.
The features and advantages of the present invention will become apparent from a consideration of the following detailed description presented in connection with the accompanying drawings in which:
2E, and 2F show the selection performance of functionally enhanced libraries.
Before the present compounds, compositions, and/or methods are disclosed and described, it is to be understood that this invention is not limited to specific synthetic methods or to specific compositions, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting. All embodiments disclosed herein can be combined with other embodiments unless the context clearly dictates otherwise.
Unless otherwise explained, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which a disclosed invention belongs. The singular terms “a,” “an,” and “the” include plural referents unless context clearly indicates otherwise. Similarly, the word “or” is intended to include “and” unless the context clearly indicates otherwise. The term “comprising” means that other elements can also be present in addition to the defined elements presented. The use of “comprising” indicates inclusion rather than limitation. Stated another way, the term “comprising” means “including principally, but not necessary solely”. Furthermore, variation of the word “comprising”, such as “comprise” and “comprises”, have correspondingly the same meanings. In one respect, the technology described herein related to the herein described compositions, methods, and respective component(s) thereof, as essential to the invention, yet open to the inclusion of unspecified elements, essential or not (“comprising”).
Suitable methods and materials for the practice and/or testing of embodiments of the disclosure are described below. Such methods and materials are illustrative only and are not intended to be limiting. Other methods and materials similar or equivalent to those described herein can be used. For example, conventional methods well known in the art to which the disclosure pertains are described in various general and more specific references, including, for example, Sambrook et al., Molecular Cloning: A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press, 1989; Sambrook et al., Molecular Cloning: A Laboratory Manual, 3d ed., Cold Spring Harbor Press, 2001; Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing Associates, 1992 (and Supplements to 2000); Ausubel et al., Short Protocols in Molecular Biology: A Compendium of Methods from Current Protocols in Molecular Biology, 4th ed., Wiley & Sons, 1999; Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1990; and Harlow and Lane, Using Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1999, Gene Expression Technology (Methods in Enzymology, Vol. 185, edited by D. Goeddel, 1991. Academic Press, San Diego, Calif.), “Guide to Protein Purification” in Methods in Enzymology (M. P. Deutshcer, ed., (1990) Academic Press, Inc.); PCR Protocols: A Guide to Methods and Applications (Innis, et al. 1990. Academic Press, San Diego, Calif.), Culture of Animal Cells: A Manual of Basic Technique, 2nd Ed. (R. I. Freshney. 1987. Liss, Inc. New York, N.Y.), Gene Transfer and Expression Protocols, pp. 109-128, ed. E. J. Murray, The Humana Press Inc., Clifton, N.J.), and the Ambion 1998 Catalog (Ambion, Austin, Tex.), the disclosures of which are incorporated in their entirety herein by reference.
All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety for all purposes. In case of conflict, the present specification, including explanations of terms, will control.
Although methods and materials similar or equivalent to those described herein can be used to practice or test the disclosed technology, suitable methods and materials are described below. The materials, methods, and examples are illustrative only and not intended to be limiting
As used herein, the term “XNA” or “xeno-nucleic acids” may refer to artificial genetic polymers with novel sugar-phosphate backbones that harbor unique physicochemical properties relative to natural deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) (e.g., properties may include nuclease stability, acid stability, increased thermodynamics of hybridization, or a combination thereof).
As used herein, the term “TNA” or “α-L-threofuranosyl nucleic acid” or “threose nucleic acid” refers to an artificial nucleic acid wherein the sugar portion of the nucleic acid is threose.
As used herein, the term “threomers” are backbone- and base-modified aptamers comprising a TNA sugar moiety and at least one base-modified TNA nucleotide monomer.
Referring now to
The present invention features base-modified xeno nucleic acid (XNA) nucleoside monomers. The base-modified XNA nucleoside monomers may comprise a synthetic, non-natural sugar (e.g., a threose or a hexose sugar), a pyrimidine nucleotide base bound to the sugar moiety (i.e., the synthetic non-natural sugar), and a phosphorus group also bound to the sugar moiety. The pyrimidine nucleotide base may comprise a chemical modification, e.g., at position C-5 of the nucleobase.
Non-limiting examples of xeno nucleic acids (XNA) nucleoside monomers may include but are not limited to threose nucleic acids (TNA) nucleoside monomers, hexose nucleic acids (HNA) nucleoside monomers, locked nucleic acids (LNA) nucleoside monomers, arabino nucleic acid (ANA) nucleoside monomers, fluoroarabino nucleic acid (FANA) nucleoside monomers, or mirror-image DNA or L-DNA nucleoside monomers. In some embodiments, the synthetic, non-natural sugar (i.e., the sugar moiety) is a threose sugar. In other embodiments, the synthetic, non-natural sugar is hexose sugar. In further embodiments, the synthetic, non-natural sugar comprises a modified ribose sugar where the 2′ oxygen and 4′ carbon are connected. In some embodiments, the synthetic, non-natural sugar comprises a ribose sugar or a deoxyribose sugar.
The present invention may also feature a base-modified threose nucleic acid (TNA) nucleoside monomer. The base-modified TNA nucleoside monomer may comprise a threose sugar, a pyrimidine nucleotide base bound to the sugar moiety (i.e., the threose sugar), and a phosphorus group also bound to the sugar moiety. The pyrimidine nucleotide base comprises a chemical modification, e.g., at position C-5 of said pyrimidine nucleotide base.
In some embodiments, the phosphorus group comprises a triphosphate group. In other embodiments, the phosphorus group comprises a phosphoramidite group. The position of where the phosphorus group (e.g., either the triphosphate group or the phosphoramidite group) binds to the sugar moiety of the nucleoside monomers described herein depends on the sugar therein. As used herein, for nucleic acid numbering a prime (′) is used to distinguish atoms on the sugar moiety from those on the nucleobase. The triphosphate group may be bound to the C3′ position on a threose sugar. Or, the triphosphate group may be bound to the C6′ position on a hexose sugar. In some embodiments, the triphosphate group is bound to the C5′ position on a ribose sugar or a deoxyribose sugar. The phosphoramidite group may be bound to the C2′ position on a threose sugar. In an alternative embodiment, the phosphoramidite group may be bound to the C3′ position on a threose sugar. The phosphoramidite group may be bound to the C3′ position on a hexose sugar. In some embodiments, the phosphoramidite group is bound to the C3′ position on a ribose sugar or a deoxyribose sugar.
The nucleotide base of the nucleoside monomers described herein may be bound to the C1′ position of the sugar moiety (e.g., the synthetic, non-natural sugar (e.g., threose or hexose)). In some embodiments, the modified pyrimidine nucleotide base (i.e., the pyrimidine nucleotide base comprising a chemical modification at position C-5 of the nucleobase) is bound to the C1′ position of the sugar moiety. The pyrimidine nucleotide base (e.g., the modified pyrimidine nucleotide base) may comprise a uracil base or a cysteine base. In some embodiments, the pyrimidine nucleotide base may comprise a thymine base. In some embodiments, the pyrimidine nucleotide base (e.g., the modified pyrimidine nucleotide base) may comprise a modified uracil base or a modified cysteine base.
In some embodiments, the pyrimidine nucleotide base described herein comprises a chemical modification at position C-5 of the nucleobase. In some embodiments, the chemical modification at position C-5 of the pyrimidine nucleotide base is a planar aromatic side chain (e.g., a phenylalanine side chain or a tryptophan side chain). Non-limiting examples of the chemical modification include, but are not limited to, phenylalanine side chain, a tryptophan side chain, a leucine side chain, a dioxol side chain, an isopentyl side chain, a dioxethyl side chain, a cyclopropyl side chain, a p-methoxy-phenyl side chain, a napthalene side chain, or a phenethyl side chain (see
In some embodiments, the present invention may also feature modified purine nucleotide bases. In some embodiments, the purine nucleotide base can be modified at the N-7 position of the ring system. In further embodiments, the chemistry involves synthesizing the purine nucleotide as a 7-deaza-7-modified base.
Additionally, the present invention may feature single-stranded oligonucleotide aptamer (e.g., threomers) for binding a target protein. The aptamers may comprise one or more of the base-modified nucleoside monomers described herein. For example, the present invention may feature a single-stranded threose nucleic acid (TNA) aptamer (i.e., a threomer) for binding a target protein comprising one or more of the base-modified TNA nucleoside monomers as described herein.
The present invention may feature a single-stranded threose nucleic acid (TNA) aptamer (e.g., a threomer) for binding a target protein. The aptamer may comprise one or more of the base-modified TNA nucleotides (i.e., a base-modified TNA nucleoside monomer). The base-modified TNA nucleotide comprising a threose sugar, a pyrimidine nucleotide base comprising a chemical modification at position C-5 bound to the sugar moiety, and a phosphorous group bound to the threose group.
In some embodiments, the aptamers (e.g., threomers) described herein comprise about 20 to 40 nucleoside monomers (e.g., TNA nucleoside monomer) in length. In other embodiments, the aptamers (e.g., threomers) described herein comprise about 15 to 50, or about 15 to 45, or about 15 to 40, or about 15 to 35, or about 15 to 30, or about 15 to 25, or about 15 to 20, or about 20 to 50, or about 20 to 45, or about 20 to 40, or about 20 to 35, or about 20 to 30, or about 20 to 25, or about 25 to 50, or about 25 to 45, or about 25 to 40, or about 25 to 35, or about 25 to 30, or about 30 to 50, or about 30 to 45, or about 30 to 40, or about 30 to 35, or about 35 to 50, or about 35 to 45, or about 35 to 40, or about 40 to 50, or about 40 to 45, or about 45 to 50 nucleoside monomers (e.g., TNA nucleoside monomer) in length. In some embodiments, the aptamers (e.g., threomers) described herein are about 15, or about 20, or about 25, or about 30, or about 35, or about 40, or about 45, or about 50 nucleoside monomers (e.g., TNA nucleoside monomer) in length.
In some embodiments, the aptamers (e.g., threomers) described herein comprise at least one base-modified nucleoside monomer (e.g., a nucleoside monomer with a pyrimidine nucleotide base comprising a chemical modification). In some embodiments, the aptamers comprise at least one modified cysteine monomer. In some embodiments, the aptamers comprise at least one modified uracil monomer. In other embodiments, the aptamers comprise at least one modified cysteine monomer, and at least one modified uracil monomer. In alternative embodiments, the aptamers comprise at least one modified thymine monomer.
In some embodiments, the aptamers (e.g., threomers) described herein is comprised of 25-35% base-modified nucleoside monomer (e.g., a nucleoside monomer with a pyrimidine nucleotide base comprising a chemical modification). In other embodiments, the aptamer is comprised of about 10% to 50%, or about 10% to 45%, or about 10% to 40%, or about 10% to 35%, or about 10% to 30%, or about 10% to 25%, or about 10% to 20%, or about 10% to 15%, or about 15% to 50%, or about 15% to 45%, or about 15% to 40%, or about 15% to 35%, or about 15% to 30%, or about 15% to 25%, or about 15% to 20%, or about 20% to 50%, or about 20% to 45%, or about 20% to 40%, or about 20% to 35%, or about 20% to 30%, or about 20% to 25%, or about 25% to 50%, or about 25% to 45%, or about 25% to 40%, or about 25% to 35%, or about 25% to 30%, or about 30% to 50%, or about 30% to 45%, or about 30% to 40%, or about 30% to 35%, or about 35% to 50%, or about 35% to 45%, or about 35% to 40%, or about 40% to 50%, or about 40% to 45%, or about 45% to 50% base-modified nucleoside monomer (e.g., a nucleoside monomer with a the pyrimidine nucleotide base comprising a chemical modification).
In some embodiments, the nucleoside monomers comprise nucleoside triphosphate monomers. In some embodiments, nucleoside triphosphate monomers comprises base-modified nucleoside triphosphate monomers (e.g., a nucleoside triphosphate monomer with a pyrimidine nucleotide base comprising a chemical modification) (
In some embodiments, the aptamers described herein are for binding a target protein. Non-limiting examples of target proteins include, but are not limited to, a spike protein (S1), a RBD (receptor binding domain) of S1, a tumor necrosis factor (TNF)-α protein, HIV (human immunodeficiency virus) reverse transcriptase, human epidermal growth factor receptor 2 (HER2), trypsin, angiotensin-converting enzyme 2 (ACE2), or thrombin. Other proteins may be targeted in accordance with the composition and methods described herein.
Non-limiting examples of aptamers for binding a spike protein (S1) include, but are not limited to SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, or SEQ ID NO: 20. Non-limiting examples phosphonamidite of aptamers for binding a tumor necrosis factor (TNF)-α protein include but are not limited to SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, or SEQ ID NO: 32.
In some embodiments, the present invention features nucleoside monomers according to the structure structures displayed in Table 1.
The single-stranded threose nucleic acid (TNA) aptamer (e.g., threomer) and/or the single stranded XNA aptamers described herein may be generated through enzymatic and/or chemical synthesis protocols. In some embodiments, the single-stranded threose nucleic acid (TNA) aptamer for binding a target protein comprising one or more base-modified threose nucleic acid (TNA) monomers, is created by using enzymatic synthesis using an engineered TNA polymerase and chemically synthesized TNA triphosphates. In other embodiments, the single-stranded threose nucleic acid (TNA) aptamer for binding a target protein comprising one or more base-modified threose nucleic acid (TNA) monomers, is created by automated solid phase oligonucleotide synthesis using chemically synthesized TNA phosphoramidites.
The following is a non-limiting example of the present invention. It is to be understood that said example is not intended to limit the present invention in any way. Equivalents or substitutes are within the scope of the present invention.
In an effort to improve the quality of TNA aptamers produced by in vitro selection, the potential for aromatic side chains to enhance the functional properties of large random-sequence TNA libraries was evaluated. The present study focused on tUTP derivatives that are chemically modified at the C5 position with Phe and Trp side chains (
First, the desired C5 modified tUTP substrates were synthesized (
The starting 5-iodo-1-(2′-O-benzoyl-L-threofuranosyl)-uracil nucleoside was prepared in eight steps from L-ascorbic acid (vitamin C) using a Vorbrüggen reaction to conjugate 5-iodo-uracil to an orthogonally protected threose sugar (
It was determined that the base modified TNA triphosphates were viable substrates for TNA synthesis by evaluating their incorporation into TNA oligonucleotides using a standard primer extension assay. Accordingly, an IR-labeled DNA primer annealed to a DNA template was incubated with a mixture of chemically synthesized tNTPs and Kod-RSGA for 2 hours at 55° C. In these reactions, the tNTP mixture contained either standard bases only or a tNTP solution in which the tTTP substrate was replaced with either tUTPPhe or tUTPTrp. Analysis of the primer extension reactions by denaturing polyacrylamide gel electrophoresis (PAGE) indicates that the primer was extended to full-length product in all cases, as evidenced by the presence of a discrete slower moving band in each lane of the gel (
GGATACCACCGAGACGACACACAATAGAGA
GGATACCACCGAGACGACACACAAUPheAGAGA
GGATACCACCGAGACGACACACAAUTrpAGAGA
aLC-ESI-Q-TOF, bPartial fragmentation of the benzyl moiety (C7H8), similar observation with MALDI-ToF
Encouraged by the synthesis and successful enzymatic incorporation of base-modified tNTPs into TNA, three chemically distinct libraries containing 1014 TNA oligonucleotides were prepared, each displayed on their encoding double-stranded (ds) DNA (
As a model system, TNA sequences were enriched for that bound to the glycosylated form of the S1 subunit of the spike protein of SARS-CoV-2 (
For each round of selection, the TNA libraries were incubated with free Ni-NTA beads to remove any TNA sequences that bound nonspecifically to the solid-support matrix. In round 1, a second negative selection step was performed on the naïve library using His-tagged hemagglutinin (HA), which was chosen as a generic off-target viral-coat protein. The eluted material was incubated with the S1 protein containing a C-terminal His-6 tag. Aptamers that bound specifically to the S1 protein were partitioned away from the unbound material by passing the solution over Ni-NTA beads to capture the aptamer-protein complexes. The beads were washed under high ionic strength conditions to remove weak affinity sequences that bound primarily through electrostatic interactions and the bound material was eluted from the Ni-NTA beads with imidazole. The eluted fractions were exchanged into water and amplified by PCR. Regeneration of the ssDNA library required a second PCR step using a PEG-modified DNA primer that allows for size separation of the PEGylated DNA template by denaturing PAGE. The ssDNA material served as the starting point for the next round of in vitro selection and amplification.
After round 4, the evolving pools of TNA sequences were subjected to high throughput next-generation DNA sequencing (NGS) using an Illumina NovaSeq platform to sequence the elution fractions from each round of selection (Table 3). The combined power of DNA display and NGS bypasses the need for extensive rounds of selection by identifying sequences with enrichment profiles that are indicative of high affinity binders. Bioinformatic analysis of the data reveals that populations containing the Phe and Trp side chains converged more rapidly toward a smaller number of unique sequences as compared to the unmodified library composed entirely of standard bases (
To better illuminate the relationship between sequence enrichment and function, members of each chemotype family were screened for affinity to the S1 protein. In this experiment, 65 sequences with promising enrichment profiles were selected and individually synthesized by polymerase-mediated primer extension on complementary DNA templates. Estimated binding affinities (KD) were determined by biolayer interferometry (BLI) using biotinylated aptamers that were immobilized onto the surface of streptavidin-coated biosensors and assayed for binding activity to S1 poised at a protein concentration of 100 nM. The screen revealed striking differences in the number of high-affinity aptamers produced from the modified libraries versus those obtained from the standard base library. A plot of estimated KD value versus library chemotype highlights these differences, which bin according to 3380 nM (n=30), 910 nM (n=20), and 430 nM (n=15) for the average binding affinities observed for the standard, Phe, and Trp libraries, respectively (
To more precisely evaluate the binding properties of the in vitro selected TNA aptamers isolated from the three chemotype libraries, full kinetic measurements were performed on a subset of the sequences that were predicted to bind S1 with a KD value of <50 nM. In total, 24 TNA aptamers were re-synthesized for further characterization by BLI (
To confirm the reproducibility of the data, a representative high performance S1 aptamer was evaluated from the standard and modified chemotype libraries in triplicate using independently synthesized material for each kinetic binding assay. The best modified and unmodified S1 aptamers (designated Trp-1 and standard-2, respectively) have average affinity values of 3.1∓1.0 nM and 34∓11 nM, respectively, for the S1 protein of SARS-CoV-2 (
Next, the binding affinity of threomers were improved by comparing experimental and computational methods for directed evolution. A doped library based on the Trp-1 sequence was synthesized that contained 85% identity and 15% diversity at each nucleotide position with a theoretical diversity of ˜3×109 unique members. The library was subjected to 1 round of selection for binding to S1 followed by NGS analysis to identify individual sequences present in the eluted material. The best experimental aptamer identified from ˜30 candidate sequences identified through a combination of enrichment and sequence context exhibited a KD of 850∓160 μM for S1 (
To evaluate the generality of functionally enhanced TNA libraries as a rich source of chemical diversity for producing superior TNA-based affinity reagents, a similar in vitro selection experiment was performed against a different protein target. For this study, tumor necrosis factor-alpha (TNF,
Last, the contribution of the modified side chains toward the activity and specificity of threomer binding to the S1 and TNF-α target proteins was evaluated. The importance of the hydrophobic amino acid side chains was evaluated by measuring the binding affinity of modified and unmodified versions of the Trp-1 and Trp-3 aptamers selected to bind S1 and TNF-α, respectively. In both cases, the resulting BLI sensorgrams clearly show that protein binding affinity is completely abrogated when the in vitro selected TNA sequences are synthesized with the natural base chemotype (
The data collected in the present study provides compelling evidence that functionally enhanced TNA libraries produce higher affinity binders than standard TNA libraries carrying only natural bases. This result was achieved through a combination of DNA display and high throughput NGS sequencing, which allowed for the isolation of base-modified threomers with KD values in the low nM range after only 3-4 rounds of selective amplification. The absence of significant numbers of high affinity binders from the unmodified library supports the hypothesis that planar aromatic side chains have the ability to mimic amino acid residues that are overrepresented at the paratope-epitope interface of antibody-antigen complexes. The generality of the conjugation chemistry established to construct uniformly modified TNA libraries is sufficiently versatile that it should be possible to explore new regions of chemical space by synthesizing a broader range of C5 modified tUTP substrates.
The path to functionally enhanced XNA libraries was limited by a number of synthetic challenges that are less severe for DNA-based affinity reagent technologies. TNA, like most XNAs, is assembled from building blocks that are not commercially available, and thus, must be obtained by chemical synthesis. In this case, a complete monomer set of all four TNA nucleoside triphosphates (tNTPs) requires 52 chemical transformations to convert vitamin C into each of the four tNTPs (tATP, tCTP, tTTP, and tGTP). The tUTPPhe and tUTPTrp substrates demanded an additional 30 chemical transformations (15 steps each) with the critical step being the palladium-catalyzed cross-coupling reaction required to functionalize the uracil nucleobase with an aromatic side chain. Once the substrates are prepared, it then becomes necessary to identify a polymerase that can synthesize functionally enhanced TNA oligonucleotides. In the current study, this was accomplished using Kod-RSGA, a recently evolved TNA polymerase developed for enhanced TNA synthesis activity on DNA templates.
Although antibodies remain the gold standard as protein affinity reagents, aptamers have a number of advantages that have caused them to grow in popularity. In addition to prolonged storage and shipping at ambient temperature, aptamers are produced by cell-free synthesis (chemical or enzymatic), which avoids viral or bacterial contamination problems associated with cellular protein production systems and allows for greater scalability. Aptamers have low immunogenicity, low batch-to-batch variability, and greater ease of chemical modifications than antibodies and other protein-based affinity reagents. However, despite these advantages, aptamers have historically suffered from poor biological stability and fast off-rates. The current study is part of a program of research designed to narrow the gap between antibodies and aptamers. The goal has been to establish an artificial genetic system that is both biologically stable and amenable to Darwinian evolution, and then to augment this system with the chemical functionality required to achieve high affinity interactions that are driven by slow off-rate binding kinetics.
In conclusion, the present invention has established threomers as a new class of biologically stable affinity reagents that function with enhanced target-binding activity.
Taq DNA polymerase, Bst 3.0 DNA polymerase, T4 DNA ligase, 10×T4 DNA ligase buffer, and 10× ThermoPol buffer were purchased from New England Biolabs (Ipswich, MA). PCRBIO HiFi polymerase was purchased from PCRBiosystems (Wayne, PA). Kod RSGA TNA polymerase was expressed and purified. Experiments were performed in DNA LoBind tubes, purchased from Eppendorf (Hamburg, Germany). DNA triphosphates were purchased from Thermo Fisher Scientific (Waltham, MA). TNA triphosphates bearing natural bases were synthesized. DNA oligonucleotides were purchased from Integrated DNA Technologies (Coralville, IA), purified by denaturing polyacrylamide gel electrophoresis (PAGE), electroeluted, buffer exchanged and concentrated using Millipore YM-3 or YM-30 Centricon centrifugal filter units, and quantified by UV absorbance via NanoDrop (Thermo Fisher Scientific). For the aptamer selection, Ni-NTA agarose resin was purchased from Qiagen (Hilden, Germany). His-tagged SARS-CoV-2 spike glycoprotein subunit 1 (S1), SARS-CoV-2 receptor binding domain (RBD) of spike glycoprotein subunit 1, and SARS-CoV-1 spike glycoprotein subunit 1 (S1) were purchased from ACROBiosystems (Newark, DE). His-tagged TNFα was expressed from E. coli as previously described. His-tagged TNFβ was purchased from Sino Biological (Wayne, PA). Streptavidin and Ni-NTA biosensor were purchased from ForteBio (Fremont, CA).
Three libraries were synthesized at the DNA level using a custom nucleotide distribution to minimize the occurrence of G-quadruplex structures. Each library contained a unique 6 nt barcode located between the forward primer and random region to signify the library chemotype (standard, Phe, and Trp). The initial libraries comprised an internal variable region of 40 nucleotides (30% A, 30% T, 20% G, 20% C) flanked on both sides by fixed-sequence primer binding sites. The libraries were purchased from the Keck Oligonucleotide Synthesis Facility (Yale University) and purified by 10% denaturing PAGE. The band corresponding to full-length product was excised, electroeluted, desalted, and quantified by UV absorbance. The phosphorylated DNA hairpin primer was ligated to the library by combining 5 nmol DNA library with 6 nmol of DNA hairpin in a final volume of 1 mL of 1×T4 DNA ligase buffer. The solution was denatured for 5 min at 95° C. and then annealed by incubating for 30 min at room temperature. Once annealed, 10,000 U of T4 DNA ligase was added and the reaction was incubated overnight at 24° C. The following day, the hairpin library was purified by 10% denaturing PAGE and quantified by UV absorbance.
The hairpin library was extended with tNTPs to form a chimeric TNA:DNA hairpin heteroduplex. In a 1 mL reaction volume containing 1 nmol hairpin library and 1× ThermoPol buffer was heated for 5 min at 95° C. to denature and then annealed by incubating for 30 min at 24° C. TNA polymerization was initiated by adding Kod RSGA (1 UM) and 100 M of each tNTP (tATP:tCTP:tGTP:tTTP or tUTPPhe or tUTPTrp) and incubating for 2 h at 55° C. Excess polymerase was extracted with phenol:chloroform:isoamyl alcohol (25:24:1, saturated with 10 mM Tris, pH 8.0, 1 mM EDTA). Following extraction, the TNA:DNA hairpin library was concentrated and the remaining tNTPs were removed by buffer exchange into water using a YM-30 Centricon filter device. Library was quantified by UV absorbance.
Next, the TNA strand was displaced by extending a DNA primer annealed to the hairpin loop with dNTPs. Accordingly, the TNA:DNA library (1 μM final) was resuspended in 1× ThermoPol buffer containing 500 UM dNTPs and 2 μM strand displacement primer. After heating for 5 min at 95° C. and slowly cooling to room temperature, the displacement reaction was initiated by adding Bst 3.0 DNA polymerase to a final concentration of 80 U/mL. The reaction was incubated for 1 h at 50° C. Bst polymerase was extracted with phenol:chloroform as described above. The TNA library was separated from the residual phenol and reaction components in a YM-30 Centricon. Any remaining phenol was removed by adding 900 μL of ethanol to the solution and drying completely. The libraries were resuspended in a selection buffer (S1: 150 mM NaCl, 25 mM Tris pH 8.0, TNFα: 20 mM imidazole, 450 mM NaCl, 20 mM Tris pH 8.0).
Aptamer selections were performed by passing each library through unmodified Ni-NTA agarose beads for 30 minutes at 24° C. with rotation to remove any sequences that bound to the affinity matrix. This step was performed for each round of selection. For the S1 selection, an additional negative selection step was performed with 1 μM hemagglutinin (HA) for 30 minutes at 24° C. with rotation, chosen as a generic viral coat protein. For this step, the flow through from the empty beads was passed through Ni-NTA beads containing the HA protein. The negative HA selection step was only performed in the first round of the S1 selection. The material collected from the flow-through of the negative selection was incubated with the appropriate His-tagged protein (either the S1 protein from SARS-CoV-2 or TNFα) poised at a concentration of 1 UM for 15 minutes at 24° C. with rotation. The mixture was incubated in a disposable plastic column with Ni-NTA beads for 15 minutes at 24° C. with rotation. The column was drained and washed three times with 400 μL of selection buffer to remove unbound and weakly bound molecules. A more stringent wash step was performed with 1 M NaCl to reduce the occurrence of nonspecific electrostatic interactions (S1: 1 M NaCl, 25 mM Tris pH 8.0, TNFα: 20 mM imidazole, 1 M NaCl, 20 mM Tris pH 8.0), followed by an additional wash with selection buffer to return to baseline salt concentrations before elution. Six 250 μL elutions were performed with elution buffer (S1: 500 mM imidazole, 150 mM NaCl, 25 mM Tris pH 8.0, TNFα: 100 mM imidazole, 450 mM NaCl, 20 mM Tris pH 8.0). Elution fractions were imaged on a LI-COR Odyssey CLx. The two elution fractions with the highest fluorescence were pooled and desalted using a NAP-5 DNA purification column (Cytiva, Marlborough, MA). TNA aptamers were amplified by PCR with library-specific PCR primers and purified using DNA Clean and Concentrator columns (Zymo Research, Irvine, CA). The single-stranded DNA library was regenerated through a second PCR using a PEGylated forward primer and the high fidelity PCRBIO HiFi polymerase, then purifying by 10% denaturing PAGE, cutting the corresponding band with a scalpel and recovering by electroelution.
86 bp library members that remained in the pool after each round of selection were PCR amplified and purified as above. Amplicons were prepared as barcoded libraries and sequenced on a NovaSeq6000 with the S4 flow cell with the paired end setting for 200 cycles for an estimated 2-400 M reads per library, per round.
Sequencing data was parsed by filtering, trimming and aligning reads. Enrichment scores over the course of the selection were calculated for the top 10,000 most abundant sequences in the final round.
A doped library of Trp-1 was synthesized at the DNA level with 15% doping of the other 3-nts at each position in the aptamer sequence. The DNA displayed TNA library was generated and taken through one round of selection exactly as described above with the exception that the S1 protein was poised at a concentration of 0.5 μM. TNA aptamers collected in the elution fractions were amplified by PCR with library-specific PCR primers, purified using DNA Clean and Concentrator columns (Zymo Research, Irvine, CA), and submitted for NGS analysis.
Aptamers identified by sequencing were prepared as biotin-labeled TNA molecules using the PBS8 biotin 20mer DNA primer and the corresponding template. A 500 μL reaction volume containing 0.5 nmol of both primer and template as well as 1× ThermoPol Buffer was heated for 5 min at 95° C. to anneal. TNA polymerization was initiated by adding 100 UM of each tNTP and Kod RSGA polymerase (1 UM for standard bases or Phe, 2 UM for Trp) and then incubated for 2 h at 55° C. Full length biotin-labeled aptamers were purified by 10% denaturing polyacrylamide gel electrophoresis for 1.5 h at 18 W constant. TNA was recovered from the gel by electroelution and then buffer exchanged into water and concentrated using a YM-3 Centricon centrifugal filter device. TNA concentration was quantified by NanoDrop absorbance. Aptamers at a concentration of 100 nM were folded in BLI binding buffer (125 mM NaCl, 20 mM HEPES pH 7.5, 2 mM CaCl2, 0.05% Tween 20) by denaturing for 15 min at 95° C. and then cooling for 1 h at room temperature.
TNA aptamers were screened to identify sequences of interest for further characterization. Prior to testing, all sensors were equilibrated in BLI binding buffer for ≥1 h. Aptamers at a concentration of 100 nM were loaded on a single biosensor. A fixed target concentration of 100 nM for S1 and 1000 nM for TNFα was used. The BLI run was performed with the following steps: a buffer only baseline for 60 sec to equilibrate sensors, loading the aptamer for 200 sec, a second buffer only baseline for 200 sec, an association phase with the target protein for 600 sec for S1 or 120 sec for TNFα, and a dissociation phase for 600 sec for S1 or 120 sec for TNFα. The plate and reagents were incubated at 30° C. for S1 samples or 24° C. for TNFα for the duration of the experiment and for 10 min prior to each run. Data was analyzed using the Octet Data Analysis HT software. For screens, Savitzky-Golay filtering was applied before fitting both association and dissociation curves together to calculate approximate KD values.
Aptamer sequences of interest were characterized with four different concentrations of target and one buffer only sensor to determine background. Prior to testing, all sensors were equilibrated in BLI binding buffer (defined above) for ≥1 h. After aptamer folding, the BLI run was performed with the following steps: a buffer only baseline for 60 sec to equilibrate sensors, loading the aptamer for 200 sec, a second buffer only baseline for 200 sec, an association phase with the target protein for 600 sec for S1 or 120 sec for TNFα, and a dissociation phase for 600 sec for S1 or 120 sec for TNFα. The plate and reagents were incubated at 30° C. for S1 or 24° C. for TNFα for the duration of the experiment and for 10 min prior to each run. Data was analyzed using the Octet Data Analysis HT software. For full kinetics measurements, the buffer only baseline sample was used to subtract background from all samples before applying Savitzky-Golay filtering and fitting both the association and dissociation curves together and applying a global fit to determine KD and other metrics.
A machine learning model based on the previously described Transformer architecture was trained using TensorFlow on the NGS data described above to predict the relative fraction of reads in the elution and flow-through pools from a TNA sequence. The Transformer encoder was trained with 8 attention heads per multiheaded layer and 3 encoder layers.
The model achieved a Most abundant versus Least abundant AUC of 0.69 and Most abundant vs Random AUC of 0.71 for the held out NGS test set. 10 k sequences were used for each of the Most abundant, Least abundant and Random pools.
Using this model, an in silico genetic algorithm was used to evolve a threomer with an estimated KD of 27 nM to the aforementioned threomer TNA 949 with a KD of 700±40 μM.
As used herein, the term “about” refers to plus or minus 10% of the referenced number.
Although there has been shown and described the preferred embodiment of the present invention, it will be readily apparent to those skilled in the art that modifications may be made thereto which do not exceed the scope of the appended claims. Therefore, the scope of the invention is only to be limited by the following claims. In some embodiments, the figures presented in this patent application are drawn to scale, including the angles, ratios of dimensions, etc. In some embodiments, the figures are representative only and the claims are not limited by the dimensions of the figures. In some embodiments, descriptions of the inventions described herein using the phrase “comprising” includes embodiments that could be described as “consisting essentially of” or “consisting of”, and as such the written description requirement for claiming one or more embodiments of the present invention using the phrase “consisting essentially of” or “consisting of” is met.
This application claims benefit of U.S. Provisional Application No. 63/227,489 filed Jul. 30, 2021, the specification of which is incorporated herein in their entirety by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2022/039086 | 8/2/2022 | WO |
Number | Date | Country | |
---|---|---|---|
63227489 | Jul 2021 | US |