The instant application is being filed along with a Sequence Listing text file in electronic format. The Sequence Listing is provided as a file entitled 2066_1011PCT_SL.txt, created on Aug. 1, 2019, which is 14,790,207 bytes in size. The subject matter of the Sequence Listing is incorporated herein by reference in its entirety.
The present disclosure relates to methods for identification of aptamers against a target of interest (e.g., an allergen). The disclosure also provides aptamers, signaling polynucleotides (SPNs), DNA chips, detection sensors and kits, and assays for detecting a target in a sample.
Nucleic acid aptamers are single-stranded oligonucleotides (DNAs, RNAs or DNA/RNA hybrids) that can bind to target molecules with high affinity and specificity. Nucleic acid aptamers are generally selected from a library of oligonucleotides with randomized sequences by an iterative process of adsorption, recovery and reamplification, for example, by conventional SELEX (Systematic Evolution of Ligands by Exponential Enrichment) and other closely related methods (See, e.g., U.S. Pat. Nos. 5,270,163; 5,567,588; 5,637,459; 5,670,637; 5,705,337; and 5,723,592). Aptamers can adapt unique secondary and tertiary structures and recognize targets with high affinity and specificity.
Aptamers provide a cost-effective alternative to antibodies as there is no need for aptamer selection in animals or cell lines, they have shelf-lives of years, and they can be easily modified to reduce cross-reactivity with undesired molecules. Aptamers have significant advantages over antibodies, such as better specificity and affinity, wider varieties of targets, easier synthesis and modification, higher stability and lower cost. These properties favor aptamers as new detection agents for wide applications in biosensor development, among other fields, for detecting the presence, absence and/or amount of target molecules in a sample. For example, aptamers and aptamer-based assays have been shown, among many other useful applications (e.g., diagnostic tests and therapy) as a promising alternative in food safety control, e.g., detection and control of pathogens, toxins, allergens and other forbidden contaminants in food matrices (Amaya-Gonzalez, et al., Sensors, 2013, 13:16292-16311; and Amaya-Gonzalez, et al., Anal. Chem. 2014, 86(5), 2733-2739). Aptamers based assays replace many immunoblotting methods using antibodies (e.g., ELISA).
Allergy (e.g., food allergy) is a common medical condition. It has been estimated that in the United States, up to 2 percent of adults and up to 8 percent of children, particularly those under three years of age, suffer from food allergies (about 15 million people), and this prevalence is believed to be increasing. Allergen detection, either in clinical settings or consumer based, is important to a person who is allergic to certain types of food, e.g., gluten and peanuts. Sensitive and specific detection agents against allergens are keys in developing detection assays that can efficiently and quickly test a suspect food product before consuming it. Aptamers that selectively bind to an allergen have been employed in many allergen detection sensors and assays (Weng and Neethirajan, Biosens Bioelectron, 2016, 85: 649-656; Svobodova et al., Food Chem., 2014, 165: 419-423; Tran et al., Biosens. Bioelectron, 2013, 43, 245-251; and Nadal et al., Plos One, 2012, 7(4): e35253). Studies have shown that an aptamer-based assay has significant advantages as compared to antibody-based immunoassay (e.g., ELISA).
The present disclosure developed a modified selection method for identifying aptamer sequences against a specific allergen target; the aptamers and/or signal polynucleotides derived from the aptamers can be directly used in detection assays with increased specificity and sensitivity. Specifically, the modified selection method combines several positive, negative and counter selection processes to identify aptamers that can specifically recognize a target molecule (e.g., an allergen protein) but the features (e.g., the primary and secondary structures) of the aptamers block the same aptamers bound to the target to hybridize to short oligonucleotides comprising sequences complementary to the same aptamers. Therefore, the target and the short complementary sequence do not bind to the same aptamer simultaneously.
The present disclosure provides screening methods tailored for selection of aptamers against target molecules that can be directly employed in competition-based target detection assays, such as allergen detection assays; the methods comprising several positive, negative (counter) selection processes to identify aptamers having particular primary and secondary structural features that when the aptamers are bound to target molecules to form aptamer.target complexes, they do not simultaneously hybridize to short oligonucleotides complementary to the aptamer sequences. The identified aptamers are suitable to develop chip sensors for target detection in which the aptamers or signal polynucleotides derived from the aptamers compete binding to their target molecule in the presence of oligonucleotides (i.e., anchor sequences) that are complementary to the sequences of the aptamers.
In some embodiments, the screening method comprises (a) preparing an input DNA library comprising a plurality of single stranded DNA (ssDNA) molecules, each of which comprises a central randomized nucleic acid sequence flanked by a constant sequence at the 5′ end and a constant sequence at the 3′ end, the constant 5′ end and the constant 3′ end functioning as primers; (b) selecting a pool of ssDNA molecules, from the input DNA library of (a), that substantially bind to a target material; (c) selecting a pool of ssDNA molecules, from the target binding pool of ssDNA molecules obtained in (b), that do not bind to the complementary sequences in the presence of the target material (i.e., do not simultaneously bind to the target and complementary sequences); (d) counter-selecting ssDNA molecules, from the positive binding pool of ssDNA molecules obtained in (c), that do not bind to the complementary sequences in the absence of the target material, or that substantially bind to counter target materials; and (e) subtracting the pool of ssDNA molecules obtained in (d) from the positive binding pool of ssDNA molecules in (c), and identifying candidate ssDNA molecules that specifically bind to the target of interest. Each sub-pool of ssDNA molecules can be identified through positive SELEX and/or on-chip selection processes and each process can be repeated several rounds at the same condition.
Accordingly, the aptamers identified via the present screening methods and signaling polynucleotides (SPNs) derived from the aptamers bind to their target molecule with high affinity and specificity. In some embodiments, the aptamers and SPNs may not hybridize to short complementary sequences in the presence of the target molecule, while they can bind to the short complementary sequences in the absence of the target molecule.
In some embodiments, the present screening method further comprises amplifying the ssDNA molecules in each pool after each selection process. The ssDNA molecules may be amplified by PCR using a pair of primers labeled with a fluorophore probe. The amplified and regenerated ssDNA molecules are therefore labeled with the fluorophore probe.
In some embodiments, the pool of ssDNA molecules that substantially bind to a target may be selected by a modified Graphene Oxide (GO)-SELEX process using an input ssDNA library and a target material. This positive selection may comprise the steps of (i) contacting the input ssDNA library with the target material wherein complexes are formed between the target and a plurality of ssDNA molecules present in the input library; (ii) partitioning the complexes formed in step (i) using a Graphene Oxide (GO) solution, and isolating the ssDNA molecules in the complexes to produce a subset of ssDNA molecules for the target material; (iii) contacting the subset of ssDNA molecules in (ii) with the same target material wherein complexes are formed between the target and a second plurality of ssDNA molecules present in the subset of ssDNA molecules to generate a second subset group of ssDNA molecules; and (iv) optionally repeating steps (ii) to (iii), one, two, three or more times to produce a respective third, fourth, fifth or more subset group of ssDNA molecules, thereby producing the enriched pool of ssDNA molecules that substantially bind to the target material.
In some embodiments, the positive pool of ssDNA molecules that do not bind to the complementary sequences in the presence of the target material may be selected through on-chip positive binding selection process using the target binding pool of ssDNA molecules (e.g., the pool of ssDNA molecules selected by the GO-SELEX process), the same target material and a solid support that is coated with a plurality of short oligonucleotides comprising sequences complementary to the sequences of the ssDNA molecules, e.g., the constant sequence at the 5′ end of the ssDNA molecules.
In some embodiments, the positive binding pool of ssDNA molecules selected by the on-chip positive selection process may be further refined to subtract non-specific ssDNA molecules. The counter selection may comprise: (i) counter selecting a pool of ssDNA molecules, from the positive pool of ssDNA molecules (e.g., the pool from the on-chip positive selection), that do not bind to the complementary sequences even in the absence of the target material (i.e. the non-binding ssDNA molecules); this selection including an on-chip non-binding counter process that uses the positive binding pool of ssDNA molecules as the input and a chip that is coated with short oligonucleotides comprising complementary sequences of the ssDNA molecules; and (ii) counter selecting a pool of ssDNA molecules, from the positive binding pool of ssDNA molecules, that substantially bind to counter target molecules; this selection including an on-chip counter binding process using the positive binding pool of ssDNA molecules as the input, one or more counter target materials and a chip that is coated with short oligonucleotides comprising sequences complementary to the sequences of the ssDNA molecules.
In some embodiments, the target material may be a common allergen such as a common food allergen. In one embodiment, the target material is peanut, almond, brazil nut, cashew, hazelnut, pecan, pistachio, walnut, gluten, whey and/or casein.
In another aspect, the present disclosure provides aptamers, signaling polynucleotides (SPNs), DNA chips, aptamer-based detection sensors and kits for detecting the presence, absence, and/or amount of a target (e.g., an allergen) in a sample.
In some embodiments, aptamer sequences that specifically bind to an allergen are selected by the present selection processes, wherein the allergen is a common food allergen, e.g., peanut, almond, brazil nut, cashew, hazelnut, pecan, pistachio, walnut, gluten, whey and casein. Aptamers that can bind to all nuts including peanut, almond, brazil nut, cashew, hazelnut, pecan, pistachio and walnut, may also be selected, for example, by multiple SELEX methods.
In some embodiments, aptamer sequences that specifically bind to peanut are selected, which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs.3 to 1002. In some examples, the aptamer against peanut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 1003 to 4002.
In some embodiments, aptamer sequences that specifically bind to almond are selected, which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID Nos. 4003 to 5002. In some examples, the aptamer against almond may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 5003 to 8002.
In some embodiments, aptamer sequences that specifically bind to brazil nut are selected, which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs. 8003 to 9002. In some examples, the aptamer against brazil nut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 9003 to 12002.
In some embodiments, aptamer sequences that specifically bind to cashew are selected, which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs. 12003 to 13002. In some examples, the aptamer against cashew may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 13003 to 16002.
In some embodiments, aptamer sequences that specifically bind to hazelnut are selected, which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs. 16003 to 17002. In some examples, the aptamer against hazelnut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 17003 to 20002.
In some embodiments, aptamer sequences that specifically bind to pecan are selected, which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs. 20003 to 21002. In some examples, the aptamer against pecan may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 21003 to 24002.
In some embodiments, aptamer sequences that specifically bind to pistachio are selected which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs. 24003 to 25002. In some examples, the aptamer against pistachio may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 25003 to 28002.
In some embodiments, aptamer sequences that specifically bind to walnut are selected, which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs. 28003 to 29002. In some examples, the aptamer against walnut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 29003 to 32002.
In some embodiments, aptamer sequences that can bind to all nuts are selected, which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs. 32003 to 33002. In some examples, the aptamer against all nuts may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 33003 to 36002.
In some embodiments, aptamer sequences that specifically bind to gluten are selected, which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs. 40003 to 41002. In some examples, the aptamer against gluten may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 41003 to 44002.
In some embodiments, aptamer sequences that specifically bind to whey are selected, which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs. 44003 to 45002. In some examples, the aptamer against whey may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 45003 to 48002.
In some embodiments, aptamer sequences that specifically bind to casein are selected, which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs. 48003 to 49002. In some examples, the aptamer against casein may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 49003 to 52002.
In some embodiments, aptamer sequences that specifically bind to a target control material may be selected. Such control sequences can be used together with the aptamer sequences that bind to the target in a detection assay. The control aptamer sequences have similar response to the sample (e.g., the food matrix) as the target specific aptamers. However, the control aptamers will not respond to the target (e.g., a target allergen) and have no binding affinity to the target specific aptamers or to the short anchor sequences complementary to the target specific aptamers. For example, aptamer sequences that bind to peanut control material may be used together with the aptamer sequences against peanut for detecting the presence/absence of peanut in a food sample. The peanut control sequences and the aptamer specific to peanut may demonstrate a similar response to the food type to be tested. Therefore, the signal from the peanut control sequences can be used as internal sample control.
In some examples, aptamer sequences that bind to peanut control material are selected which may comprise a unique nucleic acid sequence selected from the group consisting of SEQ ID NOs. 36003 to 37002. In some examples, the aptamer sequence for peanut control may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 37003 to 40002.
In accordance with the present disclosure, a SPN may comprise an aptamer selected by the present method that specifically binds to a target of interest and a short nucleic acid sequence that is complementary to the aptamer sequence. The short complementary sequences may be printed on a solid surface for a detection assay. In some embodiments, the short complementary sequence may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 52003 to 52042.
In further another aspect, the present disclosure provides methods for detecting the presence, absence and/or amount of a target in a sample using aptamers and SPNs identified by the present screening methods. In some embodiments, the target is a food allergen and the sample to be tested is a food sample. The food allergen may be peanut, almond, brazil nut, cashew, hazelnut, pecan, pistachio, walnut, gluten, whey and casein.
The foregoing has outlined rather broadly the features and technical advantages of the present disclosure in order that the detailed description of the disclosure that follows may be better understood. Additional features and advantages of the disclosure will be described hereinafter which form the subject of the claims of the disclosure. It should be appreciated by those skilled in the art that the conception and specific embodiment disclosed may be readily utilized as a basis for modifying or designing other structures for carrying out the same purposes of the present disclosure. It should also be realized by those skilled in the art that such equivalent constructions do not depart from the spirit and scope of the disclosure as set forth in the appended claims. The novel features which are believed to be characteristic of the disclosure, both as to its organization and method of operation, together with further objects and advantages will be better understood from the following description when considered in connection with the accompanying figures. It is to be expressly understood, however, that each of the figures is provided for the purpose of illustration and description only and is not intended as a definition of the limits of the present disclosure. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. In the case of conflict, the present description will control.
The present screening methods modify conventional aptamer selection methods, combining several positive and negative (counter) selections to identify aptamers that specifically bind to a target of interest. These selections mimic the conditions of competition-based detection assays in which aptamers (or SPNs derived from the aptamers) are used to capture their target and short oligonucleotides comprising sequences complementary to the aptamers are used to detect the presence or absence of the aptamer:target complexes. The competition particularly is between the target to which an aptamer can bind with high level of specificity and affinity, and complementary sequences of the aptamer. The selected aptamer sequences can specifically bind to their target, but only hybridize to short complementary sequences in the absence of the target. The selected aptamer sequences cannot bind to the short complementary sequences in the presence of their target. The present screening methods also select control aptamer sequences for a specific target material. The control aptamer sequences can be used in parallel with target specific aptamers and serve as internal control. The detailed description of the screening methods is included.
In order for the present disclosure to be more readily understood, certain terms and phrases are defined below. Additional terms and phrases are also defined and set forth through the specification.
As used herein, the term “aptamer” refers to a nucleic acid molecule or a peptide that can bind to a specific target molecule. A nucleic acid aptamer is a nucleic acid molecule having at least one binding site for a target molecule, such as another nucleic acid sequence, protein, peptide, antibody, small organic molecule, mineral, cell and tissue. A nucleic acid aptamer can be a single stranded or double stranded deoxyribonucleic acid (ssDNA or dsDNA), or ribonucleic acid (RNA), or a hybrid of DNA/RNA. Nucleic acid aptamers typically range from 10-150 nucleotides in length, for example, from 15-120 nucleotides in length, or from 20-100 nucleotides in length, or from 20-80 nucleotides in length, or from 30-90 nucleotides in length, or from 50-90 nucleotides in length. The nucleic acid sequence of an aptamer may optionally have a minimum length of one of 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100 nucleotides. In the context of the present disclosure, the term “aptamer” refers to a nucleic acid aptamer. The terms “a single stranded DNA(ssDNA) molecule,” and “aptamer” are used interchangeably.
An aptamer can fold into specific and stable secondary, tertiary, or quaternary conformational structures that enable it to bind to a target with high specificity and affinity. The structures may include, but are not limited to, hairpin loop, bulge loop, internal loop, multi-branch loop, pseudoknot, or combinations thereof. For example, the binding site of an aptamer may comprise a stem loop conformation or G-quartets.
Aptamers against a target may be naturally occurring or made by synthetic or recombinant means. An aptamer can be selected from a random oligonucleotide library through repeated rounds of in vitro partition, selection and amplification of nucleic acid molecules, e.g., conventional SELEX. As used herein, the term “SELEX” refers to a methodology known in the art as “Systematic Evolution of Ligands by Exponential Enrichment (SELEX)”. SELEX, or equivalently In vitro selection, is a powerful and widely used method to select nucleic acid sequences (i.e., aptamers) that bind to a target (e.g., a protein) with specificity and affinity (Ellington A D, et al., Nature, 1990, 346: 818-822; Tuerk C, et al., Science, 1990, 249: 505-510; and Gold L, et al., Anmi Rev Biochem, 1995, 64: 763-797). The SELEX process and various modifications are described in the art, e.g., U.S. Pat. Nos. 5,270,163; 5,567,588; 5,696,249; 5,853,984; 6,083,696; 6,376190; 6, 262, 774; 6,569,620; 6,706,482; 6,730,482; 6,933,116; 8,975,388; 8,975026; and 9,382,533; the contents of each of which are incorporated herein by reference in their entirety. The SELEX process is based on the unique insight that nucleic acids have sufficient capacity for forming a variety of two- and three-dimensional structures and sufficient chemical versatility available within their monomers to act as ligands (i.e., form specific binding complexes) with virtually any chemical compound, whether monomeric or polymeric. Molecules of any size or composition can serve as targets. SELEX relies as a starting point upon a large library of single stranded oligonucleotides comprising randomized sequences. The oligonucleotides can be modified or unmodified DNAs, RNAs, or DNA/RNA hybrids. In some examples, the library comprises 100% randomized or partially randomized oligonucleotides.
Nucleic acid aptamers show robust binding affinities to their target, preferably binding to the target with an equilibrium (Kd) less than 10−6, 10−8, 10−10, or 10−12. Aptamers also bind to the target molecule with a very high degree of specificity. It is preferred that aptamers have a Kd with the target molecule at least 10, 100, 1000, 10,000, or 100,000-fold lower than the Kd of other non-targeted molecules. In some examples, the aptamer selection process may be tailored to select aptamers with pre-defined parameters such as equilibrium (Kd), rate constants (Koff and Kon) and thermodynamic parameters (ΔH and ΔS) of aptamer-target interaction.
Aptamers may comprise naturally occurring nucleotides, and/or modified nucleotides including but not limited to chemically modified nucleobases, unnatural bases (e.g., 2-aminopurine), nucleotide analogs, addition of a label (e.g., a fluorophore), addition of a conjugate, or mixtures of any of the above. The nucleic acid sequence of an aptamer can be modified as desired so long as the functional aspects are still maintained (e.g., binding to the target).
As used herein, the terms “nucleic acid”, “oligonucleotide” and “polynucleotide” are used interchangeably to refer to a polymer of nucleotides of any length, and such nucleotides may include deoxyribonucleotides (DNAs), ribonucleotides (RNAs), and/or analogs or chemically modified deoxyribonucleotides or ribonucleotides and RNA/DNA hybrids. The terms “nucleic acid”, “oligonucleotide” and “polynucleotide” include double- or single-stranded molecules as well as triple-helical molecules. A nucleic acid molecule may comprise at least one chemical modification.
As used herein, the term “primary structure” of a nucleic acid molecule refers to its nucleotide sequence. The “secondary structure” of a nucleic acid molecule include, but is not limited to, a hairpin loop, a bulge loop, an internal loop, a multi-branch loop, a pseudoknot or combinations thereof. “Pre-selected secondary structures” refer to those secondary structures that are selected and engineered into an aptamer by design.
As used herein, the term “complementary” refer to the natural binding of polynucleotides by base pairing such as A-T(U) and C-G pairs. Two single-stranded molecules may be partially complementary such that only some of the nucleic acids bind, or it may be “complete,” such that total complementarity exists between the single stranded molecules. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of the hybridization between the nucleic acid strands. As used herein, the term “hybridization” or “hybridize to” refers to the process by which a polynucleotide strand anneals with a complementary strand through base pairing under defined hybridization conditions. Specific hybridization is an indication that two nucleic acid sequences share a high degree of identity. Specific hybridization complexes form under permissive annealing conditions.
As used herein, the term “high affinity” refers to the binding of a candidate aptamer to a target with binding dissociation constant Ka less than 100 nM. The “specific binding affinity” of an aptamer for its target means that the aptamer binds to its target generally with a much higher degree of affinity than it binds to other components in a test sample. In similar, the term “specifically binds” means that an aptamer reacts or associates more frequently, more rapidly, with greater duration and with greater affinity with a particular target molecule, than it does with non-target molecules. For example, an aptamer against a target allergen binds to that allergen or a structural part or fragment thereof with greater affinity, avidity, more readily, and/or with greater duration than it binds to unrelated allergen proteins and/or parts or fragments thereof. It is also understood by reading this definition that, for example, an aptamer that specifically binds to a first target may or may not specifically bind to a second target. As such, “specific binding” does not necessarily require exclusive binding or non-detectable binding of another molecule, this is encompassed by the term “selective binding”. The specificity of binding is defined in terms of the comparative dissociation constants (Kd) of the aptamer for its target as compared to the dissociation constant with respect to the aptamer and other materials in the environment or unrelated molecules in general. Typically, the Kd for the aptamer with respect to the target will be 2-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold less than the Ka with respect to the target and the unrelated molecule or accompanying molecule in the environment. Even more preferably, the Ka will be 50-fold, 100-fold, 150-fold or 200-fold less.
As used herein, the term “amplification” or “amplifying” means any process or combination of steps that increases the amount or number of copies of a molecule or class of molecules. The amplification of a nucleic acid molecule is generally carried out but not limiting to using polymerase chain reaction (PCR) (e.g., U.S. Pat. Nos. 4,683,195 and 4,683,202; the contents of each of which are herein incorporated by reference in their entirety).
As used herein, the term “library,” or “pool,” or “subset” refers to a plurality of compounds, e.g. single stranded DNA (ssDNA) molecules.
As used herein, the terms “target molecule,” “target material” and “target” are used interchangeably to refer to any molecule to which an aptamer can bind. “Target molecules” or “targets” can be, for example, proteins, polypeptides, nucleic acids, carbohydrates, lipids, polysaccharides, glycoproteins, hormones, receptors, antigens, antibodies, affybodies, antibody mimics, viruses, pathogens, toxic substances, substrates, metabolites, transition state analogs, cofactors, inhibitors, drugs, small molecules, dyes, nutrients, pollutants, growth factors, cells, tissues, or microorganisms and any fragment or portion of any of the foregoing. In one embodiment, a target may be an allergenic protein.
As used herein, the term “counter target” refers to a molecule belonging to a family which has a similar structure, a similar active site, or similar activity to a target or a target material. In the context of the present disclosure, a counter target can be any molecules to which a selected aptamer against a target of interest has no cross-specificity. Counter targets may be used in counter selection processes to refine aptamer candidates for separating sequences that cross-recognize other closely related molecules.
As used herein, the term “allergen” means a compound, substance or composition that causes, elicits or triggers an immune reaction in a subject. As such, allergens are typically referred to as antigens. An allergen is typically a protein or a polypeptide.
As used herein, the terms “polypeptide,” “peptide,” and “protein” are used interchangeably herein to refer to polymers of amino acids of any length.
As used herein, the term “sample” means a composition that contains or is assumed to contain one or more targets to be tested. A sample may be, but is not limited to, a biological sample obtained from a subject (including human and animal), a sample obtained from the environment (e.g., soil sample, water sample, agriculture sample such as a plant and a crop sample), a chemical sample, and a food sample.
In accordance with the present disclosure, the selection method is modified to identify aptamer candidates that can recognize a target molecule with high specificity and affinity and lower cross-reactivity with counter targets, and that do not hybridize to oligonucleotides that are complementary to the aptamer sequences in the presence of the target. The selected aptamers and SPNs derived from these aptamers can be used as detection agents in competition-based detection assays in which target molecules in a test sample and the complementary oligonucleotides compete binding to the aptamers (or SPNs).
In accordance with the present disclosure, the aptamer screening method may comprise (a) preparing an input DNA library comprising a plurality of single stranded DNA (ssDNA) molecules, each of which comprises a central randomized nucleic acid sequence flanked by a constant sequence at the 5′ end and a constant sequence at the 3′ end, the constant 5′ end and the constant 3′ end functioning as primers; (b) selecting a pool of ssDNA molecules, from the input DNA library of (a), that substantially bind to a target material; (c) selecting a pool of ssDNA molecules, from the target binding pool of ssDNA molecules obtained in (b), that do not bind to the complementary sequences in the presence of the target material (i.e., do not simultaneously bind to the target and complementary sequences); (d) counter-selecting ssDNA molecules, from the positive binding pool of ssDNA molecules obtained in (c), that do not bind to the complementary sequences in the absence of the target material (referred to as non-binding ssDNA molecules), or that substantially bind to counter target materials (cross-specificity); and (e) subtracting the pool of ssDNA molecules obtained in (d) from the positive binding pool of ssDNA molecules in (c), and identifying candidate ssDNA molecules that specifically bind to the target of interest.
The present screening methods combine several positive target binding selections (e.g., positive SELEX and on-ship SELEX), non-binding counter selections, complementary hybridization selections and counter target binding selections. Candidate aptamers are identified through repeated positive and negative selections, sequence amplification and sequencing analysis. The modified screening method affords improved efficiency in aptamer selection as compared to conventional SELEX and other known methods in the art and ensures selection of aptamers that preferably bind to a target molecule to short complementary nucleic acid sequences. The flow chart in
In some embodiments, a pool of ssDNA molecules that substantially bind to a target molecule may be selected by a positive target-binding selection process comprising repeated target binding, partition, isolation and amplification of nucleic acid sequences using an input library comprising randomized ssDNA (single stranded DNA) molecules and a target material. Conventional aptamer selection processes may be used such as systematic evolution of ligands by exponential enrichment (SELEX), selected and amplified binding site (SAAB), cyclic amplification and selection of targets (CASTing), or the like. As a non-limiting example, a plurality of sequences that form ssDNA:target complexes may be identified by performing several rounds of positive Graphene Oxide (GO)-SELEX selection using an input ssDNA library comprising randomized single stranded DNA sequences and a target material (
SELEX procedure generally involves a progressive selection, from a large library of double-stranded or single-stranded nucleic acids (DNAs, RNAs or DNA/RNA hybrids), of variable nucleic acid sequences that bind to a target of interest with high affinities and specificities by repeated rounds of target partition and amplification.
Each round of SELEX process consists of several steps including preparation of nucleic acid libraries, formation of nucleic acid-target complexes, separation between bound and unbound sequences, elution of aptamers, PCR amplification, and identification of aptamers specific to the target. Each round of selection enriches aptamer candidates from the nucleic acid library.
The input nucleic acid library may comprise a plurality of single-stranded DNA (ssDNA) molecules with randomized sequences. The ssDNA may be 50-150 nucleotides in length, for example, the ssDNA in the library is about 50 to 140 nucleotides in length, or about 50 to 130 nucleotides in length, or about 50 to 120 nucleotides in length, or about 50 to 100 nucleotides in length, or about 60 to 80 nucleotides in length, or about 70 to 90 nucleotides in length, or about 70-80 nucleotides in length. In some embodiments, the ssDNA in the library may be 60 nucleotides in length, or 61 nucleotides in length, or 62 nucleotides in length, or 63 nucleotides in length, or 64 nucleotides in length, or 65 nucleotides in length, or 66 nucleotides in length, or 67 nucleotides in length, or 68 nucleotides in length, or 69 nucleotides in length, or 70 nucleotides in length, or 71 nucleotides in length, or 72 nucleotides in length, or 73 nucleotides in length, or 74 nucleotides in length, or 75 nucleotides in length, or 76 nucleotides in length, or 77 nucleotides in length, or 78 nucleotides in length, or 79 nucleotides in length, or 80 nucleotides in length, or 81 nucleotides in length, or 82 nucleotides in length, or 83 nucleotides in length, or 84 nucleotides in length, or 85 nucleotides in length, or 86 nucleotides in length, or 87 nucleotides in length, or 88 nucleotides in length, or 89 nucleotides in length, or 90 nucleotides in length, or 91 nucleotides in length, or 92 nucleotides in length, or 93 nucleotides in length, or 94 nucleotides in length, or 95 nucleotides in length, or 96 nucleotides in length, or 97 nucleotides in length, or 98 nucleotides in length, or 99 nucleotides in length, or 100 nucleotides in length. Each ssDNA molecule in the library comprises a randomized nucleic acid sequence at the center flanked by a constant sequence at the 5′ end and a constant sequence at the 3′ end that serve as PCR primers, where the sequences of the primers are known, and the central randomized sequence may be 30 to 50 nucleotides in length. The randomized sequences can be produced in a number of ways including chemical synthesis and size selection from randomly cleaved cellular nucleic acids. Sequence variation in test nucleic acids can also be introduced or increased by mutagenesis before or during the selection/amplification iterations.
As a non-limiting example, the input ssDNA molecule library may be generated by automated chemical synthesis on a DNA synthesizer.
As used herein, the “central randomized nucleic acid sequence” within an ss DNA may also be referred to as the “inner sequence” of the ssDNA.
In one preferred embodiment, the ssDNA molecules in the input library are 76 nucleotides in length, wherein a central randomized nucleic acid sequence with 30 nucleotides in length is flanked by two 23 nucleotides primers at the 5′ end and 3′-end of each ssDNA. As a non-limiting example, the 5′ end primer may comprise a nucleic acid sequence of 5′ TAGGGAAGAGAAGGACATATGAT3′ (SEQ ID NO. 1) and the 3′ end primer may comprise a nucleic acid sequence of 5′ TTGACTAGTACATGACCACTTGA 3′ (SEQ ID NO. 2).
As used herein, the term “primer” refers to a short nucleic acid which is capable of acting as a point of initiation of synthesis (e.g., PCR) when placed under conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is induced, (i.e., in the presence of nucleotides and an inducing agent such as DNA polymerase and at a suitable temperature and pH). The primer is preferably single stranded for maximum efficiency in amplification but may alternatively be double stranded.
The input DNA library may be mixed with a target wherein the complexes are formed between the target and a plurality of ssDNA molecules present in the library. The target may be any molecule (e.g., nucleic acids, proteins, small molecules, sugars, toxins, biomarkers, cells and pathogens). In some embodiments, the target is a protein, such as an allergen protein or mixed allergen components of an allergen. The allergen may include, but is not limited to, a food allergen, an allergen from the environment such as plants, animals, microorganisms, air or water, and a medical allergen (i.e., any allergen found in a medicine or medical device).
Food allergens include, but are not limited to proteins in legumes such as peanuts, peas, lentils and beans, as well as the legume-related plant lupin, tree nuts such as almond, cashew, walnut, Brazil nut, filbert/hazelnut, pecan, pistachio, walnut, beechnut, butternut, chestnut, chinquapin nut, coconut, ginkgo nut, lychee nut, macadamia nut, nangai nut and pine nut, egg, fish, shellfish such as crab, crawfish, lobster, shrimp and prawns, mollusks such as clams, oysters, mussels and scallops, milk, soy, wheat, gluten, corn, meat such as beef, pork, lamb, mutton and chicken, gelatin, sulphite, seeds such as sesame, sunflower and poppy seeds, and spices such as coriander, garlic and mustard, fruits, vegetables such as celery, and rice. Some exemplary allergenic proteins from food allergens may include the parvalbumins in codfish, tropomyosin in crustaceans, arginine kinase and myosin light chain, casein, α-lactalbumin and 3 lactoglobulin in milk, and globulin or vicilin seed storage protein.
Other target molecules include, but are not limited to, pathogens from a pathogenic microorganism in a sample, such as bacteria, yeasts, fungi, spores, viruses and prions; disease proteins (e.g., biomarkers for diseases diagnosis and prognosis); pesticides and fertilizers remained in the environment; and toxins. Targets may include non-protein compounds such as minerals and small molecules (e.g., antibiotics).
In some embodiments, the steps for selecting an enriched pool of ssDNA molecules that substantially bind to the target material may comprise (i) contacting the input ssDNA library with the target material wherein complexes are formed between the target and a plurality of ssDNA molecules present in the input library; (ii) partitioning the complexes formed in step (i) from the unbound ssDNA molecules and isolating the ssDNA molecules in the complexes to produce a subset of ssDNA molecules for the target material and amplifying the isolated subset of ssDNA molecules; (iii) contacting the enriched subset of ssDNA molecules from step (ii) with the same target material wherein complexes are formed between the target and a second plurality of ssDNA molecules present in the enriched library to generate a second enriched subset group of ssDNA molecules; and (iv) optionally repeating steps of binding, partition, isolation and amplification (steps (i) to (iii)), one, two, three, four or more times as desired to yield highly specific, high affinity ssDNA molecules to the target molecule, thereby producing the enriched pool of ssDNA molecules that substantially bind to the target material (e.g., Pool 1 in Table 1).
In one embodiment, a graphene oxide (GO)-SELEX process modified from the general SELEX method is performed to select the target binding pool. As used herein, the terms “graphene,” “graphene oxide (GO),” “graphene oxide nanosheet” and “graphene nanosheet” mean two-dimensional carbon structures and are used interchangeably throughout the present specification. The exposed nucleobases in the ssDNA molecules can be absorbed to the surface of graphene oxide (Chen et al., J. Agric. Food Chem. 2014; 62, 10368-10374). Accordingly, when a graphene oxide (GO) solution is added to the mixture of a ssDNA library and a target material, GO can adsorb the ssDNA sequences that are not bound to a specific target, to its surface, and let the sequences bound to the target free. The unbound sequences and GO can then be removed, e.g., by centrifugation, while the ssDNA molecules that bind to a specific target are not absorbed to the surface of GO and then recovered and employed in the following selection process. This process can avoid the need to immobilize the target material as used in conventional SELEX.
The GO-SELEX process is inexpensive, fast, and simple. In a conventional SELEX, many expensive, less efficient and time-consuming methods such as chromatography, an affinity column, and the like, are used to separate nucleic acid molecules which are bound to a target material from nucleic acid molecules which are not bound to the target material. The GO-SELEX process is characterized in that the separation of binding ssDNA molecules from non-binding ssDNA molecules can be carried out simply by centrifugation even if the target material or the counter-target material is not specifically immobilized to a specific carrier (Nguyen et al., Chem. Commun. 2014, 50, 10513-10516; the contents of which are incorporated by reference herein in their entirety.)
After removing GO absorbed ssDNA molecules (e.g., by centrifugation), the target material may be removed from the collected DNA:target complexes. Methods for participating proteins in a solution well known in the art, for example, ethanol precipitation and strataclean resin may be used. As a non-limiting example, a strataclean resin may be added to the supernatant recovered after centrifugation. The target material bound to the strataclean resin can be removed by centrifugation. The target removal step may be repeated for two, three, four or more times. A supernatant containing the enriched ssDNA molecules that bind to the target material may be used for next target binding selection round (
In some embodiments, the final concentration of ssDNA molecules that bind to the target material may be measured and compared to the initial concentration of the input ssDNA library. The ratio of the concentrations will be used to determine if another round of the GO-SELEX selection is needed. If the ratio is below 50%, another round of positive GO-SELEX process is carried out with the same condition. The same process is repeated until the recovery of ssDNA molecules that bind to the target material reaches to a satisfactory ratio, e.g., above 50% recovery. Rounds of partition and isolation are repeated until a desired goal is achieved, for example, two, three, four, five, six, seven, eight or more times with the same condition. In the most general case, selection is continued until no significant improvement in binding strength is achieved on repetition of the selection round.
The target binding pool of ssDNA molecules selected from the positive GO-SELEX process may be further amplified by performing a PCR using labeled primers, e.g., a biotinylated reverse primer and a fluorophore-labeled forward primer. In other aspects, the ssDNA molecules can be amplified by any other known method, such as sequencing the selected sequences and synthesizing them synthetically using an oligonucleotide synthesizer for the next round of binding and selection.
The fluorophore that is conjugated to the forward primer may be, but is not limited to, Cy5, Alexa Fluor 350, Alexa Fluor 430, Alexa Fluor 488, Alexa Fluor 532, Alexa Fluor 546, Alexa Fluor 594, Alexa Fluor 647, Alexa Fluor 658, Cyanine-3, Cyanine-5, fluorescein, Texas red, FITC (Fluorescein Isothiocyanate), rhodamine, or the like. In one preferred embodiment, the forward primer is conjugated with Cy5. In another embodiment, the forward primer is conjugated with Alexa Fluor 647.
After PCR amplification, the resulted double stranded DNA (dsDNA) molecules may be cleaned and further denatured to regenerate single stranded DNA (ssDNA) molecules. The biotinylated reverse primer allows for removal of the complementary strands to regenerate ssDNA molecules from the dsDNA molecules created during PCR amplification. As a non-limiting example, streptavidin coated magnetic beads may be added to the PCR product. The biotinylated complementary strands bind to the streptavidin coated magnetic beads. After denaturation of ssDNA molecules (e.g., addition of a base), the bound biotinylated complementary strands are separated and removed using a magnetic force. The desired ssDNA molecules with the fluorophore tags are collected. The fluorophore (e.g., Cy5 and Alexa Fluor 647) tagged ssDNA molecules that substantially bind to the target are used for next selection process.
The fluorophore (e.g., Cy5) tagged ssDNA molecules give several advantages in developing detection agents used in competition-based detection assays. The addition of Cy5 or other fluorescence markers to the ssDNA sequences at the beginning of aptamer selection can ensure that all aptamer candidates have proper secondary and tertiary structures when they are further developed as signaling polynucleotides (SPNs) used in detection assays. The addition of Cy5 or another fluorescence marker at the later stage may influence the secondary and tertiary structures of aptamer candidates. This modification can significantly reduce false hits during the selection.
As a non-limiting example, the GO-SELEX process to identify a pool of sequences that substantially bind to a target may comprise the steps of (i) mixing the input ssDNA library (e.g., Pool 0 in Table 1) with an allergen composition in a buffer solution; and these are induced to be bound to each other at normal temperature; (ii) adding a graphene oxide solution to the mixture of step (i) to remove ssDNA molecules which are not bound to the target; (iii) removing the target from the collected ssDNA molecules and amplifying the ssDNA molecules by performing a PCR using the PCR primers at the ends of the ssDNA molecules; and (iv) denaturing the double stranded PCT products and collecting ssDNA molecules labeled with fluorophore. Optionally, the positive GO-SELEX selection may be repeated for 2, 3, 4, 5, 6, 7, 8, 9, 10, or more rounds. The ssDNA molecules in the input library comprise approximately 76 nucleotides in length, including a primer for PCR amplification at each end and about 30 nucleotides (the binding site) at its center (i.e., the inner sequence of an aptamer). The target material is an allergen material, particularly a food allergen, comprising one allergenic component, or a mixture of allergenic components from a single allergen. Food allergens may include but are not limited to proteins in legumes such as peanuts, peas, lentils and beans, tree nuts (e.g., almond, brazil nut, cashew, hazelnut, pecan, pistachio and walnut), wheat, milk, fish, egg white and sea food.
In some embodiments, the 5′ constant sequence (i.e., the 5′ primer) comprises a nucleic acid sequence of SEQ ID NO. 1 and the 3′ constant sequence (i.e., the 3′ primer) comprises a nucleic acid sequence of SEQ ID NO. 2.
The target binding pool of ssDNA molecules (e.g., Pool 1 in Table 1) may be further partitioned to select a subset of sequences that substantially bind to the target and compete with short oligonucleotides having sequences complementary to the ssDNA molecules. In some embodiments, this positive target binding selection may be performed using solid supports (e.g., glass or plastic chips) that are coated with short oligonucleotides comprising sequences complementary to the ssDNA molecules. By this process, families of nucleic acid sequences which can simultaneously bind to a target molecule and their complementary sequences may be subtracted from the pool. This additional positive selection process is tailored to differentiate ssDNA molecules that bind to the target and to the complementary sequences attached to a solid support (e.g., a glass or plastic chip).
The short oligonucleotide anchors may comprise sequences complementary to the constant sequences at the ends of the ssDNA molecules. The oligonucleotide anchors may comprise sequences complementary to either the 5′ end or 3′ end sequence of the aptamers. The complementary sequence contains about 5-25 nucleotides, or 5-18 nucleotides, or 6-20 nucleotides, or 8-20 nucleotides. For example, it may comprise 5 nucleotides, 6 nucleotides, 7 nucleotides, 8 nucleotides, 9 nucleotides, 10 nucleotides, 11 nucleotides, 12 nucleotides, 13 nucleotides, 14 nucleotides, 15 nucleotides, 16 nucleotides, 17 nucleotides, 18 nucleotides, 19 nucleotides, 20 nucleotides, 21 nucleotides, 22 nucleotides, 23 nucleotides, 24 nucleotides, or 25 nucleotides. In one preferred example, the complementary anchor sequence contains 5-15 nucleotides. The oligonucleotide anchor may be 100%, or 99%, or 98%, or 97%, or 96%, or 95%, or 94%, or 93%, or 92%, or 91%, or 90% complementary to the sequence of an ssDNA. The short complementary sequences are covalently linked to a solid support such as a glass chip, directly or through a linker.
The solid support on which the oligonucleotides are covalently attached may include, but is not limited to, a glass, a polymer support (e.g., see, U.S. Pat. No. 5,919,525), polyacrylamide gel, or plastic (e.g., a microwell plate), or a nylon membrane. The glass may be a polymer glass (e.g., acrylic glass, polycarbonate and polyethylene terephthalate), or a silicate glass (e.g., Pyrex glass, quartz and germanium-oxide glass), or a porous glass, etc., Polymers may include, but are not limited to, polyimide, photoresist, SU-8 negative photoresist, polydimethylsiloxane (PDMS), silicone elastomer PDMS and COC. In one preferred embodiment, the solid support is a glass chip.
Different technologies may be used for attaching the short complementary sequences to the solid support at determined sites. These methods are well-known in the pertinent art. For example, the oligonucleotides can be deposited on specific sites on the solid support as microdroplets by ink jet, or piezoelectric, or other similar methods. The solid support may be pre-treated to provide active attaching surfaces for oligonucleotides. In addition, the density of the attached oligonucleotides may be measured and controlled on the solid support.
In one embodiment, The on-chip target binding selection may comprise the steps: (i) mixing the target binding pool of ssDNA molecules (i.e., Pool 1) with the same target material in a buffer solution; and they are induced to be bound to each other at normal temperature; (ii) contacting the mixture of step (i) with a solid support of which the surface is covalently coated with short oligonucleotides comprising sequences complementary to the sequences of ssDNA molecules; (iii) collecting the ssDNA:target complexes that are not bound to the solid support (e.g., the flow-through) (
By this selection, a subset of ssDNA molecules in Pool 1 that are not bound to the target material will hybridize to the complimentary sequences covalently attached to the solid support and be removed from the pool. In addition, a subset of ssDNA molecules bound to the target material may also hybridize to the complimentary sequences attached to the solid support. These ssDNA molecules stay on the solid support and are subtracted from the collected ssDNA pool.
The collected ssDNA molecules from the final incubation are cleaned and separated from the target material as described herein. Similar to the positive GO-SELEX selection, the concentration of the recovered ssDNA molecules is measured and compared to the input pool (Pool 1). In some embodiments, the on-chip positive selection may be repeated for two, three, four, five, six, seven, eight or more rounds until the ratio of the recovered ssDNA molecules reaches to a desired recovery ratio (e.g., more than 50% from the input pool).
The ssDNA molecules are then amplified by performing PCR and single stranded DNA molecules are recovered as described in the positive GO-SELEX process. By this selection process, a pool of ssDNA molecules that bind to the target material but do not hybridize to the complementary sequences in the presence of the target material is selected (i.e. positive binding pool (Pool 2) in Table 1). The selection process mimics a condition used in a competition-based detection assay. The combination of regular SELEX (e.g., GO-SELEX) and on-chip positive target binding processes increases the specificity and affinity of aptamers.
The positive pool of ssDNA molecules (Pool 2) containing aptamer candidates that bind to the target material in the presence of the complementary sequences may be further screened to isolate ssDNA molecules that do not bind to the complementary sequences even when the sequences are free, and sequences that substantially bind to counter target materials in addition to the target of interest. In some embodiments, these non-specific ssDNA sequences may be isolated by counter selection processes using solid supports (e.g., glass chips) precoated with short oligonucleotides comprising sequences complementary to the ssDNA molecules.
In some embodiments, an on-chip non-binding counter selection is performed to identify ssDNA molecules that do not hybridize to their complementary sequences in the pool even when they are free. A DNA solution comprising the positive pool of ssDNA molecules (Pool 2), without addition of the target material, is directly incubated with a solid support (e.g., a glass chip) that is precoated with short oligonucleotides comprising sequences complementary to the aptamers in the pool. After incubation, the DNA solution including the unbound ssDNA molecules is collected (i.e. the flow-through) (
In one preferred embodiment, the on-chip non-binding counter process may comprise the steps: (i) preparing a DNA solution comprising the positive binding pool of ssDNA molecules (Pool 2); (ii) contacting the DNA solution with a solid support coated with short oligonucleotides comprising sequences complementary to the ssDNA molecules; (iii) collecting the ssDNA solution after incubation; and (iv) contacting the collected solution again with a new solid support coated with the complementary sequences. These steps may be repeated for two, three, four, five, six, seven or eight rounds and the collected ssDNA solution from the final incubation will be cleaned and amplified for sequencing. In one embodiment, these steps are repeated for eight rounds and the collected ssDNA solution from the final incubation are cleaned and amplified for sequencing.
This on-chip non-binding counter selection creates a non-binding pool of ssDNA molecules (i.e., Pool 3 in Table 1) including ssDNA molecules that cannot hybridize to the complementary sequences even in the absence of the target material.
In other embodiments, an on-chip counter binding selection is performed to isolate any sequences that can bind to non-specific counter targets from the positive binding pool of ssDNA molecules (Pool 2 in Table 1). This counter selection process improves the target specificity of selected ssDNA molecules by eliminating nucleic acid sequences with cross-reactivity to one or more non-target molecules (e.g., counter targets).
In one preferred embodiment, the on-chip counter selection process may comprise the steps of (i) preparing a ssDNA solution comprising the positive binding pool of ssDNA molecules (Pool 2) and incubating the ssDNA solution with a counter target or a mixture of counter targets; (ii) contacting the mixture of step (i) with a solid support that is coated with short oligonucleotides comprising sequences complementary to the ssDNA molecules; (iii) collecting the ssDNA/counter target complexes after step (ii) (i.e. the flow-through) (
In some embodiments, the on-chip counter binding selection may be repeated for as many counter targets as desired, beginning each time with the same pool of ssDNA molecules from the positive binding pool (i.e., Pool 2 in Table 1). In some alternative embodiments, multiple counter targets can be run within the same round in parallel.
By this on-chip counter selection process, ssDNA sequences with cross-specificity towards undesirable related proteins (counter target material) are removed from the positive binding pool. This counter selection process creates a pool of ssDNA molecules (i.e., Pool 4 in Table 1) including the ssDNA molecules that can cross react with a counter target or several counter targets.
As non-limiting examples, the counter targets may be allergen proteins in the same family, including allergen proteins from different sources that can be attributed to these structurally related allergen families, e.g., prolamins family including seed storage proteins (e.g., Sec c 20 in Rye; Tri a 19 in wheat and Tri a 36 in wheat), non-specific lipid transfer proteins family (e.g., Act d 10 in Kiwi, Api g 2 in celery, Ara h 9 in peanut, Cas s 8 in chestnut, Cor a 8 in hazelnut, Jug r 3 in walnut, Lyc e 3 in tomato, Mus a 3 in banana, and Pru du 3 in almond), 2S albumins family including seed storage proteins (e.g., Ana o 3 in cashew nut, Ara h 2 in peanut, Ber e 1 in Brazil nut, Fag e 2 in buckwheat, Gly m 8 in soybean, Jug r 1 in walnut, Ses i 1 in sesame, and Sin a 1 in mustard), Bet VI family including pathogenesis related proteins (e.g., Api g 1/celery, Ara h 8/peanut, Cor a 1/hazelnut, Dau c 1/carrot, Gly m 4/soybean, Mal d 1/apple, and Pru p 1/peach), 7S (vicilin-like) globulins family (e.g., Ana o 1/cashew nut; Ara h 1/peanut; Gly m 5/soybean; Jug r 2/walnut; Pis v 3/pistachio), 11S (legumin-like) globulins family (e.g., Ana o 2/cashew nut; Ara h 3/peanut; Ber e 2/Brazil nut; Cor a 9/hazelnut; Gly m 6/soybean; Jug r 4/walnut; Pru du 6/almond), Cysteine protease C1 family (e.g., Act d 1/kiwi; Gly m Bd 30K/soybean), Profilins family including actin binding proteins (e.g., Act d 9/kiwi; Api g 4/celery; Ara h 5/peanut; Cuc m 2/melon; Dau c 4/carrot; Gly m 3/soybean; Lyc e 1/tomato; Mus a 1/banana; Ory s 12/rice; Pru av 4/cherry; Pru du 4/almond; Pru p 4/peach and Tri a 12/wheat), tropomyosin family including actin binding proteins in muscle (e.g., Pen m 1/shrimp), parvalbumin family including muscle proteins (e.g., Cyp c 1/carp; Gad c 1/cod; Ran e 2/frog; Sal s 1/salmon; Seb m 1/redfish; Xip g 1/swordfish), caseins family including mammalian milk proteins (e.g., Bos d 8-Bos d 12/cow's milk), transferrin family including sulfur-rich ion-binding glycoproteins from milk and hen's egg white (e.g., Bos d Lactoferrin/cow's milk; Gal d 3/hen's egg), serpins family including Serine protease inhibitors (e.g., Gal d 2/hen's egg), Arginine kinases family including Adenosine triphosphate:guanido phosphotransferases (e.g., Pen m 2/shrimp), Lipocalins family including carrier proteins (e.g., Bos d 5/cow's milk), Ovomucoids family including Kazal inhibitors (e.g., Gal d 1/hen's egg), Lysozyme family (e.g., Bos d 4/cow's milk; Gal d 4/hen's egg), and Albumins family including Serum albumins (e.g., Bos d 6/cow's milk; Gal d 5/hen's egg).
In accordance with the present screening method, the ssDNA pools (e.g., Pool 1, Pool 2, Pool 3 and Pool 4 in Table 1) from each selection may be cleaned, amplified and sequenced. In one embodiment, the method comprises an amplification of the individual ssDNA molecules using a polymerase chain reaction (PCR). The sequences within each pool are identified using deep sequencing. In some embodiments, the ssDNA molecules in the target binding pool (i.e., Pool 1) are amplified and sequenced. In parallel, an artifact library may be made by amplifying the input ssDNA library and the sequences in this artifact library are sequenced (See, e.g., the flow-chart of
The ssDNA molecules in the positive binding pool from the final round of the on-chip target binding selection (e.g., Pool 2 in Table 1) may be sequenced. The ssDNA molecules within this pool contain ssDNA sequences that preferentially bind to their target in the presence of their complementary sequences.
The non-specific ssDNA molecules from the final round of the on-chip non-binding counter selection and from the final round of the on-chip counter selection (e.g., Pools 3 and 4 in Table 1) may be sequenced. The ssDNA molecules within these pools contain ssDNA sequences that fail to hybridize to the complementary sequences even in the absence of the target material and sequences with cross-specificity to other counter targets.
The ssDNA sequences from each pool may be barcoded for identity. Following barcoding, ssDNA molecules from each pool may be pooled together and run deep sequencing in a single lane on the Illumina MiSeq System.
After sequencing and barcoding of the ssDNA sequences in each pool, and running the deep sequencing, the data are analyzed using any available bioinformatics tools. In some embodiments, heat maps are generated for each individual pool, which represent the frequencies of ssDNA sequences in each pool, by using a local occurrence of the open-source bioinformatics tool Galaxy (Thiel and Giangrande, Methods 2016, 97, 3-10; the contents of which are incorporated herein by reference in their entirety).
Potential aptamer hits are selected by analyzing the over-expressed sequences in each pool using the heat maps for sequences in each pool. Essentially, the heat maps of the ssDNA molecules from the non-binding pool (Pool 3) and the counter binding pool (Pool 4) are subtracted from the heat maps of the ssDNA molecules from the positive binding pool (Pool 2). The final data represent a pool of potential aptamer hits with characteristics including: (i) binding to the target protein with high specificity and affinity, (ii) hybridizing to their short complementary sequences only in the absence of the target but not binding to the short complementary sequences in the presence of the target; and (iii) no cross-reactivity to non-specific counter targets. These characteristics of the aptamer candidates make them suitable for target detection in a sample, e.g., in competition-based assays.
From the final pool of potential aptamer hits (e.g., Pool 5 in Table 1), a sequence family tree may be constructed to show the similarities between different aptamer sequences. Multiple sequences from various branches of the family tree structure can be selected and folded using expected assay conditions. The secondary and tertiary structures will be assessed, and those sequences that show multiple well-defined structures are selected for synthesis and further evaluation. The structures or motifs may include hairpin loops, symmetric and asymmetric bulges, pseudoknots and myriad combinations of the same. The equilibrium dissociation constant (Kd), and other parameters of the selected aptamers will be measured.
In accordance with the present disclosure, the selection method may further comprise the steps of (i) amplifying all the sequences in the first, second, third and fourth sub-pools, and barcoding each sequence from each pool; (ii) pooling the sequences from each sub-pool together and running sequencing together; (iii) analyzing the data from (ii) and separating each sequence data into the original sub-pool according to the barcode information; (iv) generating heat maps for each individual sub-pool that represent the frequencies of each sequence in the pool; and (v) subtracting the sequences in the heat maps of the third sub-pool and the fourth sub-pool from the heat maps of the second sub-pool, wherein the final pool of sequences are candidate aptamers that specifically bind to the target of interest and preferentially bind to the target of the interest in competing the binding of various short complementary sequences.
In accordance with the present disclosure, sequences that specifically bind to peanut, tree nuts including almond, brazil nut, cashew, hazelnut, pecan, pistachio and walnut, gluten, milk allergens whey and casein are selected. Aptamer sequences that bind to all nuts are also selected. As used here, the term “all nuts” refers to peanut and the tree nuts including almond, brazil nut, cashew, hazelnut, pecan, pistachio and walnut. A selected aptamer sequence that is specific to “all nuts” can bind to any of the nuts (i.e., peanut, almond, brazil nut, cashew, hazelnut, pecan, pistachio and walnut), e.g., one, two, three, four, five, six, seven or eight nuts present in samples.
In some embodiments, the sequences that specifically bind to peanut comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 3 to 1002.
In some embodiments, the sequences that specifically bind to almond comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 4003 to 5002.
In some embodiments, the sequences that specifically bind to brazil nut comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 8003 to 9002.
In some embodiments, the sequences that specifically bind to cashew comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 12003 to 13002.
In some embodiments, the sequences that specifically bind to hazelnut comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 16003 to 17002.
In some embodiments, the sequences that specifically bind to pecan comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 20003 to 21002.
In some embodiments, the sequences that specifically bind to pistachio comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 24003 to 25002.
In some embodiments, the sequences that specifically bind to walnut comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 28003 to 29002.
In some embodiments, the sequences that specifically bind to all nuts comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 32003 to 33002.
In some embodiments, the sequences that specifically bind to gluten comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 40003 to 41002.
In some embodiments, the sequences that specifically bind to whey comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 44003 to 45002.
In some embodiments, the sequences that specifically bind to casein comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NO. 48003 to 49002.
In some embodiments, the present selection method may be modified to identify aptamer sequences that bind to multiple targets. The multiple target selection process provides an efficient method for identifying the best binding aptamers to a group of targets.
Many allergens, particularly food allergens, are composed of multiple allergenic components. These components may induce component specific IgE in a person's body. Some people are allergic to only one specific component but not the other components of the same allergen. Some people are allergic to all the components of an allergen. For example, milk includes two primary allergenic components: the whey proteins (alpha-lactalbumin and beta-lactoglobulin) and caseins. A person who is allergic to milk, may be allergic to only whey or casein, or to both whey and casein. In this context, an aptamer ligand that binds to the whey proteins only, or caseins only, or both the whey proteins and caseins, may be desirable to milk allergy.
In some embodiments, the present screening methods may be modified for selecting aptamers that can bind to the multiple components of an allergen.
As a non-limiting example, two, three, four or more rounds of the positive GO-SELEX selection are performed using the whole allergen material as the target material (e.g., the whole milk including casein and the whey proteins). The ssDNA molecules selected from this process (Pool 1) include a mixture of ssDNA sequences that bind to any of the various components of milk (e.g., caseins and the whey proteins). These milk binding sequences are used to run two parallel selection processes: a selection process for casein only and a selection process for the whey protein only. The two selection processes are performed as previously described for a single target. Importantly, during the counter selection process, a separate selection will be performed using only the other component as the counter target. That is, in the selection for the aptamer sequences that specifically bind to casein, a whey protein is used as the counter target in the counter selection process, while casein is used as the counter target for selecting aptamer sequences that specifically bind to a whey protein.
Following similar procedures, the various sub-pools of ssDNA molecules will be barcoded and submitted for deep sequencing. The casein and whey samples will be pooled separately from one another and run in separate lanes during deep sequencing. The bioinformatic analysis of the sequencing data will reveal the pool of aptamer hits for the target alone. In order to find an aptamer sequence that binds both casein and whey, overlaps between the special counter rounds may be collected.
In another example, a mixture of all nuts may be used as target materials, sequences that can recognize all nuts may be selected by the present methods. The selected sequences may bind to any nut, and the combinations of any nuts present in a test sample.
In another aspect of the disclosure, aptamers that specifically bind to allergen targets, signaling polynucleotides (SPNs) derived from the selected aptamers, and detection sensors comprising these aptamers and SPNs are provided. An aptamer that binds to a target allergen with high specificity and affinity may not hybridize to the short complementary sequence in the presence of the target allergen, and demonstrates little or no cross-specificity to any counter target.
A SPN may be derived from an aptamer sequence selected by the present method. The SPN may further comprise additional nucleotides at one end or both ends of the aptamer sequence. The sequence may be further modified to change its secondary and/or tertiary structures to make it more stable, to increase the binding affinity and/or specificity, or to add a fluorescence marker, or to be modified to comprise one or more conjugates.
Detection sensors comprising selected aptamers and SPNs are provided. In some embodiments, the detection sensor may include a SPN, a solid support and a short oligonucleotide comprising a nucleic acid sequence complementary to the SPN, wherein the oligonucleotide is covalently anchored to the solid support by one of the ends, directly or through a linker (e.g., a 6 carbon atom arm). The SPN comprises an inner sequence that specifically binds to a target of interest and it hybridizes to the complementary oligonucleotide when it is not bound to the target of interest. In one example, the short complementary sequences and the target of interest will compete binding to the SPN. In this competitive assay, for example, the SPN can either bind to the short complementary sequences attached on the solid support or a target of interest in a sample. Under conditions sufficient to allow the target of interest in the sample to compete with the short complementary sequences attached on the solid support, the SPN:target complexes can be detected and measured.
In accordance with the present disclosure, aptamer sequences that specifically bind to peanut, tree nuts including almond, brazil nut, cashew, hazelnut, pecan, pistachio and walnut, gluten, milk allergens whey and casein are selected. Aptamer sequences that bind to all nuts are also selected.
In some embodiments, the sequences that specifically bind to peanut comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs.3 to 1002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO.1. Accordingly, the aptamer that specifically binds to peanut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs.1003 to 2002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO.2. Accordingly, the aptamer that specifically binds to peanut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 2003 to 3002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to peanut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 3003 to 4002. In one embodiment, the aptamer of the present disclosure that specifically binds to peanut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 3 to 4002 listed in Table 2, or variant thereof.
In some embodiments, the sequences that specifically bind to almond comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 4003 to 5002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to almond may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 5003 to 6002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to almond may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 6003 to 7002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to almond may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 7003 to 8002. In one embodiment, the aptamer of the present disclosure that specifically binds to almond may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 4003 to 8002 listed in Table 3, or variant thereof.
In some embodiments, the sequences that specifically bind to brazil nut comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 8003 to 9002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to brazil nut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 9003 to 10002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to brazil nut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 10003 to 11002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to brazil nut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 11003 to 12002. In one embodiment, the aptamer of the present disclosure that specifically binds to brazil nut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 8003 to 12002 listed in Table 4, or variant thereof.
In some embodiments, the sequences that specifically bind to cashew comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 12003 to 13002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to cashew may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 13003 to 14002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to cashew may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 14003 to 15002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to cashew may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 15003 to 16002. In one embodiment, the aptamer of the present disclosure that binds to cashew may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 12003 to 16002 listed in Table 5, or variant thereof.
In some embodiments, the sequences that specifically bind to hazelnut comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 16003 to 17002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to hazel nut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 17003 to 18002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to hazel nut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 18003 to 19002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to hazel nut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 19003 to 20002. In one embodiment, the aptamer of the present disclosure that specifically binds to hazelnut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 16003 to 20002 listed in Table 6, or variant thereof.
In some embodiments, the sequences that specifically bind to pecan comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 20003 to 21002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to pecan may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 21003 to 22002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to pecan may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 22003 to 23002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to pecan may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 23003 to 24002. In one embodiment, the aptamer of the present disclosure that specifically binds to pecan may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 20003 to 24002 listed in Table 7, or variant thereof.
In some embodiments, the sequences that specifically bind to pistachio comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 24003 to 25002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to pistachio may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 25003 to 26002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to pistachio may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 26003 to 27002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to pistachio may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 27003 to 28002. In one embodiment, the aptamer of the present disclosure that specifically binds to pistachio may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 24003 to 28002 listed in Table 8, or variant thereof.
In some embodiments, the sequences that specifically bind to walnut comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 28003 to 29002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to walnut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 29003 to 30002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to walnut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 30003 to 31002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to walnut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 31003 to 32002. In one embodiment, the aptamer of the present disclosure that specifically binds to walnut may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 28003 to 32002 listed in Table 9, or variant thereof.
In some embodiments, the sequences that specifically bind to all nuts comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 32003 to 33002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to all nuts may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 33003 to 34002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to all nuts may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 34003 to 35002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to all nuts may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 35003 to 36002. In one embodiment, the aptamer of the present disclosure that can bind to all nuts may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 32003 to 36002 listed in Table 10, or variant thereof.
In some embodiments, the sequences that can bind to control materials during detection assay can be selected through the present disclosure. As a non-limiting sequence, the sequences that bind to peanut control material are selected by the present method, which comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 36003 to 37002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to peanut control material may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 37003 to 38002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to peanut control material may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 38003 to 39002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to peanut control material may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 39003 to 40002. In one embodiment, the aptamer of the present disclosure that can be used to detect peanut control material may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 36003 to 40002 listed in Table 11, or variant thereof.
In some embodiments, the sequences that specifically bind to gluten comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 40003 to 41002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to gluten may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 41003 to 42002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to gluten may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 42003 to 43002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to gluten may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 43003 to 44002. In one embodiment, the aptamer of the present disclosure that specifically binds to gluten may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 40003 to 44002 listed in Table 12, or variant thereof.
In some embodiments, the sequences that specifically bind to whey comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 44003 to 45002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to whey may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 45003 to 46002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to whey may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 46003 to 47002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to whey may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 47003 to 48002. In one embodiment, the aptamer of the present disclosure that specifically binds to whey may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 44003 to 48002 listed in Table 13, or variant thereof.
In some embodiments, the sequences that specifically bind to casein comprise an inner sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs. 48003 to 49002. In some examples, a short nucleic acid sequence may be affixed to the 5-end of the inner sequence. The short nucleic acid sequence may be the 5′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 1. Accordingly, the aptamer that specifically binds to casein may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 49003 to 50002. In other examples, a short nucleic acid sequence may be affixed to the 3-end of the inner sequence. The short nucleic acid sequence may be the 3′ primer sequence used in the random ssDNA library, i.e. the nucleic acid sequence of SEQ ID NO. 2. Accordingly, the aptamer that specifically binds to casein may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 50003 to 51002. In other examples, the inner sequence may comprise a 5′ end short sequence (i.e., SEQ ID NO. 1) and a 3′ end short sequence (i.e., SEQ ID NO. 2). Accordingly, the aptamer that specifically binds to casein may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 51003 to 52002. In one embodiment, the aptamer of the present disclosure that specifically binds to casein may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NOs. 48003 to 52002 listed in Table 14, or variant thereof.
In some embodiments, the SPN of the present disclosure comprises an aptamer selected by the present method and a short oligonucleotide anchor sequence that may be coated to a solid support. The short anchor oligonucleotide comprises a nucleic acid sequence complementary to a portion of the same aptamer sequence. In some embodiments, a SPN for detecting peanut allergen comprises an aptamer sequence selected from the group consisting of SEQ ID Nos. 3 to 4002 and one or more short anchor sequences that are complementary to the aptamer sequence. As a non-limiting example, the complementary sequence may comprise a nucleic acid sequence selected from SEQ ID NOs. 52003 to 52042 (as shown in Table 15). In some embodiments, the anchor oligonucleotide may be modified to contain a spacer at one end of the sequence. As a non-limiting example, the anchor sequence is modified to contain either a 12-Carbon atom spacer or a 6-Carbon atom spacer at the 5′ end of the sequence (Table 15), or a polyA tail at one end of the sequence. The short complementary sequences may be covalently attached to the solid support (e.g., a glass or plastic chip) directly or through a linker. Accordingly, the length of the linker (carbon atoms or polyA tail) from the solid surface can prevent steric hindrance and reduce the probability of interference due to auto-fluorescence of matrices.
In some embodiments, the present disclosure provides a detection kit for allergen detection. The kit comprises (a) a SPN comprising an aptamer sequence that specifically binds to a target of interest, wherein the aptamer does not bind to its complementary sequence in the presence of the target of interest; and (b) a solid support of which the surface is coated with short nucleic acid sequences that are complementary to the sequence of the aptamer. The detection kit may further comprise one or more buffer solutions and other reagents. The buffers are suitable for preparing sample solutions, SPN solutions, and/or other solutions necessary for running a detection assay (e.g., wash buffers). One or more of these kit components may be separated into individual containers, or they may be provided in their aggregated state. In some embodiments, the kit may comprise multiple SPNs specific to multiple allergen targets. For example, the kit may comprise a panel of SPNs specific to peanut and common tree nuts including almond, brazil nut, cashew, hazel nut, pecan, pistachio and walnut.
In some embodiments, the detection kit may further comprise one or more control aptamer sequences; the control sequences may be used to measure total protein and normalize the baseline. For example, a detection kit comprising SPNs specific to peanut for peanut detection may comprise peanut control sequences that can measure total protein and normalize the baseline during peanut detection. As a non-limiting example, a peanut detection kit may comprise one or more peanut specific aptamers comprising nucleic acid sequences selected from SEQ ID NOs. 3-4002 and one or more peanut control aptamers comprising nucleic acid sequences selected from SEQ ID NOs. 36003 to 40002.
In some embodiments, the present disclosure provides a method for detecting the presence and/or absence of an allergen in a food sample, the method comprising the steps of (i) preparing a sample to be tested solution and a SPN solution; (ii) mixing the sample and SPN solutions and incubating the mixture to induce the binding of the target to the SPN; (iii) contacting the mixture to a solid support that is coated with short oligonucleotides comprising sequences complementary to the SPN; and (iv) measuring a signal and detecting the presence and/or absence of the allergen of interest. The SPN may be labeled with a fluorophore at one end of the sequence, e.g., Cy5 and Alexa Fluor 647.
In some embodiments, the solid support is a glass chip (e.g., a borosilicate glass chip) wherein the surface of the glass chip is divided into several panels including at least one reactive panel and at least two control panels. The reactive panel of the glass chip are covalently coated with short oligonucleotides comprising sequences complementary to the SPN to which the SPN can hybridize to form a double stranded nucleic acid when the SPN is free from the binding of the target of interest. The reactive panel may be flanked by two control panels at each side. The control panels may be coated with random sequences that do not bind to the SPN nor the target.
The chip can be any size suitable for the use in a detection device/system, e.g., 10×10 mm. In some embodiments, the detection chip may be a plastic chip.
In some embodiments, the food sample may be processed with a homogenization buffer that contains a SPN specific to an allergen of interest (e.g., peanut). The food slurry passes over a reactive panel on a glass chip, embedded in a cartridge designed to position the chip to face a laser and an optical sensor. Wash buffer is flowed over the reactive panel, thereby removing any non-specific binding interactions from the panel. Multiple steps of the assay are read by the optical sensor and analyzed by an algorithm to provide an “allergen detected” or “allergen not detected” response. In the absence of the target allergen, the SPN is free to bind to the complementary oligonucleotides on the reactive panel, resulting in a high fluorescence signal. In the presence of the target allergen, the SPN:complement binding interface is occluded, thereby resulting in a decrease in fluorescence signal on the reactive panel.
Those skilled in the art will recognize or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments in accordance with the disclosure described herein. The scope of the present disclosure is not intended to be limited to the above Description, but rather is as set forth in the appended claims.
In the claims, articles such as “a,” “an,” and “the” may mean one or more than one unless indicated to the contrary or otherwise evident from the context. Claims or descriptions that include “or” between one or more members of a group are considered satisfied if one, more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process unless indicated to the contrary or otherwise evident from the context. The disclosure includes embodiments in which exactly one member of the group is present in, employed in, or otherwise relevant to a given product or process. The disclosure includes embodiments in which more than one, or the entire group members are present in, employed in, or otherwise relevant to a given product or process.
It is also noted that the term “comprising” is intended to be open and permits but does not require the inclusion of additional elements or steps. When the term “comprising” is used herein, the term “consisting of” is thus also encompassed and disclosed.
Where ranges are given, endpoints are included. Furthermore, it is to be understood that unless otherwise indicated or otherwise evident from the context and understanding of one of ordinary skill in the art, values that are expressed as ranges can assume any specific value or subrange within the stated ranges in different embodiments of the disclosure, to the tenth of the unit of the lower limit of the range, unless the context clearly dictates otherwise.
In addition, it is to be understood that any particular embodiment of the present disclosure that falls within the prior art may be explicitly excluded from any one or more of the claims. Since such embodiments are deemed to be known to one of ordinary skill in the art, they may be excluded even if the exclusion is not set forth explicitly herein. Any particular embodiment of the compositions of the disclosure (e.g., any antibiotic, therapeutic or active ingredient; any method of production; any method of use; etc.) can be excluded from any one or more claims, for any reason, whether or not related to the existence of prior art.
It is to be understood that the words which have been used are words of description rather than limitation, and that changes may be made within the purview of the appended claims without departing from the true scope and spirit of the disclosure in its broader aspects.
While the present disclosure has been described at some length and with some particularity with respect to the several described embodiments, it is not intended that it should be limited to any such particulars or embodiments or any particular embodiment, but it is to be construed with references to the appended claims so as to provide the broadest possible interpretation of such claims in view of the prior art and, therefore, to effectively encompass the intended scope of the disclosure.
As illustrated in
To separate the bound ssDNAs from the target, 10% Strataclean resin is added to the collected supernatant which contains target protein and ssDNA complexes. The resulting mixture is heated to 80° C. for 3 minutes, followed by centrifugation at 10,000 g for 3 minutes. The pellet containing the resin bound target proteins is discarded and the supernatant is collected. The strataclean step is repeated for at least one more round. The concentration of ssDNAs in the final supernatant is measured and compared to the initial concentration prior to addition of target proteins and GO. The ssDNA ratio after each round of selection is used to determine if further round selection is necessary. For example, if the ratio is below 50%, the same conditions are repeated in the next round until recovery improves.
The collected final ssDNA pool is then amplified by PCR using a biotinylated reverse primer and a Cy5-tagged forward primer. The PCR amplified DNAs are cleaned for removal of any residual reagent (e.g., PCR Clean Up Kit) and measured for the concentration of DNA molecules. The clean PCR product is added to streptavidin coated magnetic beads. The biotinylated complimentary strand binds the streptavidin coated beads, then base is added to denature the dsDNA molecules. Using a magnet, the beads, with the biotinylated complimentary strand still bound, are pulled out of solution and the desired ssDNA strands with the Cy5-tag are collected. The isolated ssDNA pool is concentrated, measured and prepared for next selection round.
The ssDNA pool from Example 1 is diluted to 0.2 ng/μL in the extraction buffer. The same target solution is prepared and diluted to stringent conditions. 50 μL of the ssDNA solution is mixed with 50 μL of target protein and incubated for 1 minute at room temperature with shaking. This ssDNA/protein mixture is then added to two wells of a 16-well slide containing short complimentary anchors to the primer regions of the ssDNA molecules. After incubation for 1 minute at room temperature with shaking, the ssDNA/protein mixture is transferred to the next two wells of the same slide. This process is repeated for a total of eight incubations. Following the final incubation, the ssDNA/protein mixture is collected. The cleaning, amplification, and strand separation steps are the same as in the positive GO-SELEX selection (See Example 1). This on-glass selection can be repeated multiple times until the recovery ratio is acceptable.
The ssDNA molecules from the final round of positive on-glass SELEX (Example 2) are diluted to a concentration of 0.1 ng/μL in extraction buffer. 50 μL of the ssDNA solution is added to two wells of a new 16-well slide as described above. Without any protein present, all sequences in the pool that are capable of binding the complimentary sequences should bind. After incubation for 1 minute at room temperature with shaking, the ssDNA solution is transferred to the next two wells of the same slide and incubated for another 1 minute. This process is repeated for a total of eight incubations. Following the final incubation, the ssDNA is collected and saved for sequencing.
The ssDNA molecules from the final round of positive on-glass SELEX (Example 2) are diluted to a concentration of 0.1 ng/μL in extraction buffer. The counter proteins of interest are dissociated in extraction buffer and diluted to 10000 ppm. 50 uL of the ssDNA solution is mixed with 50 μL of the counter target mixture and the mixture is incubated for 1 minute at room temperature with shaking. This ssDNA/protein mixture is then added to two wells of a fresh 16-well slide as previously described. Any ssDNA in the mixture that also has the capability to bind these undesired counter targets will bind the proteins and not bind the short complimentary sequences on the glass. After incubation for 1 minute at room temperature with shaking, the ssDNA/protein mixture is transferred to the next two wells of the same slide. This process is repeated for a total of eight incubations. Following the final incubation, the ssDNA is collected, cleaned as previously described, and saved for sequencing.
Gluten is found in wheat, buckwheat, barley, and rye, which is composed of two primary fractions, the water and alcohol soluble gliadins, and the insoluble glutenins (Journal of AOAC International 2013; 96, 1-8). Due to these solubility differences, aptamers that recognize the gliadin fraction are selected using the combined SELEX methods.
Gluten Extraction from Food
Several different extraction methods are used and compared (Fallahbaghery et al., J. Agric. Food Chem. 2017; 65, 2857-2866; and Ito et al., Anal Bioanal Chem. 2016; 408, 5973-5984). Surfactants, salts, and reducing agents are tested. The surfactants tested include 0.1% Tween 20 or 1% SDS. The salts tested include 25 mM NaCl and 5 mM MgCl2 or 2 mM guanidinium HCl. The only reducing agent tested is 100 mM sodium sulfite, as other reducing agents present a serious health hazard for a consumer device.
In order to select the ideal extraction buffer, 24 different buffers were tested, and their extraction efficiency was measured by ELISA. The first round of testing was performed simply on wheat, while further rounds of testing used four different wheat-incurred foods: oatmeal, wine, ground pork, and ice cream. From this testing, the best gluten extraction buffer comprises 20 mM HEPES, 30% EtOH, 0.1% Tween20, 2 mM guanidinium HCl, 25 mM NaCl, and 5 mM MgCl2.
Determining the Ratio of GO and ssDNA Molecules
The optimal ratio of GO to ssDNA molecules in the extraction buffer is determined to achieve the maximal recovery of ssDNA molecules during the selection process. The optimal ratio is 10-fold excess of GO to ssDNA, but the affinity of ssDNAs to GO varies depending on salt content of the buffer. A dilution curve of GO using the same amount of ssDNA revealed that a 2000:1 mass ratio of GO to ssDNA is needed in the gluten extraction buffer.
Every target protein has distinct cross-reactivity concerns. For gluten, several counter proteins classes are tested, including tree nuts, commonly used wheat replacements (arrowroot, rice flour, buckwheat), and the other major allergens (egg, milk, soy).
Control sequences can be used in a detection assay, for example in an allergen detection assay to measure the total protein. Signals from control sequences may be incorporated into the assay algorithm in place of, or in addition to the fiducials. In this example, control sequences for peanut detection (i.e. peanut control sequences) were selected from the ssDNA library using SELEX methods as described herein. The criteria for control sequences include: 1) having similar response to a corresponding matrix, e.g., food type, as the target such as AraH1 (peanut allergen); 2) having no or litter response to the target material, e.g., peanut; 3) having no binding to either the aptamer against the target (AraH1) or its anchor sequences.
To select peanut control sequences, repeated selections with different binding materials were performed. Before each round of selection, a counter selection against 10,000 ppm peanut was performed and the collected sequences from each round of the counter selection were used. Table 16 lists the repeated selections with different binding materials.
Sequences collected from Rounds 16, 17 and 18 were sequenced and tested. The heat maps, predicted binding to an aptamer specific to the peanut allergen protein AraH1 (AraH1 probe) and the anchor sequences, and folded structures of each sequence were analyzed. Table 11 lists the top 1000 hits from the selection. 13 control sequences (Table 17) were picked and further characterized,
None of these peanut control sequences bind to the aptamer specific to AraH1 (AraH1 probe) up to the concentration of 100 nM.
The binding of the control sequences to an anchor sequence (AAAAATCAAGTGGTC; SEQ ID NO. 52003), the interference of each sequence with the binding of the AraH1 probe to peanut, and the affinity of each sequence to peanut, were evaluated. The data showed that three sequences PC36, PC60 and PC87 have minimal response to 5000 ppm peanut when tested at a concentration of 100 nM.
Two food types including sugar free wafer and strawberry poptart were compared for response to AraH1 aptamer and peanut control sequences PC36, PC60 and PC87 (each at a concentration of 100 nM). The foods were spiked with either 0 ppm or 5000 ppm peanut. The data indicate the AraH1 aptamer creates high signal for wafer but a lower signal for poptart.
The present application claims priority to U.S. Provisional Application No. 62/714,102 filed Aug. 3, 2018, entitled with “Methods for Aptamer Selection”; the contents of which are incorporated herein by reference in their entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2019/044772 | 8/2/2019 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62714102 | Aug 2018 | US |