The sequence listing that is contained in the file named “NBLE092US_ST25.txt,” which is 21 kilobytes as measured in Microsoft Windows operating system and was created on Mar. 21, 2017, is filed electronically herewith and incorporated herein by reference.
The present invention relates generally to the field of molecular biology. More specifically, the invention relates to methods of identifying pecan tree cultivars.
Pecan (Carya illinoinensis K.) is a valuable specialty crop. The pecan industry propagates pecan cultivars clonally through the use of graftwood. As clones, different trees from the same cultivar should be genetically identical. Pecan cultivars are currently identified visually, which can lead to different outcomes based on the individual performing the identification, challenges to identify subtle differences, and problems with variation in nut characteristics due to environmental factors such as water and nutrients, or simply based on historical assessment, wherein all trees that are stated to be derived from a particular cultivar are assumed to be genetically identical and derived from the stated cultivar. However, there is mounting evidence that individual trees allegedly from the same cultivar have important differences between them, for example susceptibility to the pecan scab disease, suggesting that these trees are not genetically identical to each other. Therefore, these trees could also differ in their agronomic performance, nut production (yield), disease resistance and nut characteristics. Unfortunately, pecan is currently lacking effective and reliable genome-wide level genomic resources to identify specific cultivars. Accordingly, methods and compositions for genetic identification of pecan cultivars would represent a tremendous advance in the field.
The present invention provides methods and compositions for identifying a cultivar of a pecan tree. In one embodiment, the method comprises detecting a polymorphism, for example a first insertion/deletion or single nucleotide polymorphism selected from the group of insertion/deletion and single nucleotide polymorphisms listed in Table 3, from a sample of nucleic acids from the pecan tree, wherein the presence of the first insertion/deletion or single nucleotide polymorphism identifies the cultivar of the pecan tree. In certain embodiments, the cultivar is Alley, Amling, Apache, Barton, Brooks, Burkett, Byrd, Caddo, Candy, Cape Fear, Carmichael, Carter, Cheyenne, Choctaw, Clark, Creek, Curtis, Dependable 6-2, Dependable 8-3, Desirable 5-2, Eclipse, Elliott, Evers, Excel, Forkert, Gafford, Giles, Halbert, Humble, Jackson BW, Jackson LA, Jenkins, Kanza, Kiowa, Lakota, Mahan, Major, Mohawk, Moreland, Nacono, Oconee, Odom, Pawnee, Podsednik, Riverside, Russell, Schley, Shawnee, Starking Hardy Giant (SHG), Shoshoni, Sioux, Stuart, Success, Sumner, Syrup Mill, VC168, Western. Schley (W. Schley), Wichita, Witte, or Zinner.
In particular embodiments the first insertion/deletion or single nucleotide polymorphism is detected by amplification of a first nucleic acid segment comprising the first single nucleotide polymorphism from the sample of nucleic acids, and analysis of said first nucleic acid segment by high-resolution melting or sequencing to detect the first insertion/deletion or single nucleotide polymorphism. In certain embodiments, the first nucleic acid segment is sequenced directly to detect the first insertion/deletion or single nucleotide polymorphism. In some embodiments the first insertion/deletion or single nucleotide polymorphism is within a locus in the genome of the pecan tree having a nucleotide sequence of any one of SEQ ID NOs:33-49. In other embodiments the first insertion/deletion or single nucleotide polymorphism is within a locus in the genome of the pecan tree flanked by a pair of oligonucleotide primers of sufficient length to hybridize specifically to any one of SEQ ID NOs:33-49. In particular embodiments the pair of oligonucleotide primers is selected from the group consisting of SEQ ID NO:1 and SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6, SEQ ID NO:7 and SEQ ID NO:8, SEQ ID NO:9 and SEQ ID NO:10, SEQ ID NO:11 and SEQ ID NO:12, SEQ ID NO:13 and SEQ ID NO:14, SEQ ID NO:15 and SEQ ID NO:16, SEQ ID NO:17 and SEQ ID NO:18, SEQ ID NO:19 and SEQ ID NO:20, SEQ ID NO:21 and SEQ ID NO:22, SEQ ID NO:23 and SEQ ID NO:24, SEQ ID NO:25 and SEQ ID NO:26, SEQ ID NO:27 and SEQ ID NO:28, SEQ ID NO:29 and SEQ ID NO:30, SEQ ID NO:31 and SEQ ID NO:32.
In additional embodiments, the method further comprises detection of a second single nucleotide polymorphism by amplification of a second nucleic acid segment comprising the second single nucleotide polymorphism from the sample of nucleic acids, and analysis of the second nucleic acid segment by high-resolution melting or sequencing to detect the second single nucleotide polymorphism. In certain embodiments, the second nucleic acid segment is sequenced directly to detect the second single nucleotide polymorphism. In other embodiments, the method further comprises detection of a third single nucleotide polymorphism by amplification of a third nucleic acid segment comprising the third single nucleotide polymorphism from the sample of nucleic acids, and analysis of the third nucleic acid segment by high-resolution melting or sequencing to detect the third single nucleotide polymorphism. In certain embodiments, the third nucleic acid segment is sequenced directly to detect the third single nucleotide polymorphism. In yet other embodiments, the method further comprises detection of a fourth single nucleotide polymorphism by amplification of a fourth nucleic acid segment comprising the fourth single nucleotide polymorphism from the sample of nucleic acids, and analysis of the fourth nucleic acid segment by high-resolution melting or sequencing to detect the fourth single nucleotide polymorphism. In certain embodiments, the fourth nucleic acid segment is sequenced directly to detect the fourth single nucleotide polymorphism. In still other embodiments, the method further comprises detection of a fifth single nucleotide polymorphism by amplification of a fifth nucleic acid segment comprising the fifth single nucleotide polymorphism from the sample of nucleic acids, and analysis of the fifth nucleic acid segment by high-resolution melting or sequencing to detect the fifth single nucleotide polymorphism. In certain embodiments, the fifth nucleic acid segment is sequenced directly to detect the fifth single nucleotide polymorphism.
The present invention also provides kits for determining a cultivar of a pecan tree. In some embodiments the kit comprises at least a first pair of oligonucleotide primers of sufficient length to hybridize specifically to any one of SEQ ID NOs:33-49. In particular embodiments the at least a first pair of oligonucleotide primers is selected from the group consisting of SEQ ID NO:1 and SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6, SEQ ID NO:7 and SEQ ID NO:8, SEQ ID NO:9 and SEQ ID NO:10, SEQ ID NO:11 and SEQ ID NO:12, SEQ ID NO:13 and SEQ ID NO:14, SEQ ID NO:15 and SEQ ID NO:16, SEQ ID NO:17 and SEQ ID NO:18, SEQ ID NO:19 and SEQ ID NO:20, SEQ ID NO:21 and SEQ ID NO:22, SEQ ID NO:23 and SEQ ID NO:24, SEQ ID NO:25 and SEQ ID NO:26, SEQ ID NO:27 and SEQ ID NO:28, SEQ ID NO:29 and SEQ ID NO:30, SEQ ID NO:31 and SEQ ID NO:32. In certain embodiments the kit comprises all of the following oligonucleotide primer sets: SEQ ID NO:1 and SEQ ID NO:2; SEQ ID NO:3 and SEQ ID NO:4; SEQ ID NO:5 and SEQ ID NO:6; SEQ ID NO:7 and SEQ ID NO:8; SEQ ID NO:9 and SEQ ID NO:10; SEQ ID NO:11 and SEQ ID NO:12; SEQ ID NO:13 and SEQ ID NO:14; SEQ ID NO:15 and SEQ ID NO:16; SEQ ID NO:17 and SEQ ID NO:18; SEQ ID NO:19 and SEQ ID NO:20; SEQ ID NO:21 and SEQ ID NO:22; SEQ ID NO:23 and SEQ ID NO:24; SEQ ID NO:25 and SEQ ID NO:26; SEQ ID NO:27 and SEQ ID NO:28; SEQ ID NO:29 and SEQ ID NO:30; SEQ ID NO:31 and SEQ ID NO:32, wherein the cultivar is Alley, Amling, Apache, Barton, Brooks, Burkett, Byrd, Caddo, Candy, Cape Fear, Carmichael, Carter, Cheyenne, Choctaw, Clark, Creek, Curtis, Dependable 6-2, Dependable 8-3, Desirable 5-2, Eclipse, Elliott, Evers, Excel, Forkert, Gafford, Giles, Halbert, Humble, Jackson BW, Jackson LA, Jenkins, Kanza, Kiowa, Lakota, Mahan, Major, Mohawk, Moreland, Nacono, Oconee, Odom, Pawnee, Podsednik, Riverside, Russell, Schley, Shawnee, Starking Hardy Giant (SHG), Shoshoni, Sioux, Stuart, Success, Sumner, Syrup Mill, VC168, Western Schley (W. Schley), Wichita, Witte, or Zinner. In some embodiments the kit further comprises instructions for determining the cultivar of the pecan tree. In certain aspects, the kits can be used to determine that an unknown pecan tree is not any of the above-listed cultivars.
The following drawings form part of the present specification and are included to further demonstrate certain aspects of the invention. The invention may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.
SEQ ID NOs:1-32—DNA sequences of forward and reverse primers for identifying cultivars of pecan.
SEQ ID NOs:33-49—DNA sequences of regions surrounding insertion/deletion or single nucleotide polymorphisms identified in pecan cultivars.
The present invention provides compositions and methods of determining a cultivar of a pecan tree, comprising detecting one or more single nucleotide polymorphism(s) or insertion/deletions from a sample of nucleic acids from the pecan tree, wherein the presence of the one or more single nucleotide polymorphism(s) identifies the cultivar of the pecan tree.
Pecan (Carya illinoinensis (Wangenh.) K. Koch), perhaps one of the most important horticultural crops, is a member of the Juglandaceae family. The family Juglandaceae contains both walnuts (Juglans spp.) and hickories (Carya spp.). The genus Carya has 20 species including pecan, 13 of which are native to the United States (Grauke and Thompson, 1996). Pecans are indigenous to a large area in North America stretching from southern Illinois down the Mississippi river valley and south into the northern area of Mexico, and west to the branches of the Llano and San Saba rivers. Pecan is a long-lived heterodichogamous tree species with male and female flowers on the same tree maturing at separate times (Thompson, 1985) that promotes cross-pollination with wind-dispersed pollen (Wood, et al., 1997). Therefore, pecan exhibits high heterozygosity and severe inbreeding depression. Commercial pecan orchards were established early on by selecting open-pollinated nuts from a mature tree that was selected based on specific desirable traits. However, each resulting nut from the same tree was genetically different because it was open pollinated with pollen from any of the male flowers in the orchard. Vegetative propagation methods were later developed, and the first record of tree nurseries selling grafted pecan trees was documented in 1879 (Crane, et al., 1937). Modified techniques for grafting were later developed (McHatton, 1957; Wood, 1990) and used in the commercial pecan industry to generate and sell named improved cultivars that were clonally propagated.
The pecan industry propagates pecan cultivars clonally through the use of graftwood. However, the inventors have discovered by genetic evaluation of multiple individual trees from the same cultivar the presence of genetic differences between them, suggesting that these trees are not genetically identical to each other. Therefore, these trees could also differ in their agronomic performance, disease resistance, nut yields and other nut characteristics including nutrient composition.
With the present invention, for the first time, it is possible to define pecan trees at the genetic level, and correctly identify the cultivar of any particular pecan tree. Particular polymorphisms or groups of polymorphisms at the DNA level were found to be determinative for assigning a cultivar designation. In certain embodiments, the present invention provides information to growers on the cultivar or cultivars currently grown in an existing orchard, enables growers to make educated decisions about how to manage their orchard (for example, to plan for fungicide applications if the trees are susceptible to pecan scab), plan for the future, and clearly articulate the product of their trees, given that the nuts from different cultivars may have different nutrient profiles. In additional embodiments, the present invention allows growers to identify pecan cultivars growing in an orchard that they are considering purchasing. In further embodiments, the present invention allows for the generation of a DNA profile of key pecan cultivars that have the most desirable characteristics (including, but not limited to, nut quality and pecan scab resistance) so that a grower can ensure the genetics of the trees under consideration for purchase, or to assist in breeding efforts. In other embodiments, the present invention allows interested nurseries to sell DNA-certified′ cultivars to validate the genetic identity of the trees being sold to growers, which translates to conservation of genetic identity, premium pricing for highly sought after cultivars, higher consumer confidence and a premium per tree for certification of the genetics.
DNA primers are isolated nucleic acids that are annealed to a complementary target DNA strand by nucleic acid hybridization to form a hybrid between the primer and the target DNA strand, then extended along the target DNA strand by a polymerase, e.g., a DNA polymerase. A DNA primer pair or a DNA primer set of the present disclosure refer to two DNA primers useful for amplification of a target nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or other conventional nucleic acid amplification methods.
DNA primers are generally 15 nucleotides or more in length, often 18 nucleotides or more, 24 nucleotides or more, or 30 nucleotides or more. In certain embodiments the DNA primers are 16 nucleotides in length, 17 nucleotides in length, 19 nucleotides in length, 20 nucleotides in length, 21 nucleotides in length, 22 nucleotides in length, 23 nucleotides in length, 25 nucleotides in length, 26 nucleotides in length, 27 nucleotides in length, 28 nucleotides in length, or 29 nucleotides in length. Such primers are selected to be of sufficient length to hybridize specifically to a target sequence under high stringency hybridization conditions. Preferably, primers according to the present disclosure have complete sequence similarity with the target sequence, although probes differing from the target sequence that retain the ability to hybridize to target sequences may be designed by conventional methods.
The nucleic acid primers of the present disclosure hybridize under stringent conditions to a target DNA molecule. Any conventional nucleic acid hybridization or amplification method can be used to identify the presence of an insertion/deletion or single nucleotide polymorphism in a sample. Nucleic acid molecules, also referred to as nucleic acid segments, or fragments thereof are capable of specifically hybridizing to other nucleic acid molecules under certain circumstances. As used herein, two nucleic acid molecules are said to be capable of specifically hybridizing to one another if the two molecules are capable of forming an anti-parallel, double-stranded nucleic acid structure. A nucleic acid molecule is said to be the “complement” of another nucleic acid molecule if they exhibit complete complementarity. As used herein, molecules are said to exhibit “complete complementarity” when every nucleotide of one of the molecules is complementary to a nucleotide of the other. Two molecules are said to be “minimally complementary” if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under at least conventional “low-stringency” conditions. Similarly, the molecules are said to be “complementary” if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under conventional “high-stringency” conditions. Conventional stringency conditions are described by Sambrook, et al., Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory, Cold Spring Harbor (1989), and by Haymes, et al., In: Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington, D.C. (1985). Departures from complete complementarity are therefore permissible, as long as such departures do not completely preclude the capacity of the molecules to form a double-stranded structure. In order for a nucleic acid molecule to serve as a primer it need only be sufficiently complementary in sequence to be able to form a stable double-stranded structure under the particular solvent and salt concentrations employed.
Appropriate stringency conditions that promote DNA hybridization, for example, include 6.0× sodium chloride/sodium citrate (SSC) at about 45° C., followed by a wash of 2.0×SSC at 50° C., are known to those skilled in the art or can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the salt concentration in the wash step can be selected from a low stringency of about 2.0×SSC at 50° C. to a high stringency of about 0.2×SSC at 50° C. In addition, the temperature in the wash step can be increased from low stringency conditions at room temperature, about 22° C., to high stringency conditions at about 65° C. Both temperature and salt may be varied, or either the temperature or the salt concentration may be held constant while the other variable is changed. In one embodiment, a DNA primer of the present disclosure will specifically hybridize to one or more of the nucleic acid molecules set forth in SEQ ID NOs: 33-49, or complements thereof or fragments of either under moderately stringent conditions, for example at about 2.0×SSC and about 65° C. In another embodiment, a DNA primer of the present disclosure will specifically hybridize to one or more of the nucleic acid molecules set forth in SEQ ID NOs:33-49 or complements or fragments of either under high stringency conditions. In certain aspects of the present disclosure, a DNA primer of the present disclosure has the nucleic acid sequence set forth in SEQ ID NOs:1-32 or complements thereof or fragments of either.
Regarding the amplification of a target nucleic acid sequence (e.g., by PCR) using a particular amplification primer pair, “stringent conditions” are conditions that permit the primer pair to hybridize only to the target nucleic acid sequence, for example any one of SEQ ID NOs:33-49, to which a primer having the corresponding wild-type sequence (or its complement) would bind and preferably to produce a unique amplification product, the amplicon, in a DNA thermal amplification reaction. The term “specific for (a target sequence)” indicates that a primer hybridizes under stringent hybridization conditions only to the target sequence in a sample comprising the target sequence.
As used herein, “amplified DNA” or “amplicon” refers to the product of nucleic acid amplification method directed to a target nucleic acid molecule that is part of a nucleic acid template. The amplicon may range in length from the combined length of the primer pair plus one nucleotide base pair, preferably plus about fifty nucleotide base pairs, more preferably plus about two hundred-fifty nucleotide base pairs, and even more preferably plus about four hundred-fifty nucleotide base pairs or more. Alternatively, a primer pair can be derived from genomic sequence on both sides of the insertion/deletion or single nucleotide polymorphism so as to produce an amplicon that includes the entire insertion/deletion or single nucleotide polymorphism. A member of a primer pair derived from the pecan genomic sequence adjacent to the insertion/deletion or single nucleotide polymorphism is located a distance from the insertion/deletion or single nucleotide polymorphism, this distance can range from one nucleotide base pair up to about one thousand nucleotide base pairs or more. The use of the term “amplicon” specifically excludes primer dimers that may be formed in the DNA thermal amplification reaction.
Nucleic acid amplification can be accomplished by any of the various nucleic acid amplification methods known in the art, including the polymerase chain reaction (PCR). Amplification methods are known in the art and are described, inter alia, in U.S. Pat. Nos. 4,683,195 and 4,683,202 and in PCR Protocols: A Guide to Methods and Applications, ed. Innis, et al., Academic Press, San Diego, 1990. These methods as well as other methods known in the art of DNA amplification may be used in the practice of the present disclosure.
DNA kits that are based on DNA amplification methods contain DNA primer molecules that hybridize specifically to a target DNA and amplify a diagnostic amplicon comprising one or more insertion/deletion or single nucleotide polymorphism under the appropriate reaction conditions. A kit that contains DNA primers that are homologous or complementary to any portion of the target nucleic acid sequences as set forth in any of SEQ ID NOs:33-49 or to any portion thereof. DNA molecules useful as DNA primers can be selected from the disclosed target nucleic acid sequences as set forth in any of SEQ ID NOs:33-49 by those skilled in the art of DNA amplification.
The following example is included to demonstrate preferred embodiments of the invention. It should be appreciated by those of skill in the art that the techniques disclosed in the example that follows represent techniques discovered by the inventors to function well in the practice of the invention, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the concept, spirit and scope of the invention. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims.
This example describes the generation of pecan transcriptome, the mining of those sequences for genome-wide SNP discovery in pecan, and utilization of a subset of validated SNPs to genotype pecan cultivars from the Carya species germplasm collection for characterization of named cultivars at the DNA level.
The pecan cultivars Pawnee and Kanza differ for various agronomic characteristics in the field including nut size and disease response to pecan scab. The transcriptome sequences covered a range of functional categories based on the black walnut (Juglans nigra L.) annotation in the absence of a sequenced and annotated pecan genome. A total of 43,417 candidate SNPs were identified between the two pecan cultivars. A subset of candidate SNPs were evaluated and validated using high resolution melting (HRM) analysis in the original pecan cultivars used for SNP discovery. Sixteen of the validated SNPs were used to survey a set of sixty two pecan cultivars and those generated specific marker profiles capable of distinguishing between sixty of the pecan cultivars evaluated, at the genetic level.
Methods
Plant Materials
One year old trees from the pecan cultivars ‘Pawnee’ and ‘Kanza’ (source: WoMack Nursery, DeLeon, Tex.) were grafted onto three year old ‘Apache’ rootstock and planted at the Samuel Roberts Noble Foundation's McMillan research farm in Marshall County, Okla. All trees were bare root trees that were root pruned before planting. Vegetation was controlled with herbicides to maintain a 1.8-meter wide vegetation-free strip centered on the tree rows. Trees were planted on a 10.6 meter spacing. All trees were irrigated with a solid set sprinkler system. Trees were planted on a Bastrop fine sandy loam soil (fine-loamy, mixed, active, thermic, udic Paleustalfs) soil. Plant tissues were sampled for three Pawnee trees (tree coding=NF07PEC0001-0038, NF07PEC0001-0039, NF07PEC0001-0040 with the corresponding sample coding=14P38, 14P39 and 14P40), and three Kanza trees (NF07PEC0001-0018, NF07PEC0001-0019, NF07PEC0001-0020 with the sample coding 14K18, 14K19 and 14K20, respectively). Terminal tips and approximately 30 leaflets ranging from young to more mature tissue were harvested from multiple tree positions. Samples were harvested in the field, placed on dry ice for transport back to the lab, and stored in a −80° C. freezer until processing.
DNA Extraction Protocol of the Pecan Cultivars Corresponding to the ORTET Samples
Carya germplasm leaf samples were collected from sixty two ORTET trees representing pecan cultivars at the USDA-ARS Texas A & M facility in College Station, Tex. Samples were collected on ice, lyophilized for four days and DNA extracted using the Plant DNAeasy Mini Kit (Qiagen, Valencia, Calif.) using the following standard procedure (based on the manufacturer's instructions): 1) Preheat a water bath or heating block to 65° C.; 2) Use TissueLyser for tissue disruption as follows: place 15 mg of the lyophilized sample tissue into a 2 ml safe-lock microcentrifuge tube, together with a 3 mm tungsten carbide bead; 3) Place the tubes into the TissueLyser Adapter Set 2×24, and fix into the clamps of the TissueLyser. Immediately grind the samples for 1 min at 30 Hz, and do this three times; 4) Add 400 μl Buffer AP1 and 4 μl RNase A stock solution (100 mg/ml) to a maximum of 15 mg (dried) disrupted plant tissue and vortex vigorously. Vortex at maximum speed for 1 min or more to remove any clumps; 5) Incubate the mixture for 10 min at 65° C. to lyse the cells. Mix two or three times during incubation by inverting the tube; 6) Add 130 μl Buffer AP2 to the lysate, mix, and incubate for 5 min on ice to precipitate proteins and polysaccharides; 7) Centrifuge the lysate for 7 min at 20,000×g (14,000 rpm); 8) Pipet the lysate into the QIAshredder Mini spin column placed in a 2 ml collection tube and centrifuge for 2 min at 20,000×g (14,000 rpm); 9) Transfer the flow-through fraction into a new tube without disturbing the cell-debris pellet. Typically 300 μl of lysate is recovered; 10) Add 1.5 volumes of Buffer AP3/E to the cleared lysate, and mix immediately using a pipette; 11) Transfer 650 μl of the previous mixture, including any precipitate that may have formed, into the DNeasy Mini spin column placed in a 2 ml collection tube. Centrifuge for 1 min at 13,000×g, and discard the flow-through; 12) Transfer the remaining mixture and centrifuge for 2 min at 13,000×g, and discard the flow-through. Discard flow-through and collection tube; 13) Place the DNeasy Mini spin column into a new 2 ml collection tube, add 500 μl Buffer AW, and centrifuge for 1 min at 13,000×g. Discard the flow-through; 14) Add 500 μl Buffer AW to the DNeasy Mini spin column, and centrifuge for 2 min at 20,000×g to dry the membrane. Discard flow-through; 15) Centrifuge 1 min at 13,000×g and discard any remaining flow-through; 16) Transfer the DNeasy Mini spin column to a 1.5 ml tube, and pipet 50 μl Buffer AE directly onto the DNeasy membrane. Incubate for 15 min at room temperature (15-25° C.), and then centrifuge for 1 min at 13,000×g to elute. DNA was quantified using a NanoDrop Spectrophotometer ND-100 (NanoDrop Technologies, Willington, Del.) and DNA dilutions were made to a working concentration of 5 ng/μl.
cDNA Preparation and Sequencing
Total RNA was extracted from 50-75 mg of tissue using Sigma Spectrum Plant Total RNA kit with a 40° C. incubation, using 750 μl of binding buffer and Protocol A and quantified using a NanoDrop Spectrophotometer ND-100 (NanoDrop Technologies, Willington, Del.). Total RNA integrity was assessed using a Bioanalyzer 2100 (Agilent, Santa Clara, Calif.). Messenger RNA (mRNA) isolation was performed using the oligo (dT) magnetic beads. The mRNA was fragmented into short fragments (200-500 bp) with a fragment buffer treatment. The first strand cDNA was synthesized by random hexamer-primer with the mRNA fragment as templates. Buffers, dNTPs, RNase H and DNA polymerase I were used to synthesize the second strand. The double strand cDNAs were purified with the Qiaquick PCR extraction kit and used for end-repair and base ‘A’ addition. Finally, sequencing adaptors were ligated to the fragments, and these were purified with agarose gel electrophoresis and amplified using PCR to generate libraries for sequencing using Illumina HiSeg™ 2000.
Pecan Sequence Analysis
The raw reads were quality trimmed with a custom Perl script, removing bases from the ends of each read until two consecutive bases with quality scores >=30 were reached. Read pairs with one or both reads less than 30 bp long after trimming were discarded. The trimmed reads from all six samples were pooled and de novo assembled using Trinity r2013-02-25 (trinityrnaseq.github.io) with 24 threads and 100 GB of RAM allocated to the JVM. The assembled transcripts were aligned with the black walnut transcriptome (hardwoodgenomics.org/content/black-walnut-unigene-version-120313) with BLASTX, using an E-value cutoff of 0.00001 and a minimum query alignment of 75%. Annotation was assigned to each transcript based on its walnut homolog's SWISS-PROT, PFAM, and Gene Ontology classification data. Each sample was then mapped to the de novo assembly using Tophat2 v2.0.12 (ccb.jhu.edu/software/tophat/index.shtml) with a mean read pair inner distance of 100 bp, inner distance standard deviation of 50, and a maximum intron length of 25,000. Transcript assembly and quantification was done by Cufflinks v2.2.1 with 24 threads and the default values of all other parameters. The six transcript sets identified by Cufflinks were merged using Cuffmerge v2.2.1. Differential expression testing was performed with Cuffdiff v2.2.1, treating the three Kanza samples and the three Pawnee samples as replicates.
Pecan SNP Discovery and Annotation
SNPs and indels (insertions/deletions) were called using the Samtools v0.1.18 (samtools.sourceforge.net) “mpileup” command and bcftools (part of the Samtools package). The bcftools output was reformatted and filtered with a custom Perl script, keeping only those SNPs and INDELs with total coverage >=5, reference and alternative coverages both >=2, and alternative allele frequencies of at least 10. A confidence score was generated based on the percent of nucleotides matching the surrounding regions of the location of the SNP. Low confidence regions with less than 90% identity where a SNP was called were not included in the final SNP predictions. SNPs were quality controlled by remapping the transcript reads back onto the assembly and checked against the ACE alignment files. Small indels were also identified and the process was biased towards getting good quality SNPs vs. getting all the SNPs at the expense of false SNP calls. The walnut annotation includes homologs in the SWISS-PROT database, PFAM classification, and Gene Ontology classification.
SNP Genotyping and HRM Analysis
Primer3 was used to design primers targeting candidate SNPs from the generated pecan sequences. Criteria for primer design include a predicted annealing temperature (Tm) of 59° C. to 61° C. with an optimum temperature of 60° C., primer length ranging between 18-24 bp, and PCR amplicon lengths of 40 to 200 bp. All PCR reactions for HRM were performed in 384-well plates using a 9700 Thermal Cycler (Applied Biosystems, Foster City, Calif., USA) using a total volume of 15 μL per well. HRM enables the identification of sequence variants in a target region without sequencing. HRM was successfully used for SNP genotyping and mapping in plants (Han, et al., 2012). The PCR reaction mixture consisted of 5 ng of genomic DNA, 0.25 μM of forward and reverse primer, 1× LightScanner High Sensitivity Master Mix (Idaho Technologies, Salt Lake, Utah, USA) and 10 μL mineral oil. After an initial denaturation step of 2 min at 95° C., 45 PCR cycles were performed with 30 s of denaturation at 94° C. and 30 s at the target annealing temperature of 60° C., followed by a final hold at 4° C. Samples were then transferred to a LightScanner 384-well system (Idaho Technologies, Salt Lake, Utah) and a melting cycle was performed by increasing the temperature at 0.1° C. s−1 from 56 to 95° C. Melting data was analyzed and visualized with the LightScanner Software with CALL-IT 2.0 (Idaho Technologies, Salt Lake, Utah) using the small amplicon module.
Amplicon Sequencing by Sanger Method
PCR were performed at 60° C. in final volume of 20 μl using forward and reverse primers (5 μM each, 1 μl each), 2 μl 10×PCR buffer, 2 μl MgCl2 (25 mM), 2 μl dNTP mix (2 mM), 9.80 μl of nuclease free water, 0.2 μl Promega GoTaq DNA polymerase (5U/μl) and 2 μl of DNA at 10 ng/μl. PCR products were cleaned with the ExoSAP-IT kit (Affymetrix, Cleveland, Ohio) and sequenced in a final volume of 10 μl using both forward and reverse primers (1.6 μM, 2 μl) designed for HRM and a BigDye (0.7 μl) terminator v3.1 Cycle Sequencing Kit (Thermal Fisher Inc., Foster City, Calif.), A.B.I. buffer (3.65 μl), DMSO (0.5 μl) and water until a final reaction volume of 10 μl. An ABI3730 automated sequencer (Thermal Fisher Inc., Foster City, Calif.) was used to perform the PCR reactions. Sequences were mapped and aligned using Geneious v.7.0.5 (geneious.com; Kearse, et al., 2012).
Results
SNP Discovery and Validation
The total number of reads for the two pecan genotypes ranged from 4.63 Gb to 9.81 Gb (Table 1; Statistics of the clean sequence reads from the two pecan cultivars (Pawnee=14P38, 14P39, 14P30) and Kanza (14L18, 14K19 and 14K20)) and covered a range of functional categories based on the gene ontology assignments (
‡% Q20 Before Filter
Candidate SNPs were called based on sequence alignments between the Pawnee and Kanza genotypes. A total of 43,417 candidate SNPs were identified between the two genotypes (
Using a set of sixteen SNP primer pairs (Table 2), sixty pecan cultivars were distinguished (from the 62 pecan ORTETs evaluated) based on the HRM profiles of the pecan ORTET samples (
The specific primers suitable to distinguish each of the pecan cultivars are described below. Although detection by “color” is detailed below, Tables 4, 5 and 6 include the actual insertion/deletion or single nucleotide polymorphism(s) corresponding to each color. It will be understood that detection by “color” or actual sequence of the insertion/deletion or single nucleotide polymorphism(s) can be used to distinguish the pecan cultivars as follows:
1. Russell is green group with primer CI0053.
2. Curtis is orange group with primer CI0048.
3. Mahan is aqua group using CI0051.
4. Jenkins is grey blue using CI0051.
5. Major is blue using CI0089.
6. Elliott, Gafford and Halbert are in the blue group with primer CI0081. Primer CI0048 distinguishes Elliott (green), Gafford (blue) and Halbert (red).
7. Wichita, Burkett and Cape Fear are in the orange group with primer CI0081. Primer CI0084 distinguishes Wichita (blue), Burkett (red) and Cape Fear (grey).
8. Byrd, Evers, Halbert, Russell and Wichita are in the green group with primer CI0062. Primer CI0001 identifies Evers (grey), Byrd (orange) and Halbert (red) from Russell and Wichita (orange).
9. Alley, Candy, Riverside, Stuart and Syrup Mill are in the green group with primer CI0051. Primer CI0025 separates Riverside (red), Alley and Stuart (orange), and Candy and Syrup Mill (grey). Primer CI0001 separates Alley (red) and Stuart (blue), Candy (red) and Syrup Mill (orange).
10. Carmichael is in the aqua group with primer CI0081. Primer CI0001 separates Carmichael (red), Byrd (orange) and Evers (grey).
11. Apache, Jackson BW, Caddo, Cape Fear, Forkert, Jenkins, Moreland, Schley, Sioux, Stuart, Sumner and Zinner with Primer CI0053 are in the orange group. Primer CI0012 distinguishes between Apache and Jackson BW (blue) from the rest of the cultivars (red and green). Primer CI0008B distinguishes between Apache (red) and Jackson BW (grey blue). Primer CI0015 also distinguishes Apache (red) and Jackson BW (grey). Primer CI0018 is another option to distinguish Apache (blue) and Jackson BW (grey).
12. Kanza, Curtis and Elliott are in the grey group using CI0043. Primer CI0018 distinguishes between Curtis (blue), Elliot (red) and Kanza (grey). Also, primer CI0025 distinguishes between Curtis (orange), Elliott (red) and Kanza (grey). Primer CI0051 is another option to distinguish between Curtis (blue), Elliott (red) and Kanza (grey). Primer CI0084 also distinguishes between Curtis (red), Elliott (green) and Kanza (grey).
13. Candy, Kanza, Syrup Mill and Witte are in the grey group using CI0025. Primer CI0001 can distinguish between Candy (red), Kanza (grey), Syrup Mill (orange) and Witte (blue).
14. Elliott and Witte are in the green group using CI0048. Primer CI0084 can distinguish between Witte (red) and Elliott (green), and Evers (blue) and Dependable 6-2 (grey).
15. Starking Hardy Giant (SHG), Caddo and Clark are the only ones in the green group with primer CI0012. Primer CI0029 distinguishes between Caddo (grey), Clark (red) and Starking Hardy Giant (blue).
16. W. Schley, Brooks, Burkett and Podsednik are in the orange group with primer CI0015. Primer CI0012 distinguishes between W. Schley and Brooks (red), and Burkett and Pod sednik (blue). Primer CI0089 separates W. Schley (grey) from Brooks (red).
17. Stuart, Jackson LA and Witte are in the blue group with primer CI0001. Primer CI0008B distinguishes between Jackson LA (green), Stuart (grey blue), and Witte (blue). Primer CI0048 also separates Jackson LA (blue), Stuart (red), and Witte (green).
18. Brooks and Caddo are the only two genotypes in the orange group using CI0062. CI0001 distinguishes between Caddo (grey) and Brooks (orange). Primer CI0084 also distinguishes between Caddo (blue) and Brooks (red). CI0012 is another alternative to distinguish Caddo (green) and Brooks (red). Finally, CI0053 also distinguishes between Caddo (orange) and Brooks (grey).
19. Giles, Shoshoni, Creek, Cape Fear and Syrup Mill are in the green group using CI0043. Primer CI0081 distinguishes between Cape Fear (orange), Giles, Shoshoni, Creek and Syrup Mill (grey). Primer CI0001 distinguishes between Syrup Mill (orange), Shoshoni and Evers (grey), and Giles and Creek (red). Primer CI008B differentiates Evers and Giles (grey blue), Creek (blue) and Shoshoni (orange).
20. Russell, Shoshoni and Syrup Mill are in the orange group using CI0008B. Primer CI0025 distinguishes between Russell (red), Shoshoni (orange) and Syrup Mill (grey). Also, primer CI0058 distinguishes Russell (grey), Shoshoni (red) and Syrup Mill (grey blue). In addition, primer CI0089 separates Russell (green), Shoshoni (red) and Syrup Mill (grey).
21. Amling, Oconee, Alley, Candy, Carter, Kanza and Major are in the grey group with primer CI0012. Primer CI0001 separates Amling and Oconee (orange group), Carter, Kanza and Major (grey), and Alley and Candy (red group). Primer CI0008B distinguishes between Amling and Kanza (grey blue), Carter (green), Major (red) and Oconee (blue).
22. Kiowa, Moreland and Zinner are the only ones in the orange group using primer CI0051. Primer CI0048 differentiates Kiowa (grey), Moreland (blue) and Zinner (red). Also, primer CI0084 differentiates Kiowa (red), Moreland (grey) and Zinner (blue).
23. Clark, Eclipse, Lakota and Pawnee are in the red group using primer CI0029. Primer CI0084 distinguishes between Pawnee (red), and Clark, Eclipse and Lakota (grey). Primer CI0001 distinguishes between Clark (grey), Eclipse (red) and Lakota (orange). Also, primer CI0051 distinguishes between Clark (grey), Eclipse (blue), and Lakota (red).
24. Excel, Podsednik, Curtis, Giles and Apache are in the blue group with primer CI0018. Primer CI0001 distinguishes Excel, Curtis and Podsednik (orange), from Apache (grey) and Giles (red). Primer CI0048 distinguishes between Excel (grey), Curtis (orange) and Podsednik (red).
25. Nacono, Shawnee, VC168, Barton, Caddo, Oconee, Odom, Wichita, Zinner, Evers and Excel are in the blue group of primer CI00084. Primer CI0001 separates Caddo, Evers, Nacono, Odom, Shawnee, VC168 and Zinner in the grey group while Barton, Excel, Oconee, and Wichita are in the orange group. Primer CI0008B separates Odom (green), Shawnee and Wichita (red), Caddo, Evers, Nacono and Zinner (grey blue), and Barton, Excel, Oconee and VC168 (blue). Primer CI0029, separates Nacono (blue) from Caddo, Evers, Zinner (grey). Primer CI001 distinguishes VC168 (grey) from Barton, Excel and Oconee (orange). Primer CI0015 distinguishes between Barton (grey), Excel (grey blue) and Oconee (red).
26. Schley, Sumner, Forkert, Apache, Caddo, Cape Fear, Jackson BW, Jenkins, Moreland, Sioux, Stuart and Zinner are in the orange group using primer CI0053. Primer CI0001 distinguishes between Stuart (blue), Cape Fear, Jenkins and Sioux (orange), and Apache, Caddo, Forkert, Jackson BW, Moreland, Schley, Sumner and Zinner (grey). Primer CI0051 distinguishes between Cape Fear (blue), Jenkins (grey blue) and Sioux (red). Primer CI0025 separates Caddo, Jackson BW and Sumner (orange), and Apache, Forkert, Moreland, Schley and Zinner (red). Primer CI0012 distinguishes Caddo (green), Jackson BW (blue) and Sumner (red). Primer CI0008B separates Forkert (blue) and Zinner (grey blue), from Apache, Moreland and Schley (red). Primer CI0018 distinguishes between Apache (blue), Moreland (red) and Schley (grey).
27. Cheyenne, Humble, Alley, Barton, Carter, Desirable 18-18, Desirable 5-2, Gafford, Kiowa, Oconee, Odom, Philema, Shoshoni, Syrup Mill and W. Schley are in the blue group with primer CI0053. Primer CI0025 separates Cheyenne, Barton, Carter, Desirable 18-18, Desirable 5-2, Kiowa, Oconee, Odom, Philema (red) from Humble, Alley, Gafford, Shoshoni, W. Schley (orange) and Syrup Mill (grey). Primer CI0018 distinguishes Humble (red) from Alley, Gafford, Shoshoni and W. Schley (grey). Primer CI0051 separates Cheyenne, Barton and Oconee (blue) and Carter, Desirable 18-18, Desirable 5-2, Odom and Philema (grey), and Kiowa (orange). Primer CI001 distinguishes between Cheyenne (grey) and Barton and Oconee (blue). Primer CI0012 separates Barton (red) and Oconee (grey).
28. Choctaw, Byrd, Elliott, Forkert, Giles, Lakota, Mohawk, Pawnee, Schley, Shawnee, Sioux, Sumner and Wichita are in the red group with primer CI0051. Primer CI0048 separates Choctaw, Sioux and Sumner (grey) from Byrd, Forkert, Giles, Lakota, Pawnee, Schley, Shawnee and Wichita (red), Elliott (green) and Mohawk (blue). Primer CI0058 distinguishes between Choctaw (red), Sioux (grey blue) and Sumner (grey).
29. Success, Russell, Desirable 5-2, Desirable 18-18, Philema, Apache, Burkett, Odom, Shawnee and VC168 are in the green group with primer CI0089. Primer CI001 separates accessions into two groups: Group 1=Success, Desirable 5-2, Desirable 18-18, Philema, Russell (orange), and Group 2=Apache, Burkett, Odom, Shawnee and VC168 (grey). Within Group 1, Primer CI0084 distinguishes Success (grey), Russell (green), Desirable 5-2, Desirable 18-18 and Philema (red). Primer CI0012 distinguishes Desirable 5-2 (red) from Desirable 18-18 and Philema (blue). Within Group 2, corresponding to entries in the grey group from Primer CI001, Primer CI0084 separates Apache and Burkett (red) and Odom, Shawnee and VC168 (blue). Primer CI008B distinguishes Apache (red) and Burkett (grey blue), and also distinguishes Odom (green), Shawnee (red) and VC168 (blue).
30. Dependable 6-2 (grey blue) and Dependable 8-3 (grey) are distinguished using primer CI0058. Dependable 6-2 originates from an experimental station in Louisiana while Dependable 8-3 originates from Austin, Tex. Both of these ORTET trees were sampled from the Carya germplasm collection located in College Station, Tex., which are maintained as separate ‘Dependable’ entries due to observed phenotypic differences between them.
31. Philema and Desirable 18-18. These are the only two pecan cultivars in which it is not possible to distinguish between them because they have the same genetic profile at all 16 SNP loci evaluated. ‘Desirable’ is reported as one of the first pecan cultivars developed from a controlled cross. The cross was made in the early 1900's by Carl F. Forkert from Ocean Springs, Jackson County, Miss. The parentage is unknown but may be ‘Success’בJewett’. ‘Desirable’ was introduced around 1915 but was not widely disseminated prior to Forkert's death in 1928. This cultivar would probably have been lost if scions had not been sent to the U.S. Pecan Field Station, in Philema, Ga. in 1925.
The SNPs developed in pecan, a valuable species with limited genomic resources, generated a resource of genome-wide SNPs not previously available. The transcriptome sequencing and HRM analysis were used for SNP discovery and validation of the predicted SNPs and these were then used for germplasm characterization and fingerprinting of pecan cultivars with both HRM and Sanger sequencing. The majority of pecan cultivars represented by the ORTET samples for each cultivar evaluated can be distinguished based on individual SNPs or a combination of SNPs. Opportunities for these approaches include the capacity to define the genetic identity of a pecan tree in an orchard based on a DNA fingerprint that enables growers to make educated decisions about how to manage their orchard, particularly as it relates to disease susceptibility and fungicide applications. Also, interested nurseries can certify the genetics of the trees they are selling and growers can make the choice to purchase DNA-certified or non-certified trees.
Thousands of SNPs were identified in pecan, a relevant horticultural species with limited genomic resources. The validated SNPs were used successfully to characterize specific pecan cultivars based on their genetic fingerprint. Additionally, the developed SNPs can also be used to evaluate the identity of grafted wood from pecan cultivars grown, identify the pecan cultivar of existing orchards at the DNA level, assess the genetic diversity of pecan cultivars, and understand the genetic determinants underlying variation for key traits of value to the pecan breeding programs. The identification of SNPs in pecans, in contrast to other types of sequence polymorphism, may result in differences between coding regions of genes that could be functionally important.
While the invention has been described in connection with specific embodiments thereof, it will be understood that the present invention is capable of further modifications by one of skill in the art. It is to be understood that, unless otherwise indicated, the present disclosure is not limited to particular materials, reagents, reaction materials, manufacturing processes, or the like, as such can vary. It is also to be understood that the terminology used herein is for purposes of describing particular embodiments only, and is not intended to be limiting. It is also possible in the present disclosure that steps can be executed in different sequence where this is logically possible. The present disclosure is therefore intended to encompass any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains and as may be applied to the essential features herein before set forth.
As used herein, the term “cultivar” will be understood to be the same as “variety,” such that the terms “pecan cultivar” and “pecan variety” will have the same meaning. The term “about” is used herein to indicate that a value includes the standard deviation of error for the device or method being employed to determine the value. The use of the term “or” in the claims is used to mean “and/or” unless explicitly indicated to refer to alternatives only or the alternatives are mutually exclusive, although the disclosure supports a definition that refers to only alternatives and to “and/or.” When not used in conjunction closed wording in the claims or specifically noted otherwise, the words “a” and “an” denote “one or more.”
The terms “comprise,” “have,” and “include” are open-ended linking verbs. Any forms or tenses of one or more of these verbs, such as “comprises,” “comprising,” “has,” “having,” “includes,” and “including,” are also open-ended. For example, any method that “comprises,” “has” or “includes” one or more steps is not limited to possessing only those one or more steps and also covers other unlisted steps. Similarly, any cell that “comprises,” “has” or “includes” one or more traits is not limited to possessing only those one or more traits and covers other unlisted traits.
All of the materials and methods disclosed and claimed herein can be made and used without undue experimentation as instructed by the above disclosure. Although the materials and methods of this invention have been described in terms of preferred embodiments and illustrative examples, it will be apparent to those of skill in the art that variations can be applied to the materials and methods described herein without departing from the concept, spirit and scope of this invention. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of this invention as defined by the appended claims.
This application claims the benefit of U.S. Provisional Patent Application Ser. No. 62/311,201, filed Mar. 21, 2016, which is herein incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
62311201 | Mar 2016 | US |