Compositions associated with soybean reproductive growth and methods of use

Information

  • Patent Grant
  • 9951391
  • Patent Number
    9,951,391
  • Date Filed
    Friday, March 7, 2014
    10 years ago
  • Date Issued
    Tuesday, April 24, 2018
    6 years ago
Abstract
Molecular markers associated with soybean reproductive stage, methods of their use, and compositions having one or more marker loci are provided. Methods comprise detecting at least one marker locus, detecting a haplotype, and/or detecting a marker profile. Methods may further comprise crossing a selected soybean plant with a second soybean plant. Isolated polynucleotides, primers, probes, kits, systems, etc., are also provided.
Description
FIELD OF THE INVENTION

This invention relates to compositions associated with reproductive stage in soybean plants and methods of their use.


BACKGROUND

Soybeans (Glycine max L. Merr.) are a major cash crop and investment commodity in North America and elsewhere. Soybean is the world's primary source of seed oil and seed protein. Improving soybean adaptation for various growing regions and environmental conditions is crucial for maximizing yields.


There remains a need for means to identify genomic regions associated with reproductive stages in soybean plants. The compositions and methods provide important tools for use in plant breeding programs to optimize or maximize the reproductive growth stage, and/or to develop varieties adapted for various growing regions or environments.


SUMMARY

Molecular markers associated with soybean reproductive stages, methods of their use, and compositions having one or more marker loci are provided. Methods comprise detecting at least one marker locus, detecting a haplotype, and/or detecting a marker profile. Methods may further comprise crossing a selected soybean plant with a second soybean plant. Isolated polynucleotides, primers, probes, kits, systems, etc., are also provided.





BRIEF DESCRIPTION OF THE FIGURES


FIG. 1 provides a list of markers/marker loci on linkage group A1 (chromosome 5) associated with soybean reproductive growth.



FIG. 2 provides a list of markers/marker loci on linkage group A2 (chromosome 8) associated with soybean reproductive growth.



FIG. 3 provides a list of markers/marker loci on linkage group B1 (chromosome 11) associated with soybean reproductive growth.



FIG. 4 provides a list of markers/marker loci on linkage group B2 (chromosome 14) associated with soybean reproductive growth.



FIG. 5 provides a list of markers/marker loci on linkage group C1 (chromosome 4) associated with soybean reproductive growth.



FIG. 6 provides a list of markers/marker loci on linkage group C2 (chromosome 6) associated with soybean reproductive growth.



FIG. 7 provides a list of markers/marker loci on linkage group D1a (chromosome 1) associated with soybean reproductive growth.



FIG. 8 provides a list of markers/marker loci on linkage group D1b (chromosome 2) associated with soybean reproductive growth.



FIG. 9 provides a list of markers/marker loci on linkage group D2 (chromosome 17) associated with soybean reproductive growth.



FIG. 10 provides a list of markers/marker loci on linkage group E (chromosome 15) associated with soybean reproductive growth.



FIG. 11 provides a list of markers/marker loci on linkage group F (chromosome 13) associated with soybean reproductive growth.



FIG. 12 provides a list of markers/marker loci on linkage group G (chromosome 18) associated with soybean reproductive growth.



FIG. 13 provides a list of markers/marker loci on linkage group H (chromosome 12) associated with soybean reproductive growth.



FIG. 14 provides a list of markers/marker loci on linkage group I (chromosome 20) associated with soybean reproductive growth.



FIG. 15 provides a list of markers/marker loci on linkage group J (chromosome 16) associated with soybean reproductive growth.



FIG. 16 provides a list of markers/marker loci on linkage group L (chromosome 19) associated with soybean reproductive growth.



FIG. 17 provides a list of markers/marker loci on linkage group M (chromosome 7) associated with soybean reproductive growth.



FIG. 18 provides a list of markers/marker loci on linkage group N (chromosome 3) associated with soybean reproductive growth.



FIG. 19 provides a list of markers/marker loci on linkage group O (chromosome 10) associated with soybean reproductive growth.





SUMMARY OF SEQUENCES

SEQ ID NOs: 1-512 comprise nucleotide sequences of regions of the soybean genome, each capable of being used as a probe or primer, either alone or in combination, for the detection of a marker locus associated with reproductive growth in soybean. In certain examples, Primer1 and Primer2 are used as allele specific primers and Probe1 and Probe2 are used as allele probes. The SEQ ID NOs provided in the “Region” column of the table below are each a genomic DNA region encompassing the respective marker locus. In some examples, the primers and/or probes detect the polymorphism on based on a polynucleotide complementary to the genomic region provided here. It is to be understood that the sequences provided are sufficient for one of skill in the art to detect a locus associated with reproductive growth in soybean regardless of the orientation (forward, or reverse) of the strand used for detection.


















Primer1
Primer2
Probe1
Probe2
Region



SEQ ID
SEQ ID
SEQ ID
SEQ ID
SEQ ID


Locus
NO:
NO:
NO:
NO:
NO:




















S01435-1
1
2
3
4
5


S01239-1
6
7
8
9
10


S00780-1
11
12
13
14
15


S06925-1
16
17
18
19
20


S09951-1
21
22
23
24
25


S00170-1
26
27
28
29
30


S04059-1
31
32
33
34
35


S07851-1
36
37
38
39
40


S11659-1
41
42
43
44
45


S04279-1
46
47
48
49
50


S02211-1
51
52
53
54
55


S08942-1
56
57
58
59
60


S05742-1
61
62
63
64
65


S09155-1
66
67
68
69
70


S02037-1
71
72
73
74
75


S13136-1
76
77
78
79
80


S17291-001

81
82
83
84


S13139-1
85
86
87
88
89


S17292-001

90
91
92
93


S13146-1
94
95
96
97
98


S17293-001

99
100
101
102


S17294-001

103
104
105
106


S17581-001
107
108
109
110
111


S17691-001
112
113
114

115


S17701-001
116
117
118
119
120


S03703-1
121
122
123
124
125


S17297-001

126
127
128
129


S17298-001

130
131
132
133


S17299-001

134
135
136
137


S17300-001

138
139
140
141


S17301-001

142
143
144
145


S17306-001

146
147
148
149


S17310-001

150
151
152
153


S17311-001

154
155
156
157


S17312-001

158
159
160
161


S17313-001

162
163
164
165


S17316-001

166
167
168
169


S17317-001

170
171
172
173


S17318-001

174
175
176
177


S17322-001

178
179
180
181


S17326-001

182
183
184
185


S17327-001

186
187
188
189


S17328-001

190
191
192
193


S17329-001

194
195
196
197


S10746-1
198
199
200
201
202


S17331-001

203
204
205
206


S17332-001

207
208
209
210


S17337-001

211
212
213
214


S13093-1
215
216
217
218
219


S12211-1
220
221
222
223
224


S04555-1
225
226
227
228
229


S08519-1
230
231
232
233
234


S12876-1
235
236
237
238
239


S05937-1
240
241
242
243
244


S08575-1
245
246
247
248
249


S08669-1
250
251
252
253
254


S11212-1
255
256
257
258
259


S00543-1
260
261
262
263
264


S01452-1
265
266
267
268
269


S11993-1
270
271
272
273
274


S13446-1
275
276
277
278
279


S00252-1
280
281
282
283
284


S04060-1
285
286
287
288
289


S02664-1
290
291
292
293
294


S00281-1
295
296
297
298
299


S01109-1
300
301
302
303
304


S13844-1
305
306
307
308
309


S05058-1
310
311
312
313
314


S04660-1
315
316
317
318
319


S09955-1
320
321
322
323
324


S08034-1
325
326
327
328
329


S10293-1
330
331
332
333
334


S03813-1
335
336
337
338
339


S02042-1
340
341
342
343
344


S16601-001
345
346
347
348
349


S01481-1
350
351
352
353
354


S11309-1
355
356
357
358
359


S11320-1
360
361
362
363
364


S04040-1
365
366
367
368
369


S00863-1
370
371
372
373
374


S17151-001

375
376
377
378


S17153-001

379
380
381
382


S17154-001

383
384
385
386


S17156-001

387
388
389
390


S17159-001

391
392
393
394


S08590-1
395
396
397
398
399


S17242-001

400
401
402
403


S17166-001
404
405
406
407
408


S17167-001
409
410
411
412
413


S08539-1
414
415
416
417
418


S17178-001

419
420
421
422


S17179-001

423
424
425
426


S17180-001

427
428
429
430


S17181-001

431
432
433
434


S17182-001

435
436
437
438


S17183-001

439
440
441
442


S02780-1
443
444
445
446
447


S12107-1
448
449
450
451
452


S03624-1
453
454
455
456
457


S01953-1
458
459
460
461
462


S00111-1
463
464
465
466
467


S04180-1
468
469
470
471
472


S01008-1
473
474
475
476
477


S12862-1
478
479
480
481
482


S12867-1
483
484
485
486
487


S04966-1
488
489
490
491
492


S10631-1
493
494
495
496
497


S01574-1
498
499
500
501
502


S16594-001
503
504
505
506
507


S02777-1
508
509
510
511
512









DETAILED DESCRIPTION

The timing of soybean flowering and maturity are important agronomical traits that are associated with yield. These traits are largely affected by the genetic response to environmental signals such as day-length and temperature. Through selective breeding for flowering and maturity phenotypes, soybean varieties have been developed that are ideally suited for maximizing yield within a particular environment. Field testing for reproductive characteristics is laborious and challenging, and it cannot be accomplished until late in the plant life cycle. Having markers that can be used to select for reproductive growth expedite the introgression of desired alleles into elite cultivars.


Multiple genetic loci have been identified as containing genes that control the reproductive growth period of soybean. Relative maturity (RM) in soybean plays a significant role in determining final seed yield, and it is common for seed yield and the length of reproductive growth to have a positive correlation. Extending the reproductive period through manipulation of these loci is important for maximizing yield potential. However, it is important to evaluate soybean varieties in the correct environments. Utilizing markers associated with soybean reproductive growth that distinguish between early and late alleles, such as early and late alleles for initiation of flowering, provides the ability to segregate soybean populations into the correct testing environment, without having to conduct a preliminary progeny test on the line to identify an appropriate environment. It is also desirable to increase genetic diversity by crossing soybeans line with disparate reproductive habits, such as late flowering by early flowering crosses. This process has been utilized with limited success in the past due to the low frequency of desirable segregates that have a specific reproductive periods for the target area of adaptation environment. By utilizing molecular markers associated with reproductive growth, a breeder can identify plants in early generations which likely will have reproductive characteristics for the target environment, rather than having to phenotype and select a preferred reproductive growth phenotype in a previous growing season, therefore saving time and other resources. For example, a parent with relative maturity (RM) of 3.1 crossed with a second parent with RM 1.7 will produce progeny with an expected RM range from about 1.5 to about 3.5. If the breeder is only interested in testing the lines from this population that are <2.0 RM, the breeder would have to grow out a large number of progeny and select only those that mature as <2.0 RM. But, using molecular markers associated with reproductive growth, single plants can be selected having a <2.0 RM by selecting preferred locus, allele, haplotype, and/or marker profile. It is also desirable to increase the amount of time a soybean plant is in the reproductive growth stage. For example, one could select for an earlier flowering date without affecting the pod maturity.


Nucleotide polymorphisms, including SNPs as well as insertions/deletions (INDELs) have been identified that are closely linked to and in linkage disequilibrium (LD) with the reproductive growth loci in soybean. These polymorphisms allow for marker-assisted selection (MAS) of these loci, expediting the creation and precise selection soybean plants with a desired reproductive growth phenotype. This will allow for more precision in developing varieties tailored to a particular environment.


At least eight loci affecting flowering and maturity, known as E genes (E1-E8), have been identified (see, e.g., Cober et al. (1996) Crop Sci 36:601-605; Cober et al. (1996) Crop Sci 36:606-610; Asumadu et al. (1998) Ann Bot 82:773-778; Cober et al. (2001) Crop Sci 41:721-727; Abe et al. (2003) Crop Sci 43:1300-1304; Tasma & Shoemaker (2003) Crop Sci 41:319-328; Cober & Voldeng (2001) Crop Sci 41:698-701; Cober & Voldeng (2001) Crop Sci 41:1823-1926; and, Cober et al. (2010) Crop Sci 50:524-527). The E1, E2, and E3 loci have been recently cloned and found to encode a nuclear localized E1 protein (Xia et al. (2012) Proc Natl Acad Sci USA doi/10.1073/pnas.1117982109 E2155-E2164), a GIGANTEA homolog (Watanabe et al. (2011) Genetics 188:395-407), and a phytochrome A homolog respectively (Watanabe et al. (2009) Genetics 182:1251-1262). Recessive loss-of-function mutant alleles at these three loci can independently condition earlier flowering phenotypes.


A method for identifying a soybean plant or germplasm having a trait locus associated with reproductive growth, the method comprising detecting at least one allele of one or more marker loci associated with reproductive growth in soybean is provided. In some examples, a trait locus associated with reproductive growth is a locus associated with reproductive development, time to initiation of flowering (R1), time from planting to initiation of flowering (R1), time from emergence (VE) to initiation of flowering (R1), early flowering, length of reproductive growth, time from initiation of flowering (R1) to pod fill, length of flowering, time from initiation of flowering (R1) to beginning maturity (R7), time to full bloom (R2), time from first trifoliate (V1) to pre-flowering (V6), and the like.


In some examples, the method involves detecting at least one marker locus associated with reproductive growth in soybean. In some examples the method comprises detecting at least one polymorphism within 30 cM of a marker locus on LG A1 (ch 5), LG A2 (ch 8), LG B1 (ch 11), LG B2 (ch 14), LG C1 (ch 4), LG C2 (ch 6), LG D1a (ch 1), LG D1b (ch 2), LG D2 (ch 17), LG E (ch 15), LG F (ch 13), LG G (ch 18), LG H (ch 12) LG I (ch 20), LG J (ch 16), LG L (ch 19), LG M (ch 7), LG N (ch 3), and/or LG O (ch 10), or any combination thereof. In some examples the method comprises detecting at least one polymorphism within about 0-25 cM, 0-20 cM, 0-15 cM, 0-10 cM, 0-5 cM, or about 0-2.5 cM on LG A1 (ch 5), LG A2 (ch 8), LG B1 (ch 11), LG B2 (ch 14), LG C1 (ch 4), LG C2 (ch 6), LG D1a (ch 1), LG D1b (ch 2), LG D2 (ch 17), LG E (ch 15), LG F (ch 13), LG G (ch 18), LG H (ch 12) LG I (ch 20), LG J (ch 16), LG L (ch 19), LG M (ch 7), LG N (ch 3), and/or LG O (ch 10), or any combination thereof.


In some examples the method comprises detecting at least one polymorphism within about 0-50 kb, 0-100 kb, 0-200 kb, 0-500 kb, 0-750 kb, or about 0-1000 kb on LG A1 (ch 5), LG A2 (ch 8), LG B1 (ch 11), LG B2 (ch 14), LG C1 (ch 4), LG C2 (ch 6), LG D1a (ch 1), LG D1b (ch 2), LG D2 (ch 17), LG E (ch 15), LG F (ch 13), LG G (ch 18), LG H (12), LG I (ch 20), LG J (ch 16), LG L (ch 19), LG M (ch 7), LG N (ch 3), and/or LG O (ch 10), or any combination thereof.


In some examples the method comprises detecting at least one polymorphism linked to a marker locus selected from the group consisting of S01435-1 on LG A1 (ch 5), S01239-1 and/or S00780-1 on LG A2 (ch 8), S06925-1, S09951-1, and/or S00170-1 on LG B1 (ch 11), S04059-1 and/or S07851-1 on LG B2 (ch 14), S11659-1, S04279-1, S02211-1, and/or S08942-1 on LG C1 (ch 4), S05742-1, S09155-1, S02037-1, S13136-1, S17291-001, S13139-1, S17292-001, S13146-1, S17293-001, S17294-001, S17581-001, S17691-001, S17701-001, S03703-1, S17297-001, S17298-001, S17299-001, S17300-001, S17306-001, S17310-001, S17311-001, S17312-001, S17312-001, S17316-001, S17317-001, S17318-001, S17322-001, S17326-001, S17327-001, S17328-001, S17329-001, S10746-1, S17331-001, S17332-001, S17337-001, S13093-1, S12211-1, S04555-1, and/or S17301-001 on LG C2 (ch 6), S08519-1 on LG D1a (ch 1), S12876-1, S05937-1, S08575-1, S08669-1, S11212-1, and/or S00543-1 on LG D1b (ch 2), S01452-1 and/or S11993-1 on LG D2 (ch 17), S13446-1 on LG E (ch 15), S00252-1, S04060-1, S02664-1, and/or S00281-1 on LG F (ch 13), S01109-1, S13844-1, S05058-1 and/or S04660-1 on LG G (ch 18), S09955-1 on LG H (ch 12), S08034-1 and/or S10293-1 on LG I (ch 20), S03813-1 and/or S02042-1 on LG J (ch 16), S16601-001, S01481-1, S11309-1, S11320-1 and/or S04040-1 on LG L (ch 19), S00863-1, S17151-001, S17153-001, S17154-001, S17156-001, S17159-001, S08590-1, S17242-001, S17166-001, S17167-001, S08539-1, S17178-001, S17179-001, S17180-001, S17181-001, S17182-001, S17183-001, S02780-1, S12107-1, S03624-1, S01953-1, S00111-1, S04180-1, and/or S01008-1 on LG M (ch 7), S12861-1, S04966-1, and/or S12867-1 on LG N (ch 3), and S10631-1, S01574-1, S16594-001, and/or S02777-1 on LG O (ch 10), or any combination thereof.


In some examples the method comprises detecting at least one polymorphism within about 0-25 cM, 0-20 cM, 0-15 cM, 0-10 cM, 0-5 cM, or about 0-2.5 cM of a marker locus selected from the group consisting of S01435-1 on LG A1 (ch 5), S01239-1 and/or S00780-1 on LG A2 (ch 8), S06925-1, S09951-1, and/or S00170-1 on LG B1 (ch 11), S04059-1 and/or S07851-1 on LG B2 (ch 14), S11659-1, S04279-1, S02211-1, and/or S08942-1 on LG C1 (ch 4), S05742-1, S09155-1, S02037-1, S13136-1, S17291-001, S13139-1, S17292-001, S13146-1, S17293-001, S17294-001, S17581-001, S17691-001, S17701-001, S03703-1, S17297-001, S17298-001, S17299-001, S17300-001, S17306-001, S17310-001, S17311-001, S17312-001, S17312-001, S17316-001, S17317-001, S17318-001, S17322-001, S17326-001, S17327-001, S17328-001, S17329-001, S10746-1, S17331-001, S17332-001, S17337-001, S13093-1, S12211-1, S04555-1, and/or S17301-001 on LG C2 (ch 6), S08519-1 on LG D1a (ch 1), S12876-1, S05937-1, S08575-1, S08669-1, S11212-1, and/or S00543-1 on LG D1b (ch 2), S01452-1 and/or S11993-1 on LG D2 (ch 17), S13446-1 on LG E (ch 15), S00252-1, S04060-1, S02664-1, and/or S00281-1 on LG F (ch 13), S01109-1, S13844-1, S05058-1 and/or S04660-1 on LG G (ch 18), S09955-1 on LG H (ch 12), S08034-1 and/or S10293-1 on LG I (ch 20), S03813-1 and/or S02042-1 on LG J (ch 16), S16601-001, S01481-1, S11309-1, S11320-1, and/or S04040-1 on LG L (ch 19), S00863-1, S17151-001, S17153-001, S17154-001, S17156-001, S17159-001, S08590-1, S17242-001, S17166-001, S17167-001, S08539-1, S17178-001, S17179-001, S17180-001, S17181-001, S17182-001, S17183-001, S02780-1, S12107-1, S03624-1, S01953-1, S00111-1, S04180-1, and/or S01008-1 on LG M (ch 7), S12861-1, S04966-1, and/or S12867-1 on LG N (ch 3), and S10631-1, S01574-1, S16594-001, and/or S02777-1 on LG O (ch 10), or any combination thereof.


In some examples the method comprises detecting at least one polymorphism within about 0-50 kb, 0-100 kb, 0-200 kb, 0-500 kb, 0-750 kb, or about 0-1000 kb of a marker locus selected from the group consisting of S01435-1 on LG A1 (ch 5), S01239-1 and/or S00780-1 on LG A2 (ch 8), S06925-1, S09951-1, and/or S00170-1 on LG B1 (ch 11), S04059-1 and/or S07851-1 on LG B2 (ch 14), S11659-1, S04279-1, S02211-1, and/or S08942-1 on LG C1 (ch 4), S05742-1, S09155-1, S02037-1, S13136-1, S17291-001, S13139-1, S17292-001, S13146-1, S17293-001, S17294-001, S17581-001, S17691-001, S17701-001, S03703-1, S17297-001, S17298-001, S17299-001, S17300-001, S17306-001, S17310-001, S17311-001, S17312-001, S17312-001, S17316-001, S17317-001, S17318-001, S17322-001, S17326-001, S17327-001, S17328-001, S17329-001, S10746-1, S17331-001, S17332-001, S17337-001, S13093-1, S12211-1, S04555-1, and/or S17301-001 on LG C2 (ch 6), S08519-1 on LG D1a (ch 1), S12876-1, S05937-1, S08575-1, S08669-1, S11212-1, and/or S00543-1 on LG D1b (ch 2), S01452-1 and/or S11993-1 on LG D2 (ch 17), S13446-1 on LG E (ch 15), S00252-1, S04060-1, S02664-1, and/or S00281-1 on LG F (ch 13), S01109-1, S13844-1, S05058-1 and/or S04660-1 on LG G (ch 18), S09955-1 on LG H (ch 12), S08034-1 and/or S10293-1 on LG I (ch 20), S03813-1 and/or S02042-1 on LG J (ch 16), S16601-001, S01481-1, S11309-1, S11320-1, and/or S04040-1 on LG L (ch 19), S00863-1, S17151-001, S17153-001, S17154-001, S17156-001, S17159-001, S08590-1, S17242-001, S17166-001, S17167-001, S08539-1, S17178-001, S17179-001, S17180-001, S17181-001, S17182-001, S17183-001, S02780-1, S12107-1, S03624-1, S01953-1, S00111-1, S04180-1, and/or S01008-1 on LG M (ch 7), S12861-1, S04966-1, and/or S12867-1 on LG N (ch 3), and S10631-1, S01574-1, S16594-001, and/or S02777-1 on LG O (ch 10), or any combination thereof.


In some examples the method comprises detecting at least one polymorphism closely linked to a marker locus selected from the group consisting of S01435-1 on LG A1 (ch 5), S01239-1 and/or S00780-1 on LG A2 (ch 8), S06925-1, S09951-1, and/or S00170-1 on LG B1 (ch 11), S04059-1 and/or S07851-1 on LG B2 (ch 14), S11659-1, S04279-1, S02211-1, and/or S08942-1 on LG C1 (ch 4), S05742-1, S09155-1, S02037-1, S13136-1, S17291-001, S13139-1, S17292-001, S13146-1, S17293-001, S17294-001, S17581-001, S17691-001, S17701-001, S03703-1, S17297-001, S17298-001, S17299-001, S17300-001, S17306-001, S17310-001, S17311-001, S17312-001, S17312-001, S17316-001, S17317-001, S17318-001, S17322-001, S17326-001, S17327-001, S17328-001, S17329-001, S10746-1, S17331-001, S17332-001, S17337-001, S13093-1, S12211-1, S04555-1, and/or S17301-001 on LG C2 (ch 6), S08519-1 on LG D1a (ch 1), S12876-1, S05937-1, S08575-1, S08669-1, S11212-1, and/or S00543-1 on LG D1b (ch 2), S01452-1 and/or S11993-1 on LG D2 (ch 17), S13446-1 on LG E (ch 15), S00252-1, S04060-1, S02664-1, and/or S00281-1 on LG F (ch 13), S01109-1, S13844-1, S05058-1 and/or S04660-1 on LG G (ch 18), S09955-1 on LG H (ch 12), S08034-1 and/or S10293-1 on LG I (ch 20), S03813-1 and/or S02042-1 on LG J (ch 16), S16601-001, S01481-1, S11309-1, S11320-1, and/or S04040-1 on LG L (ch 19), S00863-1, S17151-001, S17153-001, S17154-001, S17156-001, S17159-001, S08590-1, S17242-001, S17166-001, S17167-001, S08539-1, S17178-001, S17179-001, S17180-001, S17181-001, S17182-001, S17183-001, S02780-1, S12107-1, S03624-1, S01953-1, S00111-1, S04180-1, and/or S01008-1 on LG M (ch 7), S12861-1, S04966-1, and/or S12867-1 on LG N (ch 3), and S10631-1, S01574-1, S16594-001, and/or S02777-1 on LG O (ch 10), or any combination thereof.


In some examples the method comprises detecting at least one polymorphism in a marker locus selected from the group consisting of S01435-1 on LG A1 (ch 5), S01239-1 and/or S00780-1 on LG A2 (ch 8), S06925-1, S09951-1, and/or S00170-1 on LG B1 (ch 11), S04059-1 and/or S07851-1 on LG B2 (ch 14), S11659-1, S04279-1, S02211-1, and/or S08942-1 on LG C1 (ch 4), S05742-1, S09155-1, S02037-1, S13136-1, S17291-001, S13139-1, S17292-001, S13146-1, S17293-001, S17294-001, S17581-001, S17691-001, S17701-001, S03703-1, S17297-001, S17298-001, S17299-001, S17300-001, S17306-001, S17310-001, S17311-001, S17312-001, S17312-001, S17316-001, S17317-001, S17318-001, S17322-001, S17326-001, S17327-001, S17328-001, S17329-001, S10746-1, S17331-001, S17332-001, S17337-001, S13093-1, S12211-1, S04555-1, and/or S17301-001 on LG C2 (ch 6), S08519-1 on LG D1a (ch 1), S12876-1, S05937-1, S08575-1, S08669-1, S11212-1, and/or S00543-1 on LG D1b (ch 2), S01452-1 and/or S11993-1 on LG D2 (ch 17), S13446-1 on LG E (ch 15), S00252-1, S04060-1, S02664-1, and/or S00281-1 on LG F (ch 13), S01109-1, S13844-1, S05058-1 and/or S04660-1 on LG G (ch 18), S09955-1 on LG H (ch 12), S08034-1 and/or S10293-1 on LG I (ch 20), S03813-1 and/or S02042-1 on LG J (ch 16), S16601-001, S01481-1, S11309-1, S11320-1, and/or S04040-1 on LG L (ch 19), S00863-1, S17151-001, S17153-001, S17154-001, S17156-001, S17159-001, S08590-1, S17242-001, S17166-001, S17167-001, S08539-1, S17178-001, S17179-001, S17180-001, S17181-001, S17182-001, S17183-001, S02780-1, S12107-1, S03624-1, S01953-1, S00111-1, S04180-1, and/or S01008-1 on LG M (ch 7), S12861-1, S04966-1, and/or S12867-1 on LG N (ch 3), and S10631-1, S01574-1, S16594-001, and/or S02777-1 on LG O (ch 10), or any combination thereof.


In some examples, the method comprises detecting a polymorphism using at least one marker selected from the group consisting of a marker selected from the group consisting of S01435-1-001 on LG A1 (ch 5), S01239-1-A and/or S00780-1-A on LG A2 (ch 8), S06925-1-Q1, S09951-1-Q1, and/or S00170-1-A on LG B1 (ch 11), S04059-1-A and/or S07851-1-Q1 on LG B2 (ch 14), S11659-1-Q1, S04279-1-A, S02211-1-A, and/or S08942-1-Q1 on LG C1 (ch 4), S05742-1-Q1, S09155-1-Q1, S02037-1-A, S13136-1-Q1, S17291-001-K001, S13139-1-Q1, S17292-001-K001, S13146-1-Q1, S17293-001-K001, S17294-001-K001, S17581-001-Q008, S17691-001-Q001, S17701-001-Q001, S03703-1-Q1, S17297-001-K001, S17298-001-K001, S17299-001-K001, S17300-001-K001, S17306-001-K001, S17310-001-K001, S17311-001-K001, S17312-001-K001, S17312-001-K001, S17316-001-K001, S17317-001-K001, S17318-001-K001, S17322-001-K001, S17326-001-K001, S17327-001-K001, S17328-001-K001, S17329-001-K001, S10746-1-Q1, S17331-001-K001, S17332-001-K001, S17337-001-K001, S13093-1-Q1, S12211-1-Q1, S04555-1-Q1, and/or S17301-001-K001 on LG C2 (ch 6), S08519-1-Q1 on LG D1a (ch 1), S12876-1-Q1, S05937-1-Q1, S08575-1-Q1, S08669-1-Q1, S11212-1-Q1, and/or S00543-1-A on LG D1b (ch 2), S01452-1-A and/or S11993-1-Q2 on LG D2 (ch 17), S13446-1-Q1 on LG E (ch 15), S00252-1-A, S04060-1-A, S02664-1-A, and/or S00281-1-A on LG F (ch 13), S01109-1-Q002, S13844-1-Q1, S05058-1-Q1 and/or S04660-1-A on LG G (ch 18), S09955-1-Q1 on LG H (ch 12), S08034-1-Q1 and/or S10293-1-Q1 on LG I (ch 20), S03813-1-A and/or S02042-1-A on LG J (ch 16), S16601-001-Q001, S01481-1-A, S11309-1-Q1, S11320-1-Q1, and/or S04040-1-A on LG L (ch 19), S00863-1-A, S17151-001-K001, S17153-001-K001, S17154-001-K001, S17156-001-K001, S17159-001-K001, S08590-1-Q1, S17242-001-K001, S17166-001-Q006, S17167-001-Q007, S08539-1-Q1, S17178-001-K001, S17179-001-K001, S17180-001-K001, S17181-001-K001, S17182-001-K001, S17183-001-K001, S02780-1-Q1, S12107-1-Q1, S03624-1-Q001, S01953-1-A, S00111-1-A, S04180-1-A, and/or S01008-1-B on LG M (ch 7), S12861-1-Q1, S04966-1-Q1, and/or S12867-1-Q002 on LG N (ch 3), and S10631-1-Q1, S01574-1-A, S16594-001-Q10, and/or S02777-1-A on LG O (ch 10), or any combination thereof.


In other examples, the method involves detecting a haplotype comprising two or more marker loci, for example, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or 31 marker loci, or more. In certain examples, the haplotype comprises two or more markers selected from the group consisting of S01435-1-001 on LG A1 (ch 5), S01239-1-A and/or S00780-1-A on LG A2 (ch 8), S06925-1-Q1, S09951-1-Q1, and/or S00170-1-A on LG B1 (ch 11), S04059-1-A and/or S07851-1-Q1 on LG B2 (ch 14), S11659-1-Q1, S04279-1-A, S02211-1-A, and/or S08942-1-Q1 on LG C1 (ch 4), S05742-1-Q1, S09155-1-Q1, S02037-1-A, S13136-1-Q1, S17291-001-K001, S13139-1-Q1, S17292-001-K001, S13146-1-Q1, S17293-001-K001, S17294-001-K001, S17581-001-Q008, S17691-001-Q001, S17701-001-Q001, S03703-1-Q1, S17297-001-K001, S17298-001-K001, S17299-001-K001, S17300-001-K001, S17306-001-K001, S17310-001-K001, S17311-001-K001, S17312-001-K001, S17312-001-K001, S17316-001-K001, S17317-001-K001, S17318-001-K001, S17322-001-K001, S17326-001-K001, S17327-001-K001, S17328-001-K001, S17329-001-K001, S10746-1-Q1, S17331-001-K001, S17332-001-K001, S17337-001-K001, S13093-1-Q1, S12211-1-Q1, S04555-1-Q1, and/or S17301-001-K001 on LG C2 (ch 6), S08519-1-Q1 on LG D1a (ch 1), S12876-1-Q1, S05937-1-Q1, S08575-1-Q1, S08669-1-Q1, S11212-1-Q1, and/or S00543-1-A on LG D1b (ch 2), S01452-1-A and/or S11993-1-Q2 on LG D2 (ch 17), S13446-1-Q1 on LG E (ch 15), S00252-1-A, S04060-1-A, S02664-1-A, and/or S00281-1-A on LG F (ch 13), S01109-1-Q002, S13844-1-Q1, S05058-1-Q1 and/or S04660-1-A on LG G (ch 18), S09955-1-Q1 on LG H (ch 12), S08034-1-Q1 and/or S10293-1-Q1 on LG I (ch 20), S03813-1-A and/or S02042-1-A on LG J (ch 16), S16601-001-Q001, S01481-1-A, S11309-1-Q1, S11320-1-Q1, and/or S04040-1-A on LG L (ch 19), S00863-1-A, S17151-001-K001, S17153-001-K001, S17154-001-K001, S17156-001-K001, S17159-001-K001, S08590-1-Q1, S17242-001-K001, S17166-001-Q006, S17167-001-Q007, S08539-1-Q1, S17178-001-K001, S17179-001-K001, S17180-001-K001, S17181-001-K001, S17182-001-K001, S17183-001-K001, S02780-1-Q1, S12107-1-Q1, S03624-1-Q001, S01953-1-A, S00111-1-A, S04180-1-A, and/or S01008-1-B on LG M (ch 7), S12861-1-Q1, S04966-1-Q1, and/or S12867-1-Q002 on LG N (ch 3), and S10631-1-Q1, S01574-1-A, S16594-001-Q10, and/or S02777-1-A on LG O (ch 10), or any combination thereof. In further examples, the haplotype comprises markers from the set of markers described in FIGS. 1-19, or the set of markers described in Table 26-27.


In other examples, the method involves detecting a marker profile comprising two or more marker loci, for example, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or 31 marker loci, or more. In some examples the method uses marker assisted selection to stack two or more loci in a soybean plant, cell, seed, or germplasm. In some examples the method uses a marker profile to produce a soybean plant, cell, seed, or germplasm having a desired predicted flowering time. In some examples the desired predicted flowering time, is a desired flowering time for a specific adapted growing zones or area of adaptability, including but not limited to day length, latitude, environmental class, management zone, maturity group and/or relative maturity. In some examples, the area of adaptability may include using soybean to produce a second crop during a growing season. Second crops are commonly planted in areas with longer growing seasons, however the selected crop may need different reproductive characteristics to be adapted for the second growing cycle in the season than it would for the first growing cycle of the season. Any method of environmental classification can be used, including but not limited to those described in U.S. Pat. No. 8,032,389, and Loeffler et al. (2005) Crop Sci 45:1708-1716, each of which is herein incorporated by reference in its entirety. In certain examples, the marker profile comprises two or more markers selected from the group consisting of S01435-1-001 on LG A1 (ch 5), S01239-1-A and/or S00780-1-A on LG A2 (ch 8), S06925-1-Q1, S09951-1-Q1, and/or S00170-1-A on LG B1 (ch 11), S04059-1-A and/or S07851-1-Q1 on LG B2 (ch 14), S11659-1-Q1, S04279-1-A, S02211-1-A, and/or S08942-1-Q1 on LG C1 (ch 4), S05742-1-Q1, S09155-1-Q1, S02037-1-A, S13136-1-Q1, S17291-001-K001, S13139-1-Q1, S17292-001-K001, S13146-1-Q1, S17293-001-K001, S17294-001-K001, S17581-001-Q008, S17691-001-Q001, S17701-001-Q001, S03703-1-Q1, S17297-001-K001, S17298-001-K001, S17299-001-K001, S17300-001-K001, S17306-001-K001, S17310-001-K001, S17311-001-K001, S17312-001-K001, S17312-001-K001, S17316-001-K001, S17317-001-K001, S17318-001-K001, S17322-001-K001, S17326-001-K001, S17327-001-K001, S17328-001-K001, S17329-001-K001, S10746-1-Q1, S17331-001-K001, S17332-001-K001, S17337-001-K001, S13093-1-Q1, S12211-1-Q1, S04555-1-Q1, and/or S17301-001-K001 on LG C2 (ch 6), S08519-1-Q1 on LG D1a (ch 1), S12876-1-Q1, S05937-1-Q1, S08575-1-Q1, S08669-1-Q1, S11212-1-Q1, and/or S00543-1-A on LG D1b (ch 2), S01452-1-A and/or S11993-1-Q2 on LG D2 (ch 17), S13446-1-Q1 on LG E (ch 15), S00252-1-A, S04060-1-A, S02664-1-A, and/or S00281-1-A on LG F (ch 13), S01109-1-Q002, S13844-1-Q1, S05058-1-Q1 and/or S04660-1-A on LG G (ch 18), S09955-1-Q1 on LG H (ch 12), S08034-1-Q1 and/or S10293-1-Q1 on LG I (ch 20), S03813-1-A and/or S02042-1-A on LG J (ch 16), S16601-001-Q001, S01481-1-A, S11309-1-Q1, S11320-1-Q1, and/or S04040-1-A on LG L (ch 19), S00863-1-A, S17151-001-K001, S17153-001-K001, S17154-001-K001, S17156-001-K001, S17159-001-K001, S08590-1-Q1, S17242-001-K001, S17166-001-Q006, S17167-001-Q007, S08539-1-Q1, S17178-001-K001, S17179-001-K001, S17180-001-K001, S17181-001-K001, S17182-001-K001, S17183-001-K001, S02780-1-Q1, S12107-1-Q1, S03624-1-Q001, S01953-1-A, S00111-1-A, S04180-1-A, and/or S01008-1-B on LG M (ch 7), S12861-1-Q1, S04966-1-Q1, and/or S12867-1-Q002 on LG N (ch 3), and S10631-1-Q1, S01574-1-A, S16594-001-Q10, and/or S02777-1-A on LG O (ch 10), or any combination thereof. In further examples, the marker profile comprises markers from the set of markers described in FIGS. 1-19, or the set of markers described in Tables 26-27.


In other examples, the one or more marker locus detected comprises one or more markers within a chromosome interval selected from the group consisting of an interval on linkage group A1 flanked by and including Satt364 and BARC-020479-04637, an interval on linkage group A2 flanked by and including S01239-1 and S00780-1, an interval on linkage group B1 flanked by and including S06925-1 and S00170-1, an interval on linkage group B2 flanked by and including S04059-1-A and S07851-1-Q1, an interval on linkage group B2 flanked by and including BARC-052789-11619 and BARC-013273-00464, an interval on linkage group C1 flanked by and including S11659-1 and S02211-1, an interval on linkage group C1 flanked by and including BARC-042189-08197 and BARC-019093-0331, an interval on linkage group C1 flanked by and including S02211-1 and S08942-1, an interval on linkage group C2 flanked by and including S05742-1 and BARCSOYSSR_06_0283, an interval on linkage group C2 flanked by and including S05742-1 and BARC-035239-07157, an interval on linkage group C2 flanked by and including BARC-0299-06757 and Satt322, an interval on linkage group C2 flanked by and including S13136-1 and S17294-001, an interval on linkage group C2 flanked by and including S17297-001 and S17317-001, an interval on linkage group C2 flanked by and including S17318-001 and S17331-001, an interval on linkage group D1a flanked by and including BARC-024147-04784 and BARC-045297-08928, an interval on linkage group D1b flanked by and including BARC-029753-06334 and BARC-013995-01298, an interval on linkage group D1b flanked by and including S12876-1 and S08575-1, an interval on linkage group D1b flanked by and including S08669-1 and S11212-1, an interval on linkage group D1b flanked by and including S08575-1 and S08669-1, an interval on linkage group D2 flanked by and including Satt389 and BARC-040583-007786, an interval on linkage group D2 flanked by and including S01452-1 and S11993-1, an interval on linkage group E flanked by and including BARC-020425-04614 and Satt231, an interval on linkage group F flanked by and including S00252-1 and Sat_039, an interval on linkage group F flanked by and including S04060-1 and S00281-1, an interval on linkage group G flanked by and including BARC-020027-04405 and Satt309, an interval on linkage group G flanked by and including S13844-1 and BARC-013305-00475, an interval on linkage group G flanked by and including Sat_064 and BARC-013305-00475, an interval on linkage group H flanked by and including BARC-018437-03181 and Satt629, an interval on linkage group I flanked by and including S08034-1 and S10293-1, an interval on linkage group I flanked by and including S10293-1 and Satt299, an interval on linkage group J flanked by and including Sct_046 and Satt693, an interval on linkage group J flanked by and including Satt547 and BARC-030817-06946, an interval on linkage group M flanked by and including S00863-1 and S17167-001, an interval on linkage group M flanked by and including BARCSOYSSR_07_0017 and S08590-1, an interval on linkage group M flanked by and including S08590-1 and S17167-001, an interval on linkage group M flanked by and including S00111-1 and Sat_121, an interval on linkage group M flanked by and including S00111-1 and, and/or S01008-1, an interval on linkage group N flanked by and including Sat_236 and Satt339, an interval on linkage group N flanked by and including S12862-1 and S12867-1, an interval on linkage group N flanked by and including Sat_125 and BARC-039729-07559, and an interval on linkage group O flanked by and including S02777-1 and BARC-029629-06265, or any interval provided in FIGS. 1-19.


In further examples, the one or more marker locus detected comprises one or more markers within one or more of the genomic DNA regions of SEQ ID NOs: 1-512. In other examples, the one or more marker locus detected comprises one or more markers within one or more of the genomic regions of SEQ ID NOs:5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 84, 89, 93, 98, 102, 106, 111, 115, 120, 125, 129, 133, 137, 141, 145, 149, 153, 157, 161, 165, 169, 173, 177, 181, 185, 189, 193, 197, 202, 206, 210, 214, 219, 224, 229, 234, 239, 244, 249, 254, 259, 264, 269, 274, 279, 284, 289, 294, 299, 304, 309, 314, 319, 324, 329, 334, 339, 344, 349, 354, 359, 364, 369, 374, 378, 382, 386, 390, 394, 399, 403, 408, 413, 418, 422, 426, 430, 434, 438, 442, 447, 452, 457, 462, 467, 472, 477, 482, 487, 492, 497, 502, 507, or, 512. In some examples, the one or more polymorphism detected may be less than 1 cM, 1 cM, 5 cM, 10 cM, 15 cM, 20 cM, or 30 cM from SEQ ID NOs: 1-512. In further examples, the one or more marker locus detected comprises one or more markers within a chromosome interval described in FIGS. 1-19.


In some examples, the method comprises detecting one or more polymorphisms linked to one or more loci, said loci comprising a polymorphism selected from the group consisting of Gm05:30568085, Gm08:7464336, Gm08:15841570, Gm11:4674824, Gm11:5231500, Gm11:7847341, Gm14:46138053, Gm14:47331319, Gm04:5754268, Gm04:8295779, Gm04:39691731, Gm04:44725098, Gm06:410442, Gm06:11659627, Gm06:15457913, Gm06:16391391, Gm06:16499786, Gm06:16593381, Gm06:16670047, Gm06:16804435, Gm06:17498270, Gm06:18203964, Gm06:19743496, Gm06:19986645, Gm06:20007173, Gm06:20084642, Gm06:20501491, Gm06:21197184, Gm06:21500085, Gm06:22501610, Gm06:25700006, Gm06:28501458, Gm06:28671736, Gm06:29499523, Gm06:30203054, Gm06:31694650, Gm06:32503141, Gm06:33196184, Gm06:35509548, Gm06:37712913, Gm06:38467854, Gm06:39168136, Gm06:39533730, Gm06:40766974, Gm06:41476201, Gm06:42450296, Gm06:47500976, Gm06:47521797, Gm06:48475049, Gm06:49978151, Gm06:22700011, Gm01:759365, Gm02:4893148, Gm02:9714426, Gm02:11502780, Gm02:15446229, Gm02:33158449, Gm02:45776142, Gm17:16136646, Gm17:39804515, Gm15:50237460, Gm13:235439, Gm13:20365663, Gm13:20744030, Gm13:35174140, Gm18:305113, Gm18:58086324, Gm18:61591142, Gm18:61831970, Gm12:11512115, Gm20:39051858, Gm20:41216234, Gm16:4678569, Gm16:36524407, Gm19:47535046, Gm19:47826727, Gm19:48252040, Gm19:48638646, Gm19:50222676, Gm07:1141099, Gm07:1830296, Gm07:1923026, Gm07:2179883, Gm07:2310058, Gm07:2679749, Gm07:3009018, Gm07:4282676, Gm07:4319368, Gm07:4342479, Gm07:5576650, Gm07:6288899, Gm07:6340656, Gm07:6347675, Gm07:6614649, Gm07:6616695, Gm07:6623333, Gm07:6671535, Gm07:7096376, Gm07:7774056, Gm07:8674220, Gm07:35590550, Gm07:36459825, Gm07:36638366, Gm03:38491492, Gm03:39583405, Gm03:46209939, Gm10:43974548, Gm10:44725777, Gm10:44732850, Gm10:50495033, or any combination thereof.


In some examples, the method comprises detecting a haplotype or a marker profile comprising two or more of the polymorphisms linked to marker loci, said loci comprising a polymorphism selected from the group consisting of Gm05:30568085, Gm08:7464336, Gm08:15841570, Gm11:4674824, Gm11:5231500, Gm11:7847341, Gm14:46138053, Gm14:47331319, Gm04:5754268, Gm04:8295779, Gm04:39691731, Gm04:44725098, Gm06:410442, Gm06:11659627, Gm06:15457913, Gm06:16391391, Gm06:16499786, Gm06:16593381, Gm06:16670047, Gm06:16804435, Gm06:17498270, Gm06:18203964, Gm06:19743496, Gm06:19986645, Gm06:20007173, Gm06:20084642, Gm06:20501491, Gm06:21197184, Gm06:21500085, Gm06:22501610, Gm06:25700006, Gm06:28501458, Gm06:28671736, Gm06:29499523, Gm06:30203054, Gm06:31694650, Gm06:32503141, Gm06:33196184, Gm06:35509548, Gm06:37712913, Gm06:38467854, Gm06:39168136, Gm06:39533730, Gm06:40766974, Gm06:41476201, Gm06:42450296, Gm06:47500976, Gm06:47521797, Gm06:48475049, Gm06:49978151, Gm06:22700011, Gm01:759365, Gm02:4893148, Gm02:9714426, Gm02:11502780, Gm02:15446229, Gm02:33158449, Gm02:45776142, Gm17:16136646, Gm17:39804515, Gm15:50237460, Gm13:235439, Gm13:20365663, Gm13:20744030, Gm13:35174140, Gm18:305113, Gm18:58086324, Gm18:61591142, Gm18:61831970, Gm12:11512115, Gm20:39051858, Gm20:41216234, Gm16:4678569, Gm16:36524407, Gm19:47535046, Gm19:47826727, Gm19:48252040, Gm19:48638646, Gm19:50222676, Gm07:1141099, Gm07:1830296, Gm07:1923026, Gm07:2179883, Gm07:2310058, Gm07:2679749, Gm07:3009018, Gm07:4282676, Gm07:4319368, Gm07:4342479, Gm07:5576650, Gm07:6288899, Gm07:6340656, Gm07:6347675, Gm07:6614649, Gm07:6616695, Gm07:6623333, Gm07:6671535, Gm07:7096376, Gm07:7774056, Gm07:8674220, Gm07:35590550, Gm07:36459825, Gm07:36638366, Gm03:38491492, Gm03:39583405, Gm03:46209939, Gm10:43974548, Gm10:44725777, Gm10:44732850, Gm10:50495033, or any combination thereof. In other examples, the haplotype or marker profile comprises two or more polymorphisms described in FIGS. 1-19 and/or Tables 26-27. In some examples, the haplotype or the marker profile may comprise a combination of early alleles and late alleles.


In some examples, the at least one favorable allele of one or more marker loci is selected from the group consisting of an allele of a marker provided in Table 24. In some examples, the at least one favorable allele of one or more marker loci is selected from the group consisting of an early allele of a marker provided in Table 25, or any combination thereof.


Detecting may comprise isolating nucleic acids, amplifying the marker locus or a portion of the marker locus and detecting the resulting amplified marker amplicon. In particular examples, the amplifying comprises admixing an amplification primer or amplification primer pair and, optionally at least one nucleic acid probe, with a nucleic acid isolated from the first soybean plant or germplasm, wherein the primer or primer pair and optional probe is complementary or partially complementary to at least a portion of the marker locus and is capable of initiating DNA polymerization by a DNA polymerase using the soybean nucleic acid as a template; and, extending the primer or primer pair in a DNA polymerization reaction comprising a DNA polymerase and a template nucleic acid to generate at least one amplicon. In particular examples, the detection comprises real time PCR analysis.


In still further aspects, the information disclosed herein regarding marker alleles, haplotypes, and/or marker profiles can be used to aid in the creation and/or selection of breeding plants, lines, and populations for a preferred reproductive growth phenotype, including but not limited to at least one or more of a preferred time to initiation of flowering, early flowering, relative maturity, and/or length of reproductive growth. Further, the marker alleles, haplotypes, and/or marker profiles can be used for use in introgression into elite soybean germplasm, exotic soybean germplasm, or any other soybean germplasm. In some examples the marker alleles, haplotypes, and/or marker profiles can be used to aid in the creation and/or selection of breeding plants, lines, and populations for a preferred reproductive growth phenotype for a specific area of adaptation or target environment. Also provided is a method for introgressing a soybean QTL, marker, haplotype, and/or marker profile associated with at least a preferred time or length of at least one reproductive stage into soybean germplasm. Methods are provided wherein one or more loci, markers, haplotypes and/or marker profiles are used to create and/or select soybean plants having at a preferred time or length of at least one reproductive stage. Plants so created and selected can be used in a soybean breeding program. Through the process of introgression, the QTL, marker, haplotype, and/or marker profile associated with a preferred time or length of at least one reproductive stage, such as a preferred time to initiation of flowering, early flowering, and/or length of reproductive growth, is introduced from plants identified using marker-assisted selection (MAS) to other plants. According to the method, agronomically desirable plants and seeds can be produced containing the QTL, marker, haplotype, and/or marker profile associated with a preferred time or length of at least one reproductive stage from germplasm containing the QTL, marker, haplotype, and/or marker profile.


Also provided herein is a method for producing a soybean plant adapted for a preferred reproductive growth phenotype. First, donor soybean plants for a parental line containing at least one preferred reproductive growth QTL, marker, haplotype and/or marker profile are selected. According to the method, selection can be accomplished via MAS as explained herein. Selected plant material may represent, among others, an inbred line, a hybrid line, a heterogeneous population of soybean plants, or an individual plant. According to techniques well known in the art of plant breeding, this donor parental line is crossed with a second parental line. In some examples, the second parental line is a high yielding line. This cross produces a segregating plant population composed of genetically heterogeneous plants. Plants of the segregating plant population are screened for the tolerance QTL, marker, or haplotype. Further breeding may include, among other techniques, additional crosses with other lines, hybrids, backcrossing, or self-crossing. The result is a line of soybean plants that has a preferred reproductive growth phenotype and optionally also has other desirable traits from one or more other soybean lines.


Also provided is a method of soybean plant breeding comprising crossing at least two different soybean parent plants, wherein the parent soybean plants differ in time to R1 reproductive stage, obtaining a population of progeny soybean seed from said cross, genotyping the progeny soybean seed with at least one genetic marker, and, selecting a subpopulation comprising at least one soybean seed possessing a genotype for altered time to R1 reproductive stage, wherein the mean time to R1 reproductive stage of the selected subpopulation is altered as compared to the mean time to R1 reproductive stage of the non-selected progeny. In some examples the mean time to R1 reproductive stage of the selected subpopulation of progeny is at least 3-7 days different, or at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or more days different than the mean time to R1 reproductive stage of the non-selected progeny. In other examples the mean time to R1 reproductive stage of the selected subpopulation of progeny is at least 2, 3, 4, 5, 6, 7, or 8 days different than the mean time to R1 reproductive stage of the non-selected progeny. In some examples, the two different soybean parent plants also differ by maturity. The maturity groups of the parent plants may differ by one or more maturity subgroups, by one or more maturity groups, or by 1 or more days to maturity. In some examples the parents differ in maturity by at least 10 days, between 10 days-20 days, between 10 days-30 days, by at least 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, or 0.9 maturity subgroups, by at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 maturity groups. In some examples one parent is adapted for a northern growing region, and the second parent is not adapted for a northern growing region. In some examples the parent adapted for a northern growing region comprises a better reproductive growth phenotype for a northern growing region than the parent not adapted for a northern growing region. In some examples, the method further comprises obtaining progeny better adapted for a northern growing region.


In some examples the methods include identifying trait loci in a mixed defined plant population comprising multiple plant families (see, e.g., U.S. Pat. No. 6,399,855, herein incorporated by reference in its entirety). The method comprises quantifying a phenotypic trait across lines sampled from the population, identifying at least one genetic marker associated with the phenotypic trait by screening a set of markers and identifying the quantitative trait loci based on the association of the phenotypic trait and the genetic marker(s). In some examples the plant population consists of diploid plants, either hybrid or inbred. The phenotypic traits associated with the locus are quantitative such that a numerical value can be ascribed to the trait, and the association of the genetic loci and the phenotypic trait is determined through specified statistical models. In some examples the statistical models are linear models with fixed effects and random effects. In a other examples the statistical model is a mixed effects model.


Soybean plants, seeds, tissue cultures, variants and mutants having a preferred reproductive growth phenotype produced by the foregoing methods are also provided. Soybean plants, seeds, tissue cultures, variants and mutants comprising one or more of the marker loci, one or more of the favorable alleles, and/or one or more of the haplotypes and having a preferred reproductive growth phenotype are provided. Also provided are isolated nucleic acids, kits, and systems useful for the identification, prediction, and/or selection methods disclosed herein.


In some examples, the soybean plant, germplasm, plant part, or seed having a preferred reproductive growth phenotype further comprises one or more other traits of interest including but not limited to improved resistance to one or more ALS-inhibiting herbicides, a hydroxyphenylpyruvatedioxygenase inhibitor, a phosphanoglycine (including but not limited to a glyphosate), a sulfonamide, an imidazolinone, a bialaphos, a phosphinothricin, a metribuzin, a mesotrione, an isoxaflutole, an azafenidin, a butafenacil, a sulfosate, a glufosinate, a dicamba, a 2,4-D, and a protox inhibitor. In some examples, resistance to the herbicidal formulation is conferred by a transgene. In some examples, the plant or germplasm further comprises a trait selected from the group consisting of drought tolerance, stress tolerance, disease resistance, herbicide resistance, enhanced yield, modified oil, modified protein, tolerance to chlorotic conditions, and insect resistance, or any combination thereof. In some examples, the trait is selected from the group consisting of brown stem rot resistance, charcoal rot drought complex resistance, Fusarium resistance, Phytophthora resistance, stem canker resistance, sudden death syndrome resistance, Sclerotinia resistance, Cercospora resistance, anthracnose resistance, target spot resistance, frogeye leaf spot resistance, soybean cyst nematode resistance, root knot nematode resistance, rust resistance, high oleic content, low linolenic content, aphid resistance, stink bug resistance, and iron chlorosis deficiency tolerance, or any combination thereof. In some examples, one or more of the traits is conferred by one or more transgenes, by one or more native loci, or any combination thereof.


In another example a method of producing a cleaned soybean seed is provided, the method comprising cleaning a soybean seed having at least one locus conferred a preferred reproductive growth phenotype is provided. In some examples, the cleaned soybean seed has enhanced yield characteristics when compared to a soybean seed which has not been cleaned. Cleaned soybean seed produced by the methods are also provided.


In another example a method of producing a treated soybean seed is provided, the method comprising treating a soybean seed having at least one locus conferred a preferred reproductive growth phenotype is provided. In some examples, the seed treatment comprises a fungicide, an insecticide, or any combination thereof. In some examples the seed treatment comprises trifloxystrobin, metalaxyl, imidacloprid, Bacillus spp., and any combination thereof. In some examples the seed treatment comprises picoxystrobin, penthiopyrad, cyantraniliprole, chlorantraniliprole, and any combination thereof. In some examples, the seed treatment improves seed germination under normal and/or stress environments, early stand count, vigor, yield, root formation, nodulation, and any combination thereof when compared to a soybean seed which has not been treated. In some examples seed treatment reduces seed dust levels, insect damage, pathogen establishment and/or damage, plant virus infection and/or damage, and any combination thereof. Treated soybean seed produced by the methods are also provided.


It is to be understood that this invention is not limited to particular embodiments, which can, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting. Further, all publications referred to herein are incorporated by reference for the purpose cited to the same extent as if each was specifically and individually indicated to be incorporated by reference herein.


As used in this specification and the appended claims, terms in the singular and the singular forms “a,” “an,” and “the,” for example, include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to “plant,” “the plant,” or “a plant” also includes a plurality of plants; also, depending on the context, use of the term “plant” can also include genetically similar or identical progeny of that plant; use of the term “a nucleic acid” optionally includes, as a practical matter, many copies of that nucleic acid molecule; similarly, the term “probe” optionally (and typically) encompasses many similar or identical probe molecules.


Additionally, as used herein, “comprising” is to be interpreted as specifying the presence of the stated features, integers, steps, or components as referred to, but does not preclude the presence or addition of one or more features, integers, steps, or components, or groups thereof. Thus, for example, a kit comprising one pair of oligonucleotide primers may have two or more pairs of oligonucleotide primers. Additionally, the term “comprising” is intended to include examples encompassed by the terms “consisting essentially of” and “consisting of” Similarly, the term “consisting essentially of” is intended to include examples encompassed by the term “consisting of.”


Certain definitions used in the specification and claims are provided below. In order to provide a clear and consistent understanding of the specification and claims, including the scope to be given such terms, the following definitions are provided:


“Allele” means any of one or more alternative forms of a genetic sequence. In a diploid cell or organism, the two alleles of a given sequence typically occupy corresponding loci on a pair of homologous chromosomes. With regard to a SNP marker, allele refers to the specific nucleotide base present at that SNP locus in that individual plant.


The term “amplifying” in the context of nucleic acid amplification is any process whereby additional copies of a selected nucleic acid (or a transcribed form thereof) are produced. An “amplicon” is an amplified nucleic acid, e.g., a nucleic acid that is produced by amplifying a template nucleic acid by any available amplification method.


“Backcrossing” is a process in which a breeder crosses a progeny variety back to one of the parental genotypes one or more times.


The term “chromosome segment” designates a contiguous linear span of genomic DNA that resides in planta on a single chromosome. “Chromosome interval” refers to a chromosome segment defined by specific flanking marker loci.


“Cultivar” and “variety” are used synonymously and mean a group of plants within a species (e.g., Glycine max) that share certain genetic traits that separate them from other possible varieties within that species. Soybean cultivars are inbred lines produced after several generations of self-pollinations. Individuals within a soybean cultivar are homogeneous, nearly genetically identical, with most loci in the homozygous state.


An “elite line” is an agronomically superior line that has resulted from many cycles of breeding and selection for superior agronomic performance. Numerous elite lines are available and known to those of skill in the art of soybean breeding.


An “elite population” is an assortment of elite individuals or lines that can be used to represent the state of the art in terms of agronomically superior genotypes of a given crop species, such as soybean.


An “exotic soybean strain” or an “exotic soybean germplasm” is a strain or germplasm derived from a soybean not belonging to an available elite soybean line or strain of germplasm. In the context of a cross between two soybean plants or strains of germplasm, an exotic germplasm is not closely related by descent to the elite germplasm with which it is crossed. Most commonly, the exotic germplasm is not derived from any known elite line of soybean, but rather is selected to introduce novel genetic elements (typically novel alleles) into a breeding program.


A “genetic map” is a description of genetic association or linkage relationships among loci on one or more chromosomes (or linkage groups) within a given species, generally depicted in a diagrammatic or tabular form.


“Genotype” refers to the genetic constitution of a cell or organism.


“Germplasm” means the genetic material that comprises the physical foundation of the hereditary qualities of an organism. As used herein, germplasm includes seeds and living tissue from which new plants may be grown; or, another plant part, such as leaf, stem, pollen, or cells, that may be cultured into a whole plant. Germplasm resources provide sources of genetic traits used by plant breeders to improve commercial cultivars.


An individual is “homozygous” if the individual has only one type of allele at a given locus (e.g., a diploid individual has a copy of the same allele at a locus for each of two homologous chromosomes). An individual is “heterozygous” if more than one allele type is present at a given locus (e.g., a diploid individual with one copy each of two different alleles). The term “homogeneity” indicates that members of a group have the same genotype at one or more specific loci. In contrast, the term “heterogeneity” is used to indicate that individuals within the group differ in genotype at one or more specific loci.


“Introgression” means the entry or introduction of a gene, QTL, marker, haplotype, marker profile, trait, or trait locus from the genome of one plant into the genome of another plant.


The terms “label” and “detectable label” refer to a molecule capable of detection. A detectable label can also include a combination of a reporter and a quencher, such as are employed in FRET probes or TAQMAN® probes. The term “reporter” refers to a substance or a portion thereof that is capable of exhibiting a detectable signal, which signal can be suppressed by a quencher. The detectable signal of the reporter is, e.g., fluorescence in the detectable range. The term “quencher” refers to a substance or portion thereof that is capable of suppressing, reducing, inhibiting, etc., the detectable signal produced by the reporter. As used herein, the terms “quenching” and “fluorescence energy transfer” refer to the process whereby, when a reporter and a quencher are in close proximity, and the reporter is excited by an energy source, a substantial portion of the energy of the excited state nonradiatively transfers to the quencher where it either dissipates nonradiatively or is emitted at a different emission wavelength than that of the reporter.


A “line” or “strain” is a group of individuals of identical parentage that are generally inbred to some degree and that are generally homozygous and homogeneous at most loci (isogenic or near isogenic). A “subline” refers to an inbred subset of descendents that are genetically distinct from other similarly inbred subsets descended from the same progenitor. Traditionally, a subline has been derived by inbreeding the seed from an individual soybean plant selected at the F3 to F5 generation until the residual segregating loci are “fixed” or homozygous across most or all loci. Commercial soybean varieties (or lines) are typically produced by aggregating (“bulking”) the self-pollinated progeny of a single F3 to F5 plant from a controlled cross between two genetically different parents. While the variety typically appears uniform, the self-pollinating variety derived from the selected plant eventually (e.g., F8) becomes a mixture of homozygous plants that can vary in genotype at any locus that was heterozygous in the originally selected F3 to F5 plant. Marker-based sublines that differ from each other based on qualitative polymorphism at the DNA level at one or more specific marker loci are derived by genotyping a sample of seed derived from individual self-pollinated progeny derived from a selected F3-F5 plant. The seed sample can be genotyped directly as seed, or as plant tissue grown from such a seed sample. Optionally, seed sharing a common genotype at the specified locus (or loci) are bulked providing a subline that is genetically homogenous at identified loci important for a trait of interest (e.g., yield, tolerance, etc.).


“Linkage” refers to the tendency for alleles to segregate together more often than expected by chance if their transmission was independent. Typically, linkage refers to alleles on the same chromosome. Genetic recombination occurs with an assumed random frequency over the entire genome. Genetic maps are constructed by measuring the frequency of recombination between pairs of traits or markers, the lower the frequency of recombination, the greater the degree of linkage.


“Linkage disequilibrium” is a non-random association of 2 or more alleles wherein the 2 or more alleles occur together at a greater frequency than expected from their individual frequencies.


“Linkage group” refers to traits or markers that co-segregate. A linkage group generally corresponds to a chromosomal region containing genetic material that encodes the traits or markers.


“Locus” is a defined segment of DNA.


A “management zone” is any specific area within a field that responds to management practices in a similar way. There are various criteria and ways to create management zones, including but not limited to using soil data, climate information, geographic data, and/or crop information in conjunction with an algorithm to identify areas of a field that are most similar. The computer can take thousands of numbers and find areas that are alike, cluster them together, and generate a map. Different zones can be defined by using different data inputs, but weighting inputs differently, by assigning different criteria, or by identifying different management practices of interest. For example a management zone for irrigation is probably not identical to a management zone for weed management for the same field in the same year. Management zones may also use the same inputs and criteria and yet differ across years.


A “map location,” a “map position,” or a “relative map position” is an assigned location on a genetic map relative to linked genetic markers where a specified marker can be found within a given species. Map positions are generally provided in centimorgans (cM), unless otherwise indicated, genetic positions provided are based on the Glycine max consensus map v 4.0 as provided by Hyten et al. (2010) Crop Sci 50:960-968. A “physical position” or “physical location” is the position, typically in nucleotide bases, of a particular nucleotide, such as a SNP nucleotide, on the chromosome. Unless otherwise indicated, the physical position within the soybean genome provided is based on the Glyma 1.0 genome sequence described in Schmutz et al. (2010) Nature 463:178-183, available from the Phytozome website (phytozome-dot-net/soybean).


“Mapping” is the process of defining the association and relationships of loci through the use of genetic markers, populations segregating for the markers, and standard genetic principles of recombination frequency.


“Marker” or “molecular marker” is a term used to denote a nucleic acid or amino acid sequence that is sufficiently unique to characterize a specific locus on the genome. Any detectable polymorphic trait can be used as a marker so long as it is inherited differentially and exhibits non-random association with a phenotypic trait of interest.


“Marker assisted selection” refers to the process of selecting a desired trait or traits in a plant or plants by detecting one or more nucleic acids from the plant, where the nucleic acid is associated with or linked to the desired trait, and then selecting the plant or germplasm possessing those one or more nucleic acids.


“Maturity Group” is an agreed-on industry division of groups of varieties, based on the zones in which they are adapted primarily according to day length and/or latitude. Soybean varieties are grouped into 13 maturity groups, depending on the climate and latitude for which they are adapted. Soybean maturities are divided into relative maturity groups (denoted as 000, 00, 0, I, II, III, IV, V, VI, VII, VIII, IX, X, or 000, 00, 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10). These maturity groups are given numbers, with numbers 000, 00, 0 and 1 typically being adapted to Canada and the northern United States, groups VII, VIII and IX being grown in the southern regions, and Group X is tropical. Within a maturity group are sub-groups. A sub-group is a tenth of a relative maturity group (for example 1.3 would indicate a group 1 and subgroup 3). Within narrow comparisons, the difference of a tenth of a relative maturity group equates very roughly to a day difference in maturity at harvest.


A “mixed defined plant population” refers to a plant population containing many different families and lines of plants. Typically, the defined plant population exhibits a quantitative variability for a phenotype that is of interest. “Multiple plant families” refers to different families of related plants within a population.


“Haplotype” refers to a combination of particular alleles present within a particular plant's genome at two or more linked marker loci, for instance at two or more loci on a particular linkage group. For instance, in one example, two specific marker loci on LG A1 are used to define a haplotype for a particular plant. In still further examples, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or more linked marker loci are used to define a haplotype for a particular plant.


As used herein, a “marker profile” means a combination of particular alleles present within a particular plant's genome at two or more marker loci which are not linked, for instance two or more loci on two or more different linkage groups or two or more chromosomes. For instance, in one example, one marker locus on LG A1 and a marker locus on another linkage group are used to define a marker profile for a particular plant. In certain other examples a plant's marker profile comprises one or more haplotypes. In some examples, the marker profile encompasses two or more loci for the same trait, such as time to first flower. In other examples, the marker profile encompasses two or more loci associated with two or more traits of interest, such as time to first flower and a second trait of interest.


The term “plant” includes reference to an immature or mature whole plant, including a plant from which seed or grain or anthers have been removed. Seed or embryo that will produce the plant is also considered to be the plant.


“Plant parts” means any portion or piece of a plant, including leaves, stems, buds, roots, root tips, anthers, seed, grain, embryo, pollen, ovules, flowers, cotyledons, hypocotyls, pods, flowers, shoots, stalks, tissues, tissue cultures, cells, and the like.


“Polymorphism” means a change or difference between two related nucleic acids. A “nucleotide polymorphism” refers to a nucleotide that is different in one sequence when compared to a related sequence when the two nucleic acids are aligned for maximal correspondence.


“Polynucleotide,” “polynucleotide sequence,” “nucleic acid sequence,” “nucleic acid fragment,” and “oligonucleotide” are used interchangeably herein to indicate a polymer of nucleotides that is single- or multi-stranded, that optionally contains synthetic, non-natural, or altered RNA or DNA nucleotide bases. A DNA polynucleotide may be comprised of one or more strands of cDNA, genomic DNA, synthetic DNA, or mixtures thereof.


“Primer” refers to an oligonucleotide which is capable of acting as a point of initiation of nucleic acid synthesis or replication along a complementary strand when placed under conditions in which synthesis of a complementary strand is catalyzed by a polymerase. Typically, primers are about 10 to 30 nucleotides in length, but longer or shorter sequences can be employed. Primers may be provided in double-stranded form, though the single-stranded form is more typically used. A primer can further contain a detectable label, for example a 5′ end label.


“Probe” refers to an oligonucleotide that is complementary (though not necessarily fully complementary) to a polynucleotide of interest and forms a duplexed structure by hybridization with at least one strand of the polynucleotide of interest. Typically, probes are oligonucleotides from 10 to 50 nucleotides in length, but longer or shorter sequences can be employed. A probe can further contain a detectable label.


“Quantitative trait loci” or “QTL” refer to the genetic elements controlling a quantitative trait.


“Recombination frequency” is the frequency of a crossing over event (recombination) between two genetic loci. Recombination frequency can be observed by following the segregation of markers and/or traits during meiosis.


“Reproductive stage” is a description of the characteristics associated with various phases of reproductive growth.


“R1” is the first reproductive stage when soybean begins to bloom by producing the first flower.


“Time to R1 reproductive stage” is measured in days unless otherwise stated.


“Tolerance and “improved tolerance” are used interchangeably herein and refer to any type of increase in resistance or tolerance to, or any type of decrease in susceptibility. A “tolerant plant” or “tolerant plant variety” need not possess absolute or complete tolerance. Instead, a “tolerant plant,” “tolerant plant variety,” or a plant or plant variety with “improved tolerance” will have a level of resistance or tolerance which is higher than that of a comparable susceptible plant or variety.


“Self-crossing” or “self-pollination” or “selfing” is a process through which a breeder crosses a plant with itself; for example, a second-generation hybrid F2 with itself to yield progeny designated F2:3.


“SNP” or “single nucleotide polymorphism” means a sequence variation that occurs when a single nucleotide (A, T, C, or G) in the genome sequence is altered or variable. “SNP markers” exist when SNPs are mapped to sites on the soybean genome.


The term “yield” refers to the productivity per unit area of a particular plant product of commercial value. For example, yield of soybean is commonly measured in bushels of seed per acre or metric tons of seed per hectare per season. Yield is affected by both genetic and environmental factors.


An “isolated” or “purified” polynucleotide or polypeptide, or biologically active portion thereof, is substantially or essentially free from components that normally accompany or interact with the polynucleotide or polypeptide as found in its naturally occurring environment. Typically, an “isolated” polynucleotide is free of sequences (optimally protein encoding sequences) that naturally flank the polynucleotide (i.e., sequences located at the 5′ and 3′ ends of the polynucleotide) in the genomic DNA of the organism from which the polynucleotide is derived. For example, the isolated polynucleotide can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotide sequence that naturally flank the polynucleotide in genomic DNA of the cell from which the polynucleotide is derived. A polypeptide that is substantially free of cellular material includes preparations of polypeptides having less than about 30%, 20%, 10%, 5%, or 1% (by dry weight) of contaminating protein, culture media, or other chemical components.


Standard recombinant DNA and molecular cloning techniques used herein are well known in the art and are described more fully in Sambrook et al. Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989 (hereinafter “Sambrook”).


Soybean is a short-day crop and its development is largely determined by variety-specific day length requirements that initiate floral development. In other words, as the days grow shorter soybean will flower and enter into reproductive development stages. Due to this photoperiod requirement, days from planting until maturity cannot be accurately estimated for soybean due to variation in planting date and other environmental variations. After flowering, temperature drives development and the days until maturity can be estimated. The number of days from floral initiation (R1) until physiological maturity (R7) is usually independent of variety, but will vary slightly from year to year due to temperature differences between years. Although most sensitive to day length, soybean flowering will be delayed to some extent with later planting dates. However, later planted soybean initiates flowering during a warmer time of the year; therefore, post-flower development speeds up. The precise number of days from full flower (R2) until R7 cannot be predicted, but fairly reliable estimates can be derived from historical information (see, e.g., Holshouser (2010) “Days to Soybean Physiological Maturity,” Virginia Cooperative Extension, Bulletin 3009-1459; and, Heatherly (2005) “Soybean maturity group, planting date and development related,” Delta Farm Press, Oct. 14, 2005).


Soybean growth is often characterized as comprising two stages: vegetative growth and reproductive growth. The vegetative (V) stages are numbered according to how many fully-developed trifoliate leaves are present. The reproductive (R) stages begin at flowering and include pod development, seed development, and plant maturation. Soybean yield is impacted by genetics and environment, and various management practices can impact crop growth and yield in the context of the genetics of the crop. These stages are well-characterized and known (see, e.g., McWilliams et al. (1999) Soybean Growth & Management Quick Guide, A-1174, NDSU Extension Service), and summarized in the table below.













Vegetative Stages
Reproductive Stages


















VE
Emergence
R1
beginning bloom, 1st flower


VC
Cotyledon Stage
R2
full bloom, flower in top 2 nodes


V1
1st trifoliate leaf
R3
beginning pod, 3/16″ pod in top 4 nodes


V2
2nd trifoliate
R4
full pod, ¾″ pod in top 4 nodes


V3
3rd trifoliate
R5
⅛″ seed in top 4 nodes


Vn
nth trifoliate
R6
full size seed in top 4 nodes


V6
flowering should
R7
beginning maturity, one mature pod



start soon
R8
full maturity, 95% of pods are mature









The advent of molecular genetic markers has facilitated mapping and selection of agriculturally important traits in soybean. Markers tightly linked to tolerance genes are an asset in the rapid identification of tolerant soybean lines on the basis of genotype by the use of marker assisted selection (MAS). Introgressing tolerance genes into a desired cultivar would also be facilitated by using suitable markers.


Soybean cultivar development for preferred reproductive growth phenotype can be performed using classical breeding methods or by using marker assisted selection (MAS). Genetic markers for maturity or flowering time have been identified.


Provided are markers, haplotypes, and/or marker profiles associated with a preferred reproductive growth phenotype, as well as related primers and/or probes and methods for the use of any of the foregoing for identifying and/or selecting soybean plants with preferred time to floral initiation. A method for determining the presence or absence of at least one allele of a particular marker or haplotype associated with floral initiation comprises analyzing genomic DNA from a soybean plant or germplasm to determine if at least one, or a plurality, of such markers is present or absent and if present, determining the allelic form of the marker(s). If a plurality of markers on a single linkage group are investigated, this information regarding the markers present in the particular plant or germplasm can be used to determine a haplotype for that plant/germplasm.


In certain examples, plants or germplasm are identified that have at least one favorable allele, marker, and/or haplotype that positively correlate a preferred reproductive growth phenotype. However, in other examples, it is useful to identify alleles, markers, and/or haplotypes that negatively correlate with a preferred reproductive growth phenotype, for example to eliminate such plants or germplasm from subsequent rounds of breeding, or to use as controls or check. Soybean plants, cells, seed, varieties, and/or germplasm having preferred reproductive growth phenotype are provided.


Any marker associated with a preferred reproductive growth phenotype locus or QTL is useful. Further, any suitable type of marker can be used, including Restriction Fragment Length Polymorphisms (RFLPs), Single Sequence Repeats (SSRs), Target Region Amplification Polymorphisms (TRAPs), Isozyme Electrophoresis, Randomly Amplified Polymorphic DNAs (RAPDs), Arbitrarily Primed Polymerase Chain Reaction (AP-PCR), DNA Amplification Fingerprinting (DAF), Sequence Characterized Amplified Regions (SCARs), Amplified Fragment Length Polymorphisms (AFLPs), and Single Nucleotide Polymorphisms (SNPs). Additionally, other types of molecular markers known in the art or phenotypic traits may also be used as markers in the methods.


Markers that map closer to a QTL are generally used over markers that map farther from such a QTL. Marker loci are especially useful when they are closely linked to a locus associated with a preferred reproductive growth phenotype. Thus, in one example, marker loci display an inter-locus cross-over frequency of about 10% or less, about 9% or less, about 8% or less, about 7% or less, about 6% or less, about 5% or less, about 4% or less, about 3% or less, about 2% or less, about 1% or less, about 0.75% or less, about 0.5% or less, or about 0.25% or less with a QTL to which they are linked. Thus, the loci are separated from the QTL to which they are linked by about 10 cM, 9 cM, 8 cM, 7 cM, 6 cM, 5 cM, 4 cM, 3 cM, 2 cM, 1 cM, 0.75 cM, 0.5 cM, or 0.25 cM or less.


In certain examples, multiple marker loci that collectively make up a haplotype and/or a marker profile are investigated, for instance 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more marker loci.


In addition to the markers discussed herein, information regarding useful soybean markers can be found, for example, on the USDA's Soybase website, available at soybase.org. A number of soybean markers have been mapped and linkage groups created, as described in Cregan et al. (1999) Crop Sci 39:1464-90, Choi et al. (2007) Genetics 176:685-96, and Hyten, et al. (2010) Crop Sci 50:960-968, each of which is herein incorporated by reference in its entirety, including any supplemental materials associated with the publication. Many soybean markers are publicly available at the USDA affiliated soybase website (at soybase-dot-org). One of skill in the art will recognize that the identification of favorable marker alleles may be germplasm-specific. One of skill will also recognize that methods for identifying the favorable alleles are routine and well known in the art, and furthermore, that the identification and use of such favorable alleles is well within the scope of the invention.


The use of marker assisted selection (MAS) to select a soybean plant or germplasm based upon detection of a particular marker or haplotype of interest is provided. For instance, in certain examples, a soybean plant or germplasm possessing a certain predetermined favorable marker allele or haplotype will be selected via MAS. Using MAS, soybean plants or germplasm can be selected for markers or marker alleles that positively correlate with tolerance, without actually raising soybean and measuring for tolerance (or, contrawise, soybean plants can be selected against if they possess markers that negatively correlate with tolerance). MAS is a powerful tool to select for desired phenotypes and for introgressing desired traits into cultivars of soybean (e.g., introgressing desired traits into elite lines). MAS is easily adapted to high throughput molecular analysis methods that can quickly screen large numbers of plant or germplasm genetic material for the markers of interest and is much more cost effective than raising and observing plants for visible traits.


In some examples, molecular markers are detected using a suitable amplification-based detection method. Typical amplification methods include various polymerase based replication methods, including the polymerase chain reaction (PCR), ligase mediated methods, such as the ligase chain reaction (LCR), and RNA polymerase based amplification (e.g., by transcription) methods. In these types of methods, nucleic acid primers are typically hybridized to the conserved regions flanking the polymorphic marker region. In certain methods, nucleic acid probes that bind to the amplified region are also employed. In general, synthetic methods for making oligonucleotides, including primers and probes, are well known in the art. For example, oligonucleotides can be synthesized chemically according to the solid phase phosphoramidite triester method described by Beaucage & Caruthers (1981) Tetrahedron Letts 22:1859-1862, e.g., using a commercially available automated synthesizer, e.g., as described in Needham-VanDevanter et al. (1984) Nucl Acids Res 12:6159-6168. Oligonucleotides, including modified oligonucleotides, can also be ordered from a variety of commercial sources known to persons of skill in the art.


It will be appreciated that suitable primers and probes to be used can be designed using any suitable method. It is not intended that the invention be limited to any particular primer, primer pair, or probe. For example, primers can be designed using any suitable software program, such as LASERGENE® or Primer3.


The primers are not limited to generating an amplicon of any particular size. For example, the primers used to amplify the marker loci and alleles herein are not limited to amplifying the entire region of the relevant locus. In some examples, marker amplification produces an amplicon at least 20 nucleotides in length, or alternatively, at least 50 nucleotides in length, or alternatively, at least 100 nucleotides in length, or alternatively, at least 200 nucleotides in length, or alternatively, at least 300 nucleotides in length, or alternatively, at least 400 nucleotides in length, or alternatively, at least 500 nucleotides in length, or alternatively, at least 1000 nucleotides in length, or alternatively, at least 2000 nucleotides in length or more.


PCR, RT-PCR, and LCR are common amplification and amplification-detection methods for amplifying nucleic acids of interest (e.g., those comprising marker loci), facilitating detection of the markers. Details regarding the use of these and other amplification methods are well known in the art and can be found in any of a variety of standard texts. Details for these techniques can also be found in numerous references, such as Mullis et al. (1987) U.S. Pat. No. 4,683,202; Arnheim & Levinson (1990) C&EN 36-47; Kwoh et al. (1989) Proc Natl Acad Sci USA 86:1173; Guatelli et al. (1990) Proc Natl Acad Sci USA 87:1874; Lomell et al. (1989) J Clin Chem 35:1826; Landegren et al. (1988) Science 241:1077-1080; Van Brunt (1990) Biotechnology 8:291-294; Wu & Wallace (1989) Gene 4:560; Barringer et al. (1990) Gene 89:117; and Sooknanan & Malek (1995) Biotechnology 13:563-564.


Such nucleic acid amplification techniques can be applied to amplify and/or detect nucleic acids of interest, such as nucleic acids comprising marker loci. Amplification primers for amplifying useful marker loci and suitable probes to detect useful marker loci or to genotype alleles, such as SNP alleles, are provided. For example, exemplary primers and probes are provided in Table 26. However, one of skill will immediately recognize that other primer and probe sequences could also be used. For instance, primers to either side of the given primers can be used in place of the given primers, so long as the primers can amplify a region that includes the allele to be detected, as can primers and probes directed to other marker loci. Further, it will be appreciated that the precise probe to be used for detection can vary, e.g., any probe that can identify the region of a marker amplicon to be detected can be substituted for those examples provided herein. Further, the configuration of the amplification primers and detection probes can, of course, vary. Thus, the compositions and methods are not limited to the primers and probes specifically recited herein.


In certain examples, probes will possess a detectable label. Any suitable label can be used with a probe. Detectable labels suitable for use with nucleic acid probes include, for example, any composition detectable by spectroscopic, radioisotopic, photochemical, biochemical, immunochemical, electrical, optical, or chemical means. Useful labels include biotin for staining with labeled streptavidin conjugate, magnetic beads, fluorescent dyes, radiolabels, enzymes, and colorimetric labels. Other labels include ligands, which bind to antibodies labeled with fluorophores, chemiluminescent agents, and enzymes. A probe can also constitute radiolabelled PCR primers that are used to generate a radiolabelled amplicon. Labeling strategies for labeling nucleic acids and their corresponding detection strategies can be found, e.g., in Haugland (1996) Handbook of Fluorescent Probes and Research Chemicals Sixth Edition by Molecular Probes, Inc. (Eugene, Oreg.); or Haugland (2001) Handbook of Fluorescent Probes and Research Chemicals Eighth Edition by Molecular Probes, Inc. (Eugene, Oreg.).


Detectable labels may also include reporter-quencher pairs, such as are employed in Molecular Beacon and TAQMAN® probes. The reporter may be a fluorescent organic dye modified with a suitable linking group for attachment to the oligonucleotide, such as to the terminal 3′ carbon or terminal 5′ carbon. The quencher may also be an organic dye, which may or may not be fluorescent. Generally, whether the quencher is fluorescent or simply releases the transferred energy from the reporter by nonradiative decay, the absorption band of the quencher should at least substantially overlap the fluorescent emission band of the reporter to optimize the quenching. Non-fluorescent quenchers or dark quenchers typically function by absorbing energy from excited reporters, but do not release the energy radiatively.


Selection of appropriate reporter-quencher pairs for particular probes may be undertaken in accordance with known techniques. Fluorescent and dark quenchers and their relevant optical properties from which exemplary reporter-quencher pairs may be selected are listed and described, for example, in Berlman, Handbook of Fluorescence Spectra of Aromatic Molecules, 2nd ed., Academic Press, New York, 1971, the content of which is incorporated herein by reference. Examples of modifying reporters and quenchers for covalent attachment via common reactive groups that can be added to an oligonucleotide in the present invention may be found, for example, in Haugland (2001) Handbook of Fluorescent Probes and Research Chemicals Eighth Edition by Molecular Probes, Inc. (Eugene, Oreg.), the content of which is incorporated herein by reference.


In certain examples, reporter-quencher pairs are selected from xanthene dyes including fluorescein and rhodamine dyes. Many suitable forms of these compounds are available commercially with substituents on the phenyl groups, which can be used as the site for bonding or as the bonding functionality for attachment to an oligonucleotide. Another useful group of fluorescent compounds for use as reporters is the naphthylamines, having an amino group in the alpha or beta position. Included among such naphthylamino compounds are 1-dimethylaminonaphthyl-5 sulfonate, 1-anilino-8-naphthalene sulfonate and 2-p-touidinyl-6-naphthalene sulfonate. Other dyes include 3-phenyl-7-isocyanatocoumarin; acridines such as 9-isothiocyanatoacridine; N-(p-(2-benzoxazolyl)phenyl)maleimide; benzoxadiazoles; stilbenes; pyrenes and the like. In certain other examples, the reporters and quenchers are selected from fluorescein and rhodamine dyes. These dyes and appropriate linking methodologies for attachment to oligonucleotides are well known in the art.


Suitable examples of reporters may be selected from dyes such as SYBR green, 5-carboxyfluorescein (5-FAM™ available from Applied Biosystems of Foster City, Calif.), 6-carboxyfluorescein (6-FAM), tetrachloro-6-carboxyfluorescein (TET), 2,7-dimethoxy-4,5-dichloro-6-carboxyfluorescein, hexachloro-6-carboxyfluorescein (HEX), 6-carboxy-2′,4,7,7′-tetrachlorofluorescein (6-TET™ available from Applied Biosystems), carboxy-X-rhodamine (ROX), 6-carboxy-4′,5′-dichloro-2′,7′-dimethoxyfluorescein (6-JOE™ available from Applied Biosystems), VIC™ dye products available from Molecular Probes, Inc., NED™ dye products available from available from Applied Biosystems, and the like. Suitable examples of quenchers may be selected from 6-carboxy-tetramethyl-rhodamine, 4-(4-dimethylaminophenylazo)benzoic acid (DABYL), tetramethylrhodamine (TAMRA), BHQ-0™, BHQ-1™, BHQ-2™, and BHQ-3™, each of which are available from Biosearch Technologies, Inc. of Novato, Calif., QSY-7™, QSY-9™, QSY-21™ and QSY-35™, each of which are available from Molecular Probes, Inc., and the like.


In one aspect, real time PCR or LCR is performed on the amplification mixtures described herein, e.g., using molecular beacons or TAQMAN® probes. A molecular beacon (MB) is an oligonucleotide that, under appropriate hybridization conditions, self-hybridizes to form a stem and loop structure. The MB has a label and a quencher at the termini of the oligonucleotide; thus, under conditions that permit intra-molecular hybridization, the label is typically quenched (or at least altered in its fluorescence) by the quencher. Under conditions where the MB does not display intra-molecular hybridization (e.g., when bound to a target nucleic acid, such as to a region of an amplicon during amplification), the MB label is unquenched. Details regarding standard methods of making and using MBs are well established in the literature and MBs are available from a number of commercial reagent sources. See also, e.g., Leone et al. (1995) Nucl Acids Res 26:2150-2155; Tyagi & Kramer (1996) Nat Biotechnol 14:303-308; Blok & Kramer (1997) Mol Cell Probes 11:187-194; Hsuih et al. (1997) J Clin Microbiol 34:501-507; Kostrikis et al. (1998) Science 279:1228-1229; Sokol et al. (1998) Proc Natl Acad Sci USA 95:11538-11543; Tyagi et al. (1998) Nat Biotechnol 16:49-53; Bonnet et al. (1999) Proc Natl Acad Sci USA 96:6171-6176; Fang et al. (1999) J Am Chem Soc 121:2921-2922; Marras et al. (1999) Genet Anal Biomol Eng 14:151-156; and, Vet et al. (1999) Proc Natl Acad Sci USA 96:6394-6399. Additional details regarding MB construction and use are also found in the patent literature, e.g., U.S. Pat. Nos. 5,925,517; 6,150,097; and 6,037,130.


Another real-time detection method is the 5′-exonuclease detection method, also called the TAQMAN® assay, as set forth in U.S. Pat. Nos. 5,804,375; 5,538,848; 5,487,972; and 5,210,015, each of which is hereby incorporated by reference in its entirety. In the TAQMAN® assay, a modified probe, typically 10-30 nucleotides in length, is employed during PCR which binds intermediate to or between the two members of the amplification primer pair. The modified probe possesses a reporter and a quencher and is designed to generate a detectable signal to indicate that it has hybridized with the target nucleic acid sequence during PCR. As long as both the reporter and the quencher are on the probe, the quencher stops the reporter from emitting a detectable signal. However, as the polymerase extends the primer during amplification, the intrinsic 5′ to 3′ nuclease activity of the polymerase degrades the probe, separating the reporter from the quencher, and enabling the detectable signal to be emitted. Generally, the amount of detectable signal generated during the amplification cycle is proportional to the amount of product generated in each cycle.


It is well known that the efficiency of quenching is a strong function of the proximity of the reporter and the quencher, i.e., as the two molecules get closer, the quenching efficiency increases. As quenching is strongly dependent on the physical proximity of the reporter and quencher, the reporter and the quencher are typically attached to the probe within a few nucleotides of one another, usually within 30 nucleotides of one another, or within 6 to 16 nucleotides. Typically, this separation is achieved by attaching one member of a reporter-quencher pair to the 5′ end of the probe and the other member to a nucleotide about 6 to 16 nucleotides away, in some cases at the 3′ end of the probe.


Separate detection probes can also be omitted in amplification/detection methods, e.g., by performing a real time amplification reaction that detects product formation by modification of the relevant amplification primer upon incorporation into a product, incorporation of labeled nucleotides into an amplicon, or by monitoring changes in molecular rotation properties of amplicons as compared to unamplified precursors (e.g., by fluorescence polarization).


One example of a suitable real-time detection technique that does not use a separate probe that binds intermediate to the two primers is the KASPar detection system/method, which is well known in the art. In KASPar, two allele specific primers are designed such that the 3′ nucleotide of each primer hybridizes to the polymorphic base. For example, if the SNP is an A/C polymorphism, one of the primers would have an “A” in the 3′ position, while the other primer would have a “C” in the 3′ position. Each of these two allele specific primers also has a unique tail sequence on the 5′ end of the primer. A common reverse primer is employed that amplifies in conjunction with either of the two allele specific primers. Two 5′ fluor-labeled reporter oligos are also included in the reaction mix, one designed to interact with each of the unique tail sequences of the allele-specific primers. Lastly, one quencher oligo is included for each of the two reporter oligos, the quencher oligo being complementary to the reporter oligo and being able to quench the fluor signal when bound to the reporter oligo. During PCR, the allele-specific primers and reverse primers bind to complementary DNA, allowing amplification of the amplicon to take place. During a subsequent cycle, a complementary nucleic acid strand containing a sequence complementary to the unique tail sequence of the allele-specific primer is created. In a further cycle, the reporter oligo interacts with this complementary tail sequence, acting as a labeled primer. Thus, the product created from this cycle of PCR is a fluorescently-labeled nucleic acid strand. Because the label incorporated into this amplification product is specific to the allele specific primer that resulted in the amplification, detecting the specific fluor presenting a signal can be used to determine the SNP allele that was present in the sample.


Further, it will be appreciated that amplification is not a requirement for marker detection—for example, one can directly detect unamplified genomic DNA simply by performing a Southern blot on a sample of genomic DNA. Procedures for performing Southern blotting, amplification e.g., (PCR, LCR, or the like), and many other nucleic acid detection methods are well established and are taught, e.g., in Sambrook et al. Molecular Cloning—A Laboratory Manual (3d ed.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 2000 (“Sambrook”); Current Protocols in Molecular Biology, F. M. Ausubel et al., eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (supplemented through 2002) (“Ausubel”); and, PCR Protocols A Guide to Methods and Applications (Innis et al., eds) Academic Press Inc. San Diego, Calif. (1990) (“Innis”). Additional details regarding detection of nucleic acids in plants can also be found, e.g., in Plant Molecular Biology (1993) Croy (ed.) BIOS Scientific Publishers, Inc.


Other techniques for detecting SNPs can also be employed, such as allele specific hybridization (ASH) or nucleic acid sequencing techniques. ASH technology is based on the stable annealing of a short, single-stranded, oligonucleotide probe to a completely complementary single-stranded target nucleic acid. Detection is via an isotopic or non-isotopic label attached to the probe. For each polymorphism, two or more different ASH probes are designed to have identical DNA sequences except at the polymorphic nucleotides. Each probe will have exact homology with one allele sequence so that the range of probes can distinguish all the known alternative allele sequences. Each probe is hybridized to the target DNA. With appropriate probe design and hybridization conditions, a single-base mismatch between the probe and target DNA will prevent hybridization.


Isolated polynucleotide or fragments thereof are capable of specifically hybridizing to other nucleic acid molecules under appropriate conditions. In one example, the nucleic acid molecules comprise any of SEQ ID NOs: 1-512, complements thereof and fragments thereof. In another aspect, the nucleic acid molecules of the present invention include nucleic acid molecules that hybridize, for example, under high or low stringency, substantially homologous sequences, or that have both to these molecules. Conventional stringency conditions are described by Sambrook et al. In: Molecular Cloning, A Laboratory Manual, 2nd Edition, Cold Spring Harbor Press, Cold Spring Harbor, N.Y. (1989)), and by Haymes et al. In: Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington, D.C. (1985). Departures from complete complementarity are therefore permissible, as long as such departures do not completely preclude the capacity of the molecules to form a double-stranded structure. In order for a nucleic acid molecule to serve as a primer or probe it need only be sufficiently complementary in sequence to be able to form a stable double-stranded structure under the particular solvent and salt concentrations employed. Appropriate stringency conditions that promote DNA hybridization are, for example, 6.0× sodium chloride/sodium citrate (SSC) at about 45° C., followed by a wash of 2.0×SSC at 50° C., are known to those skilled in the art or can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y., 1989, 6.3.1-6.3.6. For example, the salt concentration in the wash step can be selected from a low stringency of about 2.0×SSC at 50° C. to a high stringency of about 0.2×SSC at 50° C. In addition, the temperature in the wash step can be increased from low stringency conditions at room temperature, about 22° C., to high stringency conditions at about 65° C. Both temperature and salt may be varied, or either the temperature or the salt concentration may be held constant while the other variable is changed.


In some examples, an a marker locus will specifically hybridize to one or more of the nucleic acid molecules set forth in SEQ ID NOs: 1-512 or complements thereof or fragments of either under moderately stringent conditions, for example at about 2.0×SSC and about 65° C. In an aspect, a nucleic acid of the present invention will specifically hybridize to one or more SEQ ID NOs: 1-512 or complements or fragments of either under high stringency conditions.


In some examples, a marker associated with a preferred reproductive growth phenotype comprises any one of SEQ ID NOs: 1-512 or complements or fragments thereof. In other examples, a marker has between 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity to any one of SEQ ID NOs: 1-512 or complements or fragments thereof. Unless otherwise stated, percent sequence identity is determined using the GAP program is default parameters for nucleic acid alignment (Accelrys, San Diego, Calif., USA).


Traits or markers are considered herein to be linked if they generally co-segregate. A 1/100 probability of recombination per generation is defined as a map distance of 1.0 centiMorgan (1.0 cM). The genetic elements or genes located on a single chromosome segment are physically linked. In some embodiments, the two loci are located in close proximity such that recombination between homologous chromosome pairs does not occur between the two loci during meiosis with high frequency, e.g., such that linked loci co-segregate at least about 90% of the time, e.g., 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, 99.75%, or more of the time. The genetic elements located within a chromosome segment are also genetically linked, typically within a genetic recombination distance of less than or equal to 50 centimorgans (cM), e.g., about 49, 40, 30, 20, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, 0.75, 0.5, or 0.25 cM or less. That is, two genetic elements within a single chromosome segment undergo recombination during meiosis with each other at a frequency of less than or equal to about 50%, e.g., about 49%, 40%, 30%, 20%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.75%, 0.5%, or 0.25% or less. Closely linked markers display a cross over frequency with a given marker of about 10% or less (the given marker is within about 10 cM of a closely linked marker). Put another way, closely linked loci co-segregate at least about 90% of the time. With regard to physical position on a chromosome, closely linked markers can be separated, for example, by about 1 megabase (Mb; 1 million nucleotides), about 500 kilobases (Kb; 1000 nucleotides), about 400 Kb, about 300 Kb, about 200 Kb, about 100 Kb, about 50 Kb, about 25 Kb, about 10 Kb, about 5 Kb, about 4 Kb, about 3 Kb, about 2 Kb, about 1 Kb, about 500 nucleotides, about 250 nucleotides, or less.


When referring to the relationship between two genetic elements, such as a genetic element contributing to tolerance and a proximal marker, “coupling” phase linkage indicates the state where the “favorable” allele at the tolerance locus is physically associated on the same chromosome strand as the “favorable” allele of the respective linked marker locus. In coupling phase, both favorable alleles are inherited together by progeny that inherit that chromosome strand. In “repulsion” phase linkage, the “favorable” allele at the locus of interest (e.g., a QTL for tolerance) is physically linked with an “unfavorable” allele at the proximal marker locus, and the two “favorable” alleles are not inherited together (i.e., the two loci are “out of phase” with each other).


Markers are used to define a specific locus on the soybean genome. Each marker is therefore an indicator of a specific segment of DNA, having a unique nucleotide sequence. Map positions provide a measure of the relative positions of particular markers with respect to one another. When a trait is stated to be linked to a given marker it will be understood that the actual DNA segment whose sequence affects the trait generally co-segregates with the marker. More precise and definite localization of a trait can be obtained if markers are identified on both sides of the trait. By measuring the appearance of the marker(s) in progeny of crosses, the existence of the trait can be detected by relatively simple molecular tests without actually evaluating the appearance of the trait itself, which can be difficult and time-consuming because the actual evaluation of the trait requires growing plants to a stage and/or under environmental conditions where the trait can be expressed. Molecular markers have been widely used to determine genetic composition in soybeans.


Favorable genotypes associated with at least trait of interest may be identified by one or more methodologies. In some examples one or more markers are used, including but not limited to AFLPs, RFLPs, ASH, SSRs, SNPs, indels, padlock probes, molecular inversion probes, microarrays, sequencing, and the like. In some methods, a target nucleic acid is amplified prior to hybridization with a probe. In other cases, the target nucleic acid is not amplified prior to hybridization, such as methods using molecular inversion probes (see, for example Hardenbol et al. (2003) Nat Biotech 21:673-678). In some examples, the genotype related to a specific trait is monitored, while in other examples, a genome-wide evaluation including but not limited to one or more of marker panels, library screens, association studies, microarrays, gene chips, expression studies, or sequencing such as whole-genome resequencing and genotyping-by-sequencing (GBS) may be used. In some examples, no target-specific probe is needed, for example by using sequencing technologies, including but not limited to next-generation sequencing methods (see, for example, Metzker (2010) Nat Rev Genet 11:31-46; and, Egan et al. (2012) Am J Bot 99:175-185) such as sequencing by synthesis (e.g., Roche 454 pyrosequencing, Illumina Genome Analyzer, and Ion Torrent PGM or Proton systems), sequencing by ligation (e.g., SOLiD from Applied Biosystems, and Polnator system from Azco Biotech), and single molecule sequencing (SMS or third-generation sequencing) which eliminate template amplification (e.g., Helicos system, and PacBio RS system from Pacific BioSciences). Further technologies include optical sequencing systems (e.g., Starlight from Life Technologies), and nanopore sequencing (e.g., GridION from Oxford Nanopore Technologies). Each of these may be coupled with one or more enrichment strategies for organellar or nuclear genomes in order to reduce the complexity of the genome under investigation via PCR, hybridization, restriction enzyme (see, e.g., Elshire et al. (2011) PLoS ONE 6:e19379), and expression methods. In some examples, no reference genome sequence is needed in order to complete the analysis.


In some examples, markers within 1 cM, 5 cM, 10 cM, 15 cM, or 30 cM of SEQ ID NO: 1-512 are provided. Similarly, one or more markers mapped within 1, 5, 10, 20 and 30 cM or less from the markers provided can be used for the selection or introgression of the region associated with a preferred reproductive growth phenotype. In other examples, any marker that is linked with SEQ ID NOs: 1-512 and associated with a preferred reproductive growth phenotype is provided. In other examples, markers provided include a substantially a nucleic acid molecule within 5 kb, 10 kb, 20 kb, 30 kb, 100 kb, 500 kb, 1,000 kb, 10,000 kb, 25,000 kb, or 50,000 kb of a marker selected from the group consisting of SEQ ID NOs: 1-512.


Real-time amplification assays, including MB or TAQMAN® based assays, are especially useful for detecting SNP alleles. In such cases, probes are typically designed to bind to the amplicon region that includes the SNP locus, with one allele-specific probe being designed for each possible SNP allele. For instance, if there are two known SNP alleles for a particular SNP locus, “A” or “C,” then one probe is designed with an “A” at the SNP position, while a separate probe is designed with a “C” at the SNP position. While the probes are typically identical to one another other than at the SNP position, they need not be. For instance, the two allele-specific probes could be shifted upstream or downstream relative to one another by one or more bases. However, if the probes are not otherwise identical, they should be designed such that they bind with approximately equal efficiencies, which can be accomplished by designing under a strict set of parameters that restrict the chemical properties of the probes. Further, a different detectable label, for instance a different reporter-quencher pair, is typically employed on each different allele-specific probe to permit differential detection of each probe. In certain examples, each allele-specific probe for a certain SNP locus is 13-18 nucleotides in length, dual-labeled with a florescence quencher at the 3′ end and either the 6-FAM (6-carboxyfluorescein) or VIC (4,7,2′-trichloro-7′-phenyl-6-carboxyfluorescein) fluorophore at the 5′ end.


To effectuate SNP allele detection, a real-time PCR reaction can be performed using primers that amplify the region including the SNP locus, the reaction being performed in the presence of all allele-specific probes for the given SNP locus. By then detecting signal for each detectable label employed and determining which detectable label(s) demonstrated an increased signal, a determination can be made of which allele-specific probe(s) bound to the amplicon and, thus, which SNP allele(s) the amplicon possessed. For instance, when 6-FAM- and VIC-labeled probes are employed, the distinct emission wavelengths of 6-FAM (518 nm) and VIC (554 nm) can be captured. A sample that is homozygous for one allele will have fluorescence from only the respective 6-FAM or VIC fluorophore, while a sample that is heterozygous at the analyzed locus will have both 6-FAM and VIC fluorescence.


Introgression of a preferred reproductive growth phenotype into a soybean germplasm having an undesired or less preferred reproductive growth phenotype is provided. Any method for introgressing a QTL or marker into soybean plants known to one of skill in the art can be used. Typically, a first soybean germplasm that contains a preferred reproductive growth phenotype derived from a particular marker or haplotype and a second soybean germplasm that lacks such a reproductive growth phenotype derived from the marker or haplotype are provided. The first soybean germplasm may be crossed with the second soybean germplasm to provide progeny soybean germplasm. These progeny germplasm are screened to determine the presence a preferred reproductive growth phenotype derived from the marker or haplotype, and progeny that tests positive for the presence of tolerance derived from the marker or haplotype are selected as being soybean germplasm into which the marker or haplotype has been introgressed. Methods for performing such screening are well known in the art and any suitable method can be used.


One application of MAS is to use the tolerance markers or haplotypes to increase the efficiency of an introgression or backcrossing effort aimed at introducing a tolerance trait into a desired (typically high yielding) background. In marker assisted backcrossing of specific markers from a donor source, e.g., to an elite genetic background, one selects among backcross progeny for the donor trait and then uses repeated backcrossing to the elite line to reconstitute as much of the elite background's genome as possible.


Thus, the markers and methods can be utilized to guide marker assisted selection or breeding of soybean varieties with the desired complement (set) of allelic forms of chromosome segments associated with superior agronomic performance (tolerance, along with any other available markers for yield, disease tolerance, etc.). Any of the disclosed marker alleles or haplotypes can be introduced into a soybean line via introgression, by traditional breeding (or introduced via transformation, or both) to yield a soybean plant with superior agronomic performance. The number of alleles associated with tolerance that can be introduced or be present in a soybean plant ranges from 1 to the number of alleles disclosed herein, each integer of which is incorporated herein as if explicitly recited.


This also provides a method of making a progeny soybean plant and these progeny soybean plants, per se. The method comprises crossing a first parent soybean plant with a second soybean plant and growing the female soybean plant under plant growth conditions to yield soybean plant progeny. Methods of crossing and growing soybean plants are well within the ability of those of ordinary skill in the art. Such soybean plant progeny can be assayed for alleles associated with tolerance and, thereby, the desired progeny selected. Such progeny plants or seed can be sold commercially for soybean production, used for food, processed to obtain a desired constituent of the soybean, or further utilized in subsequent rounds of breeding. At least one of the first or second soybean plants is a soybean plant that comprises at least one of the markers or haplotypes associated with tolerance, such that the progeny are capable of inheriting the marker or haplotype.


Often, a method is applied to at least one related soybean plant such as from progenitor or descendant lines in the subject soybean plants pedigree such that inheritance of the desired tolerance can be traced. The number of generations separating the soybean plants being subject to the methods will generally be from 1 to 20, commonly 1 to 5, and typically 1, 2, or 3 generations of separation, and quite often a direct descendant or parent of the soybean plant will be subject to the method (i.e., 1 generation of separation).


Genetic diversity is important for long-term genetic gain in any breeding program. With limited diversity, genetic gain will eventually plateau when all of the favorable alleles have been fixed within the elite population. One objective is to incorporate diversity into an elite pool without losing the genetic gain that has already been made and with the minimum possible investment. MAS provides an indication of which genomic regions and which favorable alleles from the original ancestors have been selected for and conserved over time, facilitating efforts to incorporate favorable variation from exotic germplasm sources (parents that are unrelated to the elite gene pool) in the hopes of finding favorable alleles that do not currently exist in the elite gene pool.


For example, the markers, haplotypes, primers, and probes can be used for MAS involving crosses of elite lines to exotic soybean lines (elite×exotic) by subjecting the segregating progeny to MAS to maintain major yield alleles, along with the tolerance marker alleles herein.


As an alternative to standard breeding methods of introducing traits of interest into soybean (e.g., introgression), transgenic approaches can also be used to create transgenic plants with the desired traits. In these methods, exogenous nucleic acids that encode a desired QTL, marker, or haplotype are introduced into target plants or germplasm. For example, a nucleic acid that codes for a preferred reproductive growth trait is cloned, e.g., via positional cloning, and introduced into a target plant or germplasm.


Experienced plant breeders can recognize the time to R1 reproductive stage trait for soybean plants in the field, and can select the individuals or populations for breeding purposes or for propagation with the desired phenotype. In this context, the plant breeder recognizes “preferred” soybean plants. However, time to R1 is a phenotypic spectrum consisting of extremes in timing, as well as a continuum of intermediate phenotypes. Evaluation of these intermediate phenotypes using reproducible assays are of value to scientists who seek to identify genetic loci that impart a specific time to R1 stage, to conduct marker assisted selection for populations, and to use introgression techniques to breed a specific R1 trait into an elite soybean line, for example.


In some examples, a kit for detecting markers or haplotypes, and/or for correlating the markers or haplotypes with a desired phenotype (e.g., a preferred reproductive growth phenotype), are provided. Thus, a typical kit can include a set of marker probes and/or primers configured to detect at least one favorable allele of one or more marker locus associated with a preferred reproductive growth phenotype. These probes or primers can be configured, for example, to detect the marker alleles noted in the tables and examples herein, e.g., using any available allele detection format, such as solid or liquid phase array based detection, microfluidic-based sample detection, etc. The kits can further include packaging materials for packaging the probes, primers, or instructions; controls, such as control amplification reactions that include probes, primers, and/or template nucleic acids for amplifications; molecular size markers; or the like.


System or kit instructions that describe how to use the system or kit and/or that correlate the presence or absence of the allele with the predicted preferred or non-preferred phenotype are also provided. For example, the instructions can include at least one look-up table that includes a correlation between the presence or absence of the favorable allele(s) and the predicted time to floral initiation. The precise form of the instructions can vary depending on the components of the system, e.g., they can be present as system software in one or more integrated unit of the system (e.g., a microprocessor, computer or computer readable medium), or can be present in one or more units (e.g., computers or computer readable media) operably coupled to the detector.


Isolated nucleic acids comprising a nucleic acid sequence coding for a preferred reproductive growth phenotype, or capable of detecting such a phenotypic trait, or sequences complementary thereto, are also included. In certain examples, the isolated nucleic acids are capable of hybridizing under stringent conditions to nucleic acids of a soybean cultivar phenotyped for a preferred reproductive growth phenotype, to detect loci associated with a preferred reproductive growth phenotype, including one or more of S01435-1, S01239-1, S00780-1, S06925-1, S09951-1, S00170-1, S04059-1, S07851-1, S11659-1, S04279-1, S02211-1, S08942-1, S05742-1, S09155-1, S02037-1, S13136-1, S17291-001, S13139-1, S17292-001, S13146-1, S17293-001, S17294-001, S17581-001, S17691-001, S17701-001, S03703-1, S17297-001, S17298-001, S17299-001, S17300-001, S17306-001, S17310-001, S17311-001, S17312-001, S17312-001, S17316-001, S17317-001, S17318-001, S17322-001, S17326-001, S17327-001, S17328-001, S17329-001, S10746-1, S17331-001, S17332-001, S17337-001, S13093-1, S12211-1, S04555-1, S17301-001, S08519-1, S12876-1, S05937-1, S08575-1, S08669-1, S11212-1, S00543-1, S01452-1, S11993-1, S13446-1, S00252-1, S04060-1, S02664-1, S00281-1, S01109-1, S13844-1, S05058-1, S04660-1, S09955-1, S08034-1, S10293-1, S03813-1, S02042-1, S16601-001, S01481-1, S11309-1, S11320-1, S04040-1, S00863-1, S17151-001, S17153-001, S17154-001, S17156-001, S17159-001, S08590-1, S17242-001, S17166-001, S17167-001, S08539-1, S17178-001, S17179-001, S17180-001, S17181-001, S17182-001, S17183-001, S02780-1, S12107-1, S03624-1, S01953-1, S00111-1, S04180-1, S01008-1, S12861-1, S04966-1, S12867-1, S10631-1-Q1, S01574-1, S16594-001, S02777-1, Gm05:30568085, Gm08:7464336, Gm08:15841570, Gm11:4674824, Gm11:5231500, Gm11:7847341, Gm14:46138053, Gm14:47331319, Gm04:5754268, Gm04:8295779, Gm04:39691731, Gm04:44725098, Gm06:410442, Gm06:11659627, Gm06:15457913, Gm06:16391391, Gm06:16499786, Gm06:16593381, Gm06:16670047, Gm06:16804435, Gm06:17498270, Gm06:18203964, Gm06:19743496, Gm06:19986645, Gm06:20007173, Gm06:20084642, Gm06:20501491, Gm06:21197184, Gm06:21500085, Gm06:22501610, Gm06:25700006, Gm06:28501458, Gm06:28671736, Gm06:29499523, Gm06:30203054, Gm06:31694650, Gm06:32503141, Gm06:33196184, Gm06:35509548, Gm06:37712913, Gm06:38467854, Gm06:39168136, Gm06:39533730, Gm06:40766974, Gm06:41476201, Gm06:42450296, Gm06:47500976, Gm06:47521797, Gm06:48475049, Gm06:49978151, Gm06:22700011, Gm01:759365, Gm02:4893148, Gm02:9714426, Gm02:11502780, Gm02:15446229, Gm02:33158449, Gm02:45776142, Gm17:16136646, Gm17:39804515, Gm15:50237460, Gm13:235439, Gm13:20365663, Gm13:20744030, Gm13:35174140, Gm18:305113, Gm18:58086324, Gm18:61591142, Gm18:61831970, Gm12:11512115, Gm20:39051858, Gm20:41216234, Gm16:4678569, Gm16:36524407, Gm19:47535046, Gm19:47826727, Gm19:48252040, Gm19:48638646, Gm19:50222676, Gm07:1141099, Gm07:1830296, Gm07:1923026, Gm07:2179883, Gm07:2310058, Gm07:2679749, Gm07:3009018, Gm07:4282676, Gm07:4319368, Gm07:4342479, Gm07:5576650, Gm07:6288899, Gm07:6340656, Gm07:6347675, Gm07:6614649, Gm07:6616695, Gm07:6623333, Gm07:6671535, Gm07:7096376, Gm07:7774056, Gm07:8674220, Gm07:35590550, Gm07:36459825, Gm07:36638366, Gm03:38491492, Gm03:39583405, Gm03:46209939, Gm10:43974548, Gm10:44725777, Gm10:44732850, Gm10:50495033, and any combination thereof.


In some examples the isolated nucleic acids are markers, for example markers selected from the group consisting of S01435-1-001, S01239-1-A, S00780-1-A, S06925-1-Q1, S09951-1-Q1, S00170-1-A, S04059-1-A, S07851-1, S11659-1-Q1, S04279-1-A, S02211-1-A, S08942-1-Q1, S05742-1-Q1, S09155-1-Q1, S02037-1-A, S13136-1-Q1, S17291-001-K001, S13139-1-Q1, S17292-001-K001, S13146-1-Q1, S17293-001-K001, S17294-001-K001, S07518-001-Q008, S17691-001-Q001, S17701-001-Q001, S03703-1-Q1, S17297-001-K001, S17298-001-K001, S17299-001-K001, S17300-001-K001, S17306-001-K001, S17310-001-K001, S17311-001-K001, S17312-001-K001, S17312-001-K001, S17316-001-K001, S17317-001-K001, S17318-001-K001, S17322-001-K001, S17326-001-K001, S17327-001-K001, S17328-001-K001, S17329-001-K001, S10746-1-Q1, S17331-001-K001, S17332-001-K001, S17337-001-K001, S13093-1-Q1, S12211-1-Q1, S04555-1-Q1, S17301-001-K001, S08519-1-Q1, S12876-1-Q1, S05937-1-Q1, S08575-1-Q1, S08669-1-Q1, S11212-1-Q1, S00543-1-A, S01452-1-A, S11993-1-Q2, S13446-1-Q1, S00252-1-A, S04060-1-A, S02664-1-A, S00281-1-A, S01109-1-Q002, S13844-1-Q1, S05058-1-Q1, S04660-1-A, S09955-1-Q1, S08034-1-Q1, S10293-1-Q1, S03813-1-A, S02042-1-A, S16601-001-Q001, S01481-1-A, S11309-1-Q1, S11320-1-Q1, S04040-1-A, S00863-1-A, S17151-001-K001, S17153-001-K001, S17154-001-K001, S17156-001-K001, S17159-001-K001, S08590-1-Q1, S17242-001-K001, S17166-001-Q006, S17167-001-Q007, S08539-1-Q1, S17178-001-K001, S17179-001-K001, S17180-001-K001, S17181-001-K001, S17182-001-K001, S17183-001-K001, S02780-1-Q1, S12107-1-Q1, S03624-1-Q001, S01953-1-A, S00111-1-A, S04180-1-A, S01008-1-B, S12861-1-Q1, S04966-1-Q1, S12867-1-Q002, S10631-1-Q1, S01574-1-A, S16594-001-Q10, and S02777-1-A. In some examples the nucleic acid is one of more polynucleotides selected from the group consisting of SEQ ID NOs: 1-512. Vectors comprising one or more of such nucleic acids, expression products of such vectors expressed in a host compatible therewith, antibodies to the expression product (both polyclonal and monoclonal), and antisense nucleic acids are also included. In some examples, one or more of these nucleic acids is provided in a kit.


As the parental line having a preferred reproductive growth phenotype, any line known to the art or disclosed herein may be used. Also included are soybean plants produced by any of the foregoing methods. Seed of a soybean germplasm produced by crossing a soybean variety having a marker or haplotype associated with time to R1 reproductive stage with a soybean variety lacking such marker or haplotype, and progeny thereof, is also included.


A soybean plant, germplasm, plant part, or seed further comprising resistance to at least one herbicidal formulation is provided. For example, the herbicidal formulation can comprise a compound selected from the group consisting of an ALS-inhibiting herbicide, a glyphosate, a hydroxyphenylpyruvatedioxygenase (HPPD) inhibitor, a sulfonamide, an imidazolinone, a bialaphos, a phosphinothricin, a metribuzin, a mesotrione, an isoxaflutole, an azafenidin, a butafenacil, a sulfosate, a glufosinate, a dicamba, a 2,4-D, and a protox inhibitor. In some examples, resistance to the herbicidal formulation is conferred by a transgene.


Glyphosate resistance can be conferred from genes including but not limited to EPSPS, GAT, GOX, and the like, such as described in U.S. Pat. Nos. 6,248,876; 5,627,061; 5,804,425; 5,633,435; 5,145,783; 4,971,908; 5,312,910; 5,188,642; 4,940,835; 5,866,775; 6,225,114; 6,130,366; 5,310,667; 4,535,060; 4,769,061; 5,633,448; 5,510,471; RE36,449; RE37,287 E; 5,491,288; 5,776,760; 5,463,175; 8,044,261; 7,527,955; 7,666,643; 7,998,703; 7,951,995; 7,968,770; 8,088,972, 7,863,503; and US20030083480; WO 97/04103; WO 00/66746; WO 01/66704; and WO 00/66747, which are each incorporated herein by reference in their entireties for all purposes. Additionally, glyphosate tolerant plants can be generated through the selection of naturally occurring mutations that impart tolerance to glyphosate.


HPPD resistance can be conferred by genes including exemplary sequences disclosed in U.S. Pat. Nos. 6,245,968; 6,268,549; and 6,069,115; and WO 99/23886, which are each incorporated herein by reference in their entireties for all purposes. Mutant hydroxyphenylpyruvatedioxygenases having this activity are also known. For further examples see US20110185444 and US20110185445.


Resistance to auxins, such as 2,4-D or dicamba, can be provided by polynucleotides as described, for example, in WO2005/107437, US20070220629, and U.S. Pat. No. 7,838,733 and in Herman et al. (2005) J. Biol. Chem. 280:24759-24767, each which is herein incorporated by reference.


Resistance to PPO-inhibiting herbicides can be provided as described in U.S. Pat. Nos. 6,288,306; 6,282,837; and 5,767,373; and WO 01/12825, each of which is herein incorporated by reference. Plants containing such polynucleotides can exhibit improved tolerance to any of a variety of herbicides which target the protox enzyme. Resistance can also be conferred as described in US20100186131; US20110185444; US20100024080, each of which is herein incorporated by reference.


The development of plants containing an exogenous phosphinothricin acetyltransferase which confers resistance to glufosinate, bialaphos, or phosphinothricin is described, for example, in U.S. Pat. Nos. 5,969,213; 5,489,520; 5,550,318; 5,874,265; 5,919,675; 5,561,236; 5,648,477; 5,646,024; 6,177,616; and 5,879,903, which are each incorporated herein by reference in their entireties for all purposes. Mutant phosphinothricin acetyltransferase having this activity are also known in the art.


In some examples, the plant or germplasm further comprises a trait selected from the group consisting of drought tolerance, stress tolerance, disease resistance, herbicide resistance, enhanced yield, modified oil, modified protein, tolerance to chlorotic conditions, and insect resistance, or any combination thereof. In some examples, the trait is selected from the group consisting of brown stem rot resistance, charcoal rot drought complex resistance, Fusarium resistance, Phytophthora resistance, stem canker resistance, sudden death syndrome resistance, Sclerotinia resistance, Cercospora resistance, anthracnose resistance, target spot resistance, frogeye leaf spot resistance, soybean cyst nematode resistance, root knot nematode resistance, rust resistance, high oleic content, low linolenic content, aphid resistance, stink bug resistance, and iron chlorosis deficiency tolerance, or any combination thereof. In some examples, one or more of the traits is conferred by one or more transgenes, by one or more native loci, or any combination thereof. Examples of markers and loci conferring improved iron chlorosis deficiency tolerance are disclosed in US20110258743, U.S. Pat. No. 7,582,806, and U.S. Pat. No. 7,977,533, each of which is herein incorporated by reference. Various disease resistance loci and markers are disclosed, for example, in WO1999031964, U.S. Pat. No. 5,948,953, U.S. Pat. No. 5,689,035, US20090170112, US20090172829, US20090172830, US20110271409, US20110145953, U.S. Pat. No. 7,642,403, U.S. Pat. No. 7,919,675, US20110131677, U.S. Pat. No. 7,767,882, U.S. Pat. No. 7,910,799, US20080263720, U.S. Pat. No. 7,507,874, US20040034890, US20110055960, US20110185448, US20110191893, US20120017339, U.S. Pat. No. 7,250,552, U.S. Pat. No. 7,595,432, U.S. Pat. No. 7,790,949, U.S. Pat. No. 7,956,239, U.S. Pat. No. 7,968,763, each of which is herein incorporated by reference. Markers and loci conferring improved yield are provided, for example, in U.S. Pat. No. 7,973,212 and WO2000018963, each of which is herein incorporated by reference. Markers and loci conferring improved resistance to insects are disclosed in, for example, US20090049565, U.S. Pat. No. 7,781,648, US20100263085, U.S. Pat. No. 7,928,286, U.S. Pat. No. 7,994,389, and WO2011116131, each of which is herein incorporated by reference. Markers and loci for modified soybean oil content or composition are disclosed in, for example, US20120028255 and US20110277173, each of which is herein incorporated by reference. Methods and compositions to modified soybean oil content are described in, for example, WO2008147935, U.S. Pat. No. 8,119,860; U.S. Pat. No. 8,119,784; U.S. Pat. No. 8,101,189; U.S. Pat. No. 8,058,517; U.S. Pat. No. 8,049,062; U.S. Pat. No. 8,124,845, U.S. Pat. No. 7,790,959, U.S. Pat. No. 7,531,718, U.S. Pat. No. 7,504,563, and U.S. Pat. No. 6,949,698, each of which is herein incorporated by reference. Markers and loci conferring tolerance to nematodes are disclosed in, for example, US20090064354, US20090100537, US20110083234, US20060225150, US20110083224, U.S. Pat. No. 5,491,081, U.S. Pat. No. 6,162,967, U.S. Pat. No. 6,538,175, U.S. Pat. No. 7,872,171, U.S. Pat. No. 6,096,944, and U.S. Pat. No. 6,300,541, each of which is herein incorporated by reference. Resistance to nematodes may be conferred using a transgenic approach as described, for example, in U.S. Pat. No. 6,284,948 and U.S. Pat. No. 6,228,992, each of which is herein incorporated by reference. Plant phenotypes can be modified using isopentyl transferase polynucleotides as described, for example, in U.S. Pat. No. 7,553,951 and U.S. Pat. No. 7,893,236, each of which is herein incorporated by reference.


Soybean seeds, plants, and plant parts comprising a preferred reproductive growth phenotype may be cleaned and/or treated. The resulting seeds, plants, or plant parts produced by the cleaning and/or treating process(es) may exhibit enhanced yield characteristics. Enhanced yield characteristics can include one or more of the following: increased germination efficiency under normal and/or stress conditions, improved plant physiology, growth and/or development, such as water use efficiency, water retention efficiency, improved nitrogen use, enhanced carbon assimilation, improved photosynthesis, and accelerated maturation, and improved disease and/or pathogen tolerance. Yield characteristics can furthermore include enhanced plant architecture (under stress and non-stress conditions), including but not limited to early flowering, flowering control for hybrid seed production, seedling vigor, plant size, internode number and distance, root growth, seed size, fruit size, pod size, pod or ear number, seed number per pod or ear, seed mass, enhanced seed filling, reduced seed dispersal, reduced pod dehiscence and lodging resistance. Further yield characteristics include seed composition, such as carbohydrate content, protein content, oil content and composition, nutritional value, reduction in anti-nutritional compounds, improved processability and better storage stability.


Cleaning a seed or seed cleaning refers to the removal of impurities and debris material from the harvested seed. Material to be removed from the seed includes but is not limited to soil, and plant waste, pebbles, weed seeds, broken soybean seeds, fungi, bacteria, insect material, including insect eggs, larvae, and parts thereof, and any other pests that exist with the harvested crop. The terms cleaning a seed or seed cleaning also refer to the removal of any debris or low quality, infested, or infected seeds and seeds of different species that are foreign to the sample.


Treating a seed or applying a treatment to a seed refers to the application of a composition to a seed as a coating or otherwise. The composition may be applied to the seed in a seed treatment at any time from harvesting of the seed to sowing of the seed. The composition may be applied using methods including but not limited to mixing in a container, mechanical application, tumbling, spraying, misting, and immersion. Thus, the composition may be applied as a powder, a crystalline, a ready-to-use, a slurry, a mist, and/or a soak. For a general discussion of techniques used to apply fungicides to seeds, see “Seed Treatment,” 2d ed., (1986), edited by K A Jeffs (chapter 9), herein incorporated by reference in its entirety. The composition to be used as a seed treatment can comprise one or more of a pesticide, a fungicide, an insecticide, a nematicide, an antimicrobial, an inoculant, a growth promoter, a polymer, a flow agent, a coating, or any combination thereof. General classes or family of seed treatment agents include triazoles, anilides, pyrazoles, carboxamides, succinate dehydrogenase inhibitors (SDHI), triazolinthiones, strobilurins, amides, and anthranilic diamides. In some examples, the seed treatment comprises trifloxystrobin, azoxystrobin, metalaxyl, metalaxyl-m, mefenoxam, fludioxinil, imidacloprid, thiamethoxam, thiabendazole, ipconazole, penflufen, sedaxane, prothioconazole, picoxystrobin, penthiopyrad, pyraclastrobin, xemium, Rhizobia spp., Bradyrhizobium spp. (e.g., B. japonicum), Bacillus spp. (e.g., B. firmus, B. pumilus, B. subtilus), lipo-chitooligosaccharide, clothianidin, cyantraniliprole, chlorantraniliprole, abamectin, and any combination thereof. In some examples the seed treatment comprises trifloxystrobin, metalaxyl, imidacloprid, Bacillus spp., and any combination thereof. In some examples the seed treatment comprises picoxystrobin, penthiopyrad, cyantraniliprole, chlorantraniliprole, and any combination thereof. In some examples, the seed treatment improves seed germination under normal and/or stress environments, early stand count, vigor, yield, root formation, nodulation, and any combination thereof. In some examples seed treatment reduces seed dust levels, insect damage, pathogen establishment and/or damage, plant virus infection and/or damage, and any combination thereof.


The present invention is illustrated by the following examples. The foregoing and following description of the present invention and the various examples are not intended to be limiting of the invention but rather are illustrative thereof. Hence, it will be understood that the invention is not limited to the specific details of these examples.


EXAMPLES
Example 1

An F5 mapping population from a cross of 90Y50 and 90Y41 was used to identify loci associated with reproductive stage traits in soybean. The population consisted of 340 progeny phenotyped for physiological maturity. A set of 141 markers expected to be polymorphic were selected across all 20 chromosomes, and the samples were genotyped. Eighty-three markers showed monomorphism in this population and were removed from analysis. A further 35 markers were removed based on severe segregation distortion (p<0.001). The remaining 22 markers were used to construct a linkage map and perform QTL analysis using Map Manager QTX.b20 (Manly et al. (2001) Mammalian Genome 12:930-932; available online at mapmanager.org). The initial parameters were set at: Linkage Evaluation: Intercross; search criteria: p=1e−5; map function: Kosambi; and, cross type: line cross. A 1000 permutation test was used to establish the threshold for statistical significance (LOD ratio statistic—LRS). The maternal alleles were assigned as “A”, and the paternal alleles as “B”, and the heterozygous as “H”, and the “Low Signal” and “Equivocal” as “-” (missing). Chi-square test was used for goodness of fit test. One marker, S00281-1-A on LG F (Gm13:35174140, 73.16 cM) showed significant association in the QTL analysis.


Genomic DNA was extracted from leaf tissue of each progeny using a modification of the CTAB (cetyltriethylammonium bromide, Sigma H5882) method described by Stacey & Isaac (Methods in Molecular Biology, Vol. 28: Protocols for Nucleic Acid Analysis by Nonradioactive Probes, Ed: Isaac, Humana Press Inc, Totowa, N.J. 1994, Ch 2, pp. 9-15). Approximately 100-200 mg of tissue was ground into powder in liquid nitrogen and homogenised in 1 ml of CTAB extraction buffer (2% CTAB, 0.02 M EDTA, 0.1 M Tris-Cl pH 8, 1.4 M NaCl, 25 mM DTT) for 30 min at 65° C. Homogenised samples were cooled at room temperature for 15 min before a single protein extraction with approximately 1 ml 24:1 v/v chloroform:octanol was done. Samples were centrifuged for 7 min at 13,000 rpm and the upper layer of supernatant was collected using wide-mouthed pipette tips. DNA was precipitated from the supernatant by incubation in 95% ethanol on ice for 1 h. DNA threads are spooled onto a glass hook, washed in 75% ethanol containing 0.2 M sodium acetate for 10 min, air-dried for 5 min and resuspended in TE buffer. Five μl RNAse A was added to the samples and incubated at 37° C. for 1 hour.


Example 2

An F2 mapping population from a cross of 90A01 and 90Y41 comprising 227 progeny that were segregating for flowering date and for maturity date was used to identify loci associated with the R1 reproductive stage trait in soybean. A set of 197 markers was used to genotype the samples. The F2 plant samples were genotyped, and 1-4 F3 plants were phenotyped for each F2 genotyped plant. Flowering date and maturity date were recorded, and converted to sequential numbering with the earliest date assigned to equal 1. The replicates were averaged to produce one phenotypic score/genotyped plant.


Genomic DNA was extracted essentially as described by Truett et al. (2000 BioTechniques 29:52-54). The samples are prepared for extraction by adding 400 μl Extraction Buffer (25 mM NaOH, 0.2 mM disodium EDTA (pH not adjusted, ˜pH 12)) to sample racks containing 1-2 leaf punches and a stainless steel bb for grinding in each well. Each plate is heat-sealed, and ground in a Genogrinder. After grinding, the plate is heated at 94° C. for 70 minutes. The seal is removed and 400 μl of Neutralization Buffer is added (40 mM Tris-HCl (pH not adjusted, ˜pH 5)). The plate is sealed with a new foil seal and shaken to mix the solutions. The sealed plate is centrifuged for 10 minutes. The final DNA extract contains 20 mM Tris-HCl, pH 8.1, and 0.1 mM EDTA and is ready for use in various assays.


Map Manager QTX.b20 (Manly et al. (2001) Mammalian Genome 12:930-932; available online at mapmanager.org) was used to construct the linkage map using the following settings: Linkage Evaluation: Intercross; search criteria: p=1e−5; map function: Kosambi; and, cross type: line cross.


Single marker analysis (SMA), composite interval mapping (CIM), and multiple interval mapping (MIM) were executed using QTL Cartographer 2.5 (Wang et al. (2011) Windows QTL Cartographer 2.5; Dept. of Statistics, North Carolina State University, Raleigh, N.C. Available online at statgen.ncsu.edu/qticart/WQTLCart.htm). The standard CIM model and forward and backward regression method was used, and the LRS threshold for statistical significance to declare QTLs was determined by a 500 permutation test. The initial MIM model was determined using the CIM results and the threshold found by permutation test. The default criteria were used to optimize QTL positions, verify QTL significance, and search for interactions.


While evaluating the genotype data, 27 markers were removed from the analysis for failing one or more criteria. Markers were evaluated for segregation distortion via distribution of chi test results, from this 11 markers were identified as severely distorted (p<0.0001), but were retained in the analysis. Genotypic data across each progeny indicated no selfed plants within the population, and all progeny had greater than 70% data return.


Phenotype data for the population was also evaluated, 7 progeny with high standard deviation between replications of phenotypic scores and one progeny that was an extreme outlier for maturity were removed from the analysis. The remaining 219 progeny showed a relatively normal distribution for maturity, and a skewed distribution for flowering (skewed left).


Linkage groups were created using 159 non-distorted markers, resulting in 28 linkage groups with 7 markers remaining unlinked. Three distorted markers formed an additional linkage group and the remaining 8 distorted markers could not be distributed.


Single marker analysis of the flowering time phenotypic dataset found highly significant associations on LG C2 (ch 6) in an interval flanked by and including S02037-1-A and S13093-1-Q1 at 89.19-113.11 cM. Table 1 summarizes data for markers found with an F test statistic (pr(F))<0.05 level of significance.














TABLE 1







Position
QTL




Marker
LG (ch)
(cM)
Effect
Pr(F)
R2




















S09155-1-Q1
C2 (6)
69.29
90Y41
0.02132
0.0236


S02037-1-A
C2 (6)
89.19
90Y41
0.00000
0.2749


S13136-1-Q1
C2 (6)
96.04
90Y41
0.00000
0.3702


S13146-1-Q1
C2 (6)
98.23
90Y41
0.00000
0.3749


S10746-1-Q1
C2 (6)
104.94
90Y41
0.00000
0.3108


S13093-1-Q1
C2 (6)
113.11
90Y41
0.00002
0.0752


S12211-1-Q1
C2 (6)
116.04
90Y41
0.00088
0.0477


S04555-1-Q1
C2 (6)
132.43
90Y41
0.02968
0.0194


S08539-1-Q1
M (7)
36.74
90A01
0.00578
0.0416


S08590-1-Q1
M (7)
19.96
90A01
0.02616
0.0224


S01239-1-A
A2 (8)
40.49
90A01
004836
0.0188


S08669-1-Q1
D1b (2)
76.53
90A01
0.02989
0.0215


S11212-1-Q1
D1b (2)
83.28
90A01
0.01806
0.0244


S03813-1-A
J (16)
30.57
90Y41
0.01707
0.0259


S02042-1-A
J (16)
85.53
90Y41
0.04332
0.0187


S12862-1-Q1
N (3)
53.56
90Y41
0.01687
0.023


S12867-1-Q002
N (3)
58.35
90Y41
0.02117
0.0242


S04966-1-Q1
N (3)
92.16
90Y41
0.02070
0.0213









Single marker analysis of the maturity time phenotypic dataset found highly significant associations on LG C2 (ch 6) in an interval flanked by and including S02037-1-A and S13093-1-Q1 at 89.19-113.11 cM. Additional significant associations were found on LG D1b (ch 2) in an interval at about 29.48-34.18 cM which included S12876-1-Q1, LG F (ch 13) at marker S00252-1-A (˜0 cM), LG L (ch 19) at marker S04040-1-A (˜100.89 cM), and LG M (ch 7) at S08539-1-Q1 at about 36.74 cM. Table 2 summarizes data for these markers with an F test statistic (pr(F))<0.05 level of significance.














TABLE 2







Position
QTL




Marker
LG (ch)
(cM)
Effect
Pr(F)
R2




















S05742-1-Q1
C2 (6)
4.88
90Y41
0.01338
0.0279


S09155-1-Q1
C2 (6)
69.29
90Y41
0.01271
0.0273


S02037-1-A
C2 (6)
89.19
90Y41
0.00000
0.2741


S13136-1-Q1
C2 (6)
94.84
90Y41
0.00000
0.3837


S13146-1-Q1
C2 (6)
98.23
90Y41
0.00000
0.4036


S10746-1-Q1
C2 (6)
104.94
90Y41
0.00000
0.3463


S13093-1-Q1
C2 (6)
113.11
90Y41
0.00005
0.0692


S12211-1-Q1
C2 (6)
116.04
90Y41
0.00026
0.0614


S04555-1-Q1
C2 (6)
132.43
90Y41
0.00801
0.0292


S08590-1-Q1
M (7)
19.96
90A01
0.00108
0.0487


S12107-1-Q1
M (7)
43.16
90A01
0.02162
0.0187


S08539-1-Q1
M (7)
36.74
90A01
0.00218
0.0558


S12876-1-Q1
D1b (2)
29.48
90Y41
0.00014
0.0645


S08669-1-Q1
D1b (2)
76.53
90Y41
0.02810
0.022


S00252-1-A
F (13)
0
90Y41
0.00606
0.046


S04060-1-A
F (13)
36.9
90Y41
0.01496
0.0372


S02664-1-A
F (13)
36.96
90Y41
0.03657
0.0223


S11309-1-Q1
L (19)
91.1
90A01
0.01827
0.0231


S04040-1-A
L (19)
100.89
90A01
0.00543
0.0351


S05058-1-Q1
G (18)
105.85
90Y41
0.01395
0.0275


S01435-1-Q001
A1 (5)
33.61
90Y41
0.04597
0.0182


S00780-1-A
A2 (8)
76.47
90A01
0.04270
0.017


S11659-1-Q1
C1 (4)
29.24
90Y41
0.01646
0.0253


S04279-1-A
C1 (4)
45.75
90Y41
0.03048
0.0275


S02211-1-A
C1 (4)
54.48
90Y41
0.02204
0.0242









Composite interval mapping also identified the markers on LG C2 associated with flowering date and with maturity date, as well as the markers on LG D1b and LG M associated with maturity date. Four QTLs were identified on LG C2 for flowering date using a 1-LOD interval. The peak markers spanned 69.29 cM to 104.94 cM, and percent variation explained ranged from 22.6% to 66%. The QTL effect was from 90Y41 for all four QTLs. Table 3 summarizes the CIM analysis for flowering date associations on LG C2.














TABLE 3






Marker
LG (ch)
Position (cM)
LOD
R2





















S09155-1-Q1
C2 (6)
69.29
21.1
0.461



S02037-1-A
C2 (6)
89.19
16.0
0.226



S13146-1-Q1
C2 (6)
98.23
40.5
0.660



S10746-1-Q1
C2 (6)
104.94
22.0
0.414









QTLs for maturity were identified on LG C2, D1b, and M. The QTL identified in this analysis on LG C2 was in about the same region as the loci found for flowering date, with a peak position at S13146-1-Q1 (98.23 cM). The percent variation explained was 44.9% and the effect was from 90Y41. QTLs were identified on LG D1b, including a peak at S05937-1-Q1 (48.44 cM). The percent variation explained was 4.6% and the effects were from 90A01. A QTL was also found on LG M with a peak at S08590-1-Q1 (19.96 cM), which explained about 4.8% of the phenotypic variation and the effect was from 90A01. Table 4 summarizes the CIM analysis for maturity date associations on these linkage groups.














TABLE 4






Marker
LG (ch)
Position (cM)
LOD
R2





















S13146-1-Q1
C2 (6)
98.23
31.5
0.449



S05937-1-Q1
D1b (2)
48.44
4.6
0.058



S08590-1-Q1
M (7)
19.96
4.2
0.048









Multiple interval mapping (MIM) was performed to better estimate the percent variation explained by each QTL and to test for QTL interactions for flowering date. MIM results indicated three QTLs on LG C2, explaining a total of 87.1% of the phenotypic variation for flowering date (Table 5). Three epistatic interactions (A=Additive; D=Dominance; AA=Additive by Additive; AD=Additive by Dominance) were also identified, accounting for an additional 9.2% of the variation (Table 6). Combined, 96.3% of the phenotypic variation was explained by this model.













TABLE 5






Marker
LG (ch)
Position (cM)
R2




















S09155-1-Q1
C2 (6)
69.29
−0.071



S02037-1-A
C2 (6)
89.19
0.109



S13146-1-Q1
C2 (6)
98.23
0.833




















TABLE 6





QTL1
QTL2
Type
Effect
R2



















1
2
AA
−1.503
0.050


1
2
AD
0.477
−0.080


1
3
AD
0.066
0.123









Multiple interval mapping (MIM) was performed to better estimate the percent variation explained by each QTL and to test for QTL interactions for maturity date. The four QTLs identified in the CIM analysis explained a total of about 67.7% of the phenotypic variation using MIM. One dominant by dominant (DD) epistatic interaction was identified between S13146-1-Q1 on LG C2 and S05837-1-Q1 on LG D1b, explaining an additional 2% of the variation. Results are summarized in Table 7.













TABLE 7






Marker
LG (ch)
Position (cM)
R2




















S13146-1-Q1
C2 (6)
98.23
0.479



S12876-1-Q1
D1b (2)
29.48
0.105



S05937-1-Q1
D1b (2)
48.44
0.049



S08590-1-Q1
M (7)
19.96
0.044









Example 3

Genome-wide analysis indicated a reproductive stage QTL related to maturity on LG M in 8 different biparental crosses. KASPar markers were designed across the putative region and used to fine map the locus. Three F3 populations were selected, two populations from 92Y91 X 92Y60 (designated as JB5341 and JB5386 respectively), and one population from 92Y80 X 92Y60 (JB5333). A total of 33 KASPar markers were designed and used to assay all populations in a region between 1.83 Mbps and 6.63 Mbps on LG M. Phenotype data comprised maturity scores.


Map Manager QTX.b20 (Manly et al. (2001) Mammalian Genome 12:930-932; available online at mapmanager.org) was used to construct the linkage map using the following settings: Linkage Evaluation: Intercross; search criteria: p=1e−5; map function: Kosambi; and, cross type: line cross.


Single marker analysis (SMA) and composite interval mapping (CIM) were done using QTL Cartographer 2.5 (Wang et al. (2011) Windows QTL Cartographer 2.5; Dept. of Statistics, North Carolina State University, Raleigh, N.C. Available online at statgen.ncsu.edu/qticart/WQTLCart.htm). The standard CIM model and forward and backward regression method was used, and the default LRS threshold of 11.5 was used to declare QTLs statistically significant.


Genotyping results indicated that 9 KASPar markers were polymorphic for JB5341, 14 KASPar markers were polymorphic for JB5386, and 13 KASPar markers were polymorphic for JB5333. Six markers from previous genome wide analysis were added to each job. A chi test was performed to identify segregation distortion, and indicated that 5 markers were severely distorted in JB5341, 12 in JB5386, and 2 in JB5333. A total of 48 progeny were missing phenotypic scores and another three were missing more than 30% data for JB5341. Likewise, one progeny was missing a phenotypic score and two were missing more than 30% data for JB5386. These 54 progeny were removed from the analysis. The phenotypic distribution for each population was essentially normal for each population, though some distortion was observed in JB5341 as noted earlier.


Linkage groups were created for each population, with 6 linkage groups formed and 5 unlinked markers for JB5386, 4 linkage groups and 1 unlinked marker for JB5341, and two linkage groups and 1 unlinked marker for JB5333.


Single marker analysis indicated minor significance on LG M in JB5341 and JB5386, with the highest associations at S17179-001-K001 (40.83 cM) (PVE=5.7%) and S17159-001-K001 (18.14 cM) (PVE=2.2%), respectively. Highly significant markers were found in JB5333 between S00863-1 (8.09 cM) and S01953-1 (48.13 cM). The peak marker was S17167-001-K001 at 31.99 cM, with an R2 value of 37.7%. Table 8 summarizes all markers on LG M significant by single marker analysis at a pr(F)<0.05 level.













TABLE 8









JB5341
JB5386
JB5333














Marker
Position
Pr(F)
R2
Pr(F)
R2
Pr(F)
R2

















S00863-1
8.09




0.00000
0.075


S17151-001-K001
11.64
0.02341
0.029


0.00000
0.110


S17153-001-K001
12.12
0.00452
0.051
0.03158
0.021
0.00000
0.147


S17154-001-K001
13.97
0.02816
0.033


0.00000
0.138


S17156-001-K001
15.53
0.01732
0.037
0.03859
0.021
0.00000
0.147


S17159-001-K001
18.14
0.00873
0.044
0.03687
0.022
0.00000
0.125


S17166-001-K001
31.87




0.00000
0.345


S17167-001-K001
31.99




0.00000
0.377


S17178-001-K001
40.59




0.00000
0.154


S17179-001-K001
40.83
0.00285
0.057


0.00000
0.166


S17180-001-K001
40.85




0.00000
0.152


S17181-001-K001
41.66
0.00285
0.052


0.00000
0.141


S17182-001-K001
41.66




0.00000
0.145


S17183-001-K001
41.69
0.00285
0.057


0.00000
0.144


S03624-1
45.02
0.00203
0.054






S00111-1
79.14
0.03238
0.029






S02780-1
41.85




0.00000
0.125


S01953-1
48.13




0.00000
0.068


S04180-1
86.05




0.00878
0.020









Composite interval mapping analysis did not find any QTL for populations JB5341 or JB5386, however a QTL was found for population JB5333. The peak for the region was near S17167-001-K001 (31.99 cM) on LG M, with LOD=35.1. This locus explained 37.8% of the phenotypic variation. The additive effect indicated that early maturity was from parent 92Y60.


Example 4

Additional markers targeting a region on LG C2 were developed to probe the F2 population from 90A01/90Y41 (Example 3) and to fine map one or more loci associated with flowering data or maturity. Additional markers were developed based on KASPar technology and used to saturate the region to further refine the locus.


Forty-seven KASPar markers were developed to target a region on LG C2 spanning 16.5 Mbps to 47.5 Mbps. The genotypic data from these markers was combined with the results from 13 markers on LG C2 from previous analysis.


Map Manager QTX.b20 (Manly et al. (2001) Mammalian Genome 12:930-932; available online at mapmanager.org) was used to construct the linkage map using the following settings: Linkage Evaluation: Intercross; search criteria: p=1e−5; map function: Kosambi; and, cross type: line cross.


Single marker analysis (SMA), composite interval mapping (CIM), and multiple interval mapping (MIM) were done using QTL Cartographer 2.5 (Wang et al. (2011) Windows QTL Cartographer 2.5; Dept. of Statistics, North Carolina State University, Raleigh, N.C. Available online at statgen.ncsu.edu/qticart/WQTLCart.htm). The standard CIM model and forward and backward regression method was used, and the default LRS threshold for statistical significance was used to declare QTLs statistically significant. Window size and walk speed parameters were adjusted to narrow the QTL peak. The initial MIM model was determined using the MIM forward search method. The default criteria were used to optimize QTL positions, verify QTL significance, and search for interactions.


Preliminary analysis of the marker genotype data indicated 11 markers were missing more than 30% data, nine markers were monomorphic, and 2 failed. There was also one progeny that was removed from analysis based on missing more than 30% data.


Reviewing phenotype data showed seven progeny with exceptionally high standard deviations between phenotype score replicants, additionally one progeny was an extreme outlier for maturity. These progeny were removed from the analysis and the phenotypic distributions for flowering time and for maturity of the remaining 218 progeny evaluated. The phenotypic distribution for maturity was essentially normal, while the distribution for flowering time had some skewing to the left. Linkage analysis was done and one linkage group was formed comprising all 38 markers.


Single marker analysis of the flowering time data set showed highly significant associations in an interval flanked by and comprising S02037-1-A (89.19 cM) and S13093-1-Q1 (113.11 cM) on LG C2, with R2 values ranging from 7.5% to 44.5%. The peak marker in this region was S17297-001-K001 at 102.43 cM. Table 9 summarizes all markers associated with flowering time at a pr(F)<10−5 level of significance.













TABLE 9





Marker
Position (cM)
QTL Donor
Pr(F)
R2



















S02037-1-A
89.19
90Y41
0.00000
0.274


S13136-1-Q1
94.84
90Y41
0.00000
0.369


S17291-001-K001
96.04
90Y41
0.00000
0.369


S17292-001-K001
97.84
90Y41
0.00000
0.353


S13146-1-Q1
98.23
90Y41
0.00000
0.374


S17293-001-K001
100.29
90Y41
0.00000
0.405


S17294-001-K001
101.72
90Y41
0.00000
0.413


S17297-001-K001
102.43
90Y41
0.00000
0.445


S17298-001-K001
102.71
90Y41
0.00000
0.426


S17299-001-K001
102.83
90Y41
0.00000
0.418


S17300-001-K001
102.93
90Y41
0.00000
0.410


S17301-001-K001
102.97
90Y41
0.00000
0.412


S17306-001-K001
103.29
90Y41
0.00000
0.419


S17310-001-K001
103.3
90Y41
0.00000
0.421


S17311-001-K001
103.3
90Y41
0.00000
0.378


S17312-001-K001
103.3
90Y41
0.00000
0.377


S17313-001-K001
103.31
90Y41
0.00000
0.399


S17316-001-K001
103.31
90Y41
0.00000
0.389


S17317-001-K001
103.31
90Y41
0.00000
0.203


S17318-001-K001
103.32
90Y41
0.00000
0.405


S17322-001-K001
103.37
90Y41
0.00000
0.378


S17326-001-K001
103.79
90Y41
0.00000
0.372


S17327-001-K001
104
90Y41
0.00000
0.376


S17328-001-K001
104.25
90Y41
0.00000
0.349


S17329-001-K001
104.38
90Y41
0.00000
0.324


S10746-1-Q1
104.94
90Y41
0.00000
0.310


S17331-001-K001
105.8
90Y41
0.00000
0.347


S17332-001-K001
106.19
90Y41
0.00000
0.347


S17337-001-K001
113.1
90Y41
0.00001
0.080


S13093-1-Q1
113.11
90Y41
0.00001
0.075









Single marker analysis of the maturity data set showed highly significant associations in an interval flanked by and comprising S02037-1-A (89.19 cM) and S13093-1-Q1 (113.11 cM) on LG C2, with R2 values ranging from 7.0% to 49.7%. The peak marker in this region was S17297-001-K001 at 102.43 cM. Table 10 summarizes all markers associated with maturity at a pr(F)<10−5 level of significance.













TABLE 10





Marker
Position (cM)
QTL Donor
Pr(F)
R2



















S02037-1-A
89.19
90Y41
0.00000
0.243


S13136-1-Q1
94.84
90Y41
0.00000
0.379


S17291-001-K001
96.04
90Y41
0.00000
0.379


S17292-001-K001
97.84
90Y41
0.00000
0.367


S13146-1-Q1
98.23
90Y41
0.00000
0.399


S17293-001-K001
100.29
90Y41
0.00000
0.449


S17294-001-K001
101.72
90Y41
0.00000
0.470


S17297-001-K001
102.43
90Y41
0.00000
0.497


S17298-001-K001
102.71
90Y41
0.00000
0.494


S17299-001-K001
102.83
90Y41
0.00000
0.484


S17300-001-K001
102.93
90Y41
0.00000
0.475


S17301-001-K001
102.97
90Y41
0.00000
0.481


S17306-001-K001
103.29
90Y41
0.00000
0.488


S17310-001-K001
103.3
90Y41
0.00000
0.474


S17311-001-K001
103.3
90Y41
0.00000
0.449


S17312-001-K001
103.3
90Y41
0.00000
0.433


S17313-001-K001
103.31
90Y41
0.00000
0.452


S17316-001-K001
103.31
90Y41
0.00000
0.430


S17317-001-K001
103.31
90Y41
0.00000
0.356


S17318-001-K001
103.32
90Y41
0.00000
0.453


S17322-001-K001
103.37
90Y41
0.00000
0.428


S17326-001-K001
103.79
90Y41
0.00000
0.426


S17327-001-K001
104
90Y41
0.00000
0.425


S17328-001-K001
104.25
90Y41
0.00000
0.384


S17329-001-K001
104.38
90Y41
0.00000
0.372


S10746-1-Q1
104.94
90Y41
0.00000
0.342


S17331-001-K001
105.8
90Y41
0.00000
0.345


S17332-001-K001
106.19
90Y41
0.00000
0.345


S17337-001-K001
113.1
90Y41
0.00003
0.083


S13093-1-Q1
113.11
90Y41
0.00003
0.070









The initial composite interval mapping results for flowering date using the default settings showed 4 QTLs on LG C2 between about 89.19 cM and 104.38 cM. In further analyses the window size was adjusted to 5 cM, and then to 1 cM, to narrow down the probable location of the locus. The final results indicate a QTL peak at marker S17297-001-K001 explaining 26.4% of the phenotypic variation for flowering time. Early flowering date was from parent 90A01. Table 11 summarizes the results of CIM analysis on LG C2.














TABLE 11






Window
Peak
Position
LRS
R2




















10
cM
S13146-1-Q1
98.23
161.0
0.526


5
cM
S17297-001-K001
102.43
102.9
0.264


1
cM
S17297-001-K001
102.43
102.9
0.264









The initial composite interval mapping results for maturity using the default settings showed 2 QTLs on LG C2. In further analyses the window size was adjusted to 5 cM, and then to 1 cM, which resulted in a single QTL. The final results indicate a QTL peak at marker S17297-001-K001 explaining 49.7% of the phenotypic variation for maturity. Early maturity date was from parent 90A01. Table 12 summarizes the results of CIM analysis on LG C2.














TABLE 12






Window
Peak
Position
LRS
R2




















10
cM
S17297-001-K001
102.43
150.4
0.497


5
cM
S17297-001-K001
102.43
150.4
0.497


1
cM
S17297-001-K001
102.43
150.4
0.497









Multiple interval mapping (MIM) was used to corroborate the results obtained from composite interval mapping. MIM results indicated a single QTL on LG C2 with a peak near S17297-001-K001 for both flowering date and for maturity. In the MIM analysis 54.4% of the phenotypic variation in flowering time, and 49.9% of the phenotypic variation in maturity was explained by this model.


Example 5

Populations were developed by crossing lines from maturity groups (MG) 0 or 1 with lines from maturity groups 3 or 4, specifically 90Y20/94Y22, 90Y90/93Y82, and 91Y20/93Y82. Plants from F2 seed and check lines were leaf punched and genotyped with markers S01574-1-A (E2) and S01481-1-A (E3), these markers are in high linkage disequilibrium (LD) with the causative mutation at each respective locus. The polymorphism detected by marker S01574-1-A has allele C associated with early flowering and allele A associated with late flowering. The polymorphism detected by marker S01481-1-A polymorphism has allele T associated with early flowering and allele G associated with late flowering.


F3 seed were harvested from selected plants from each population, planted in randomized plots, and phenotyped for flowering time and maturity during the growing season. The genotyping and phenotyping data sample information is summarized in Table 13.












TABLE 13





Population
90Y20/94Y22
90Y90/93Y82
91Y20/93Y82


















F2 plants genotyped
552
736
1104


F3 plants phenotyped
91
184
265









Genotyping data was grouped into one of eight classes depending on the allele identified by each marker, and whether both loci are considered in the analysis, and then phenotypic data aggregated and analyzed accordingly. The data is summarized in Table 14.















TABLE 14








Avg.
Avg.
Avg.






#days
#days
#days


Genotype

#F2:3
to
planting
flow-


Class

prog-
flow-
to
ering to


(E2/E3)
Pedigree
eny
ering
maturity
maturity
Note







Late/late
94Y22

49
NA
NA
Check


Early/Early
90Y20/94Y22
22
34
100
66
Progeny


Early/Late
90Y20/94Y22
24
38
106
68
Progeny


Late/Early
90Y20/94Y22
21
43
113
70
Progeny


Late/Late
90Y20/94Y22
24
46
115
69
Progeny


Early/—
90Y20/94Y22
46
36
103
67
Progeny


—/Early
90Y20/94Y22
43
38
106
68
Progeny


—/Late
90Y20/94Y22
48
42
110
68
Progeny


Late/—
90Y20/94Y22
45
45
114
69
Progeny


Early/Early
90Y90

35
 97
62
Check


Late/Late
93Y82

47
NA
NA
Check


Early/Early
90Y90/93Y82
49
37
100
63
Progeny


Early/Late
90Y90/93Y82
44
39
104
65
Progeny


Late/Early
90Y90/93Y82
37
42
111
69
Progeny


Late/Late
90Y90/93Y82
54
47
115
68
Progeny


Early/—
90Y90/93Y82
93
38
102
64
Progeny


—/Early
90Y90/93Y82
86
39
105
66
Progeny


—/Late
90Y90/93Y82
98
43
110
67
Progeny


Late/—
90Y90/93Y82
91
45
113
68
Progeny


Early/Early
91Y20

35
 97
62
Check


Late/Late
93Y82

47
NA
NA
Check


Early/Early
91Y20/93Y82
69
37
101
65
Progeny


Early/Late
91Y20/93Y82
63
39
107
68
Progeny


Late/Early
91Y20/93Y82
65
42
113
71
Progeny


Late/Late
91Y20/93Y82
68
46
116
70
Progeny


Early/—
91Y20/93Y82
132 
38
104
66
Progeny


—/Early
91Y20/93Y82
134 
39
107
68
Progeny


—/Late
91Y20/93Y82
131 
43
112
69
Progeny


Late/—
91Y20/93Y82
133 
44
114
70
Progeny


Early/Early
91Y40

37
101
64
Check


Early/Early
91Y62

36
101
65
Check


Late/Early
91Y81

37
106
69
Check


Early/Late
91Y92

34
109
75
Check


Late/Early
92Y11

38
107
69
Check


Late/Early
92Y31

39
112
73
Check


Late/Early
92Y53

39
110
71
Check


Late/Early
92Y60

41
115
74
Check


Late/Early
92Y74

39
113
74
Check


Late/Early
92Y83

40
115
75
Check


Late/Early
92Y91

39
114
75
Check


Late/Late
93Y22

45
NA
NA
Check


Late/Late
93Y82

47
NA
NA
Check


Late/Late
94Y22

49
NA
NA
Check









Example 6

Several segregating populations in one or more locations were genotyped using one or more markers, which in some cases included markers associated with reproductive growth, as well as phenotyped for one or more reproductive stages. Initiation of flowering was measured both as days after planting (DAP), and as day of the year (DOY), which helps account for different planting dates across locations where relevant. Marker data and phenotypic data associations were analyzed using a partial least squares (PLS) methodology. Tables 15-23 summarize these studies, Tables 20-23 show single location results which are also aggregated for analysis and presentation in Table 19.















TABLE 15













Allelic


Pop (♀/♂)

Locus
Position
SNP
μFLDATE (DAP)
Sub















(n)
Locs
(ch)
(cM)




(DAP)


















92Y75/92Y22
1
S09951-1
32.04
G_G
T_T
41.7
39.6
2.0


(319)

(11)




S00170-1
45.37
A_A
T_T
41.6
39.7
2.0




(11)




S08519-1
8.96
C_C
G_G
41.2
40.4
0.8




(1)




S08942-1
80.59
G_G
C_C
40.3
41.5
1.2




(4)




S02780-1
41.85
G_G
A_A
40.6
40.4
0.2




(7)




S02777-1
129.25
T_T
C_C
40.0
40.9
0.9




(10)




S04059-1
75.42
G_G
A_A
40.5
41.4
0.9




(14)








Variance
11.0


(DAP or DOY)


Mean (DAP)
40.8


Mean (DOY)
181.8






















TABLE 16













Allelic


Pop (♀/♂)

Locus
Position
SNP
μFLDATE (DAP)
Sub















(n)
Locs
(ch)
(cM)




(DAP)


















92Y75/92Y80
1
S08575-1
58.78
A_A
G_G
65.1
66.3
1.1


(304)

(2)




S08942-1
80.59
G_G
C_C
65.1
66.2
1.1




(4)




S13139-1
97.08
C_C
T_T
64.9
66.3
1.4




(6)




S06925-1
28.92
G_G
A_A
66.5
65.1
1.3




(11)




S13446-1
92.65
T_T
C_C
65.4
66.3
0.9




(15)




S01452-1
73.34
C_C
T_T
66.2
64.9
1.2




(17)




S01109-1
0.92
T_T
G_G
66.2
64.9
1.3




(18)




S10293-1
85.1
A_A
G_G
66.7
64.7
2.0




(20)








Variance
8.2


(DAP or DOY)


Mean (DAP)
65.7


Mean (DOY)
176.7






















TABLE 17













Allelic


Pop (♀/♂)

Locus
Position
SNP
μFLDATE (DAP)
Sub















(n)
Locs
(ch)
(cM)




(DAP)





93Y30/92Y22
1
S01481-1
89.53
G_G
T_T
50.3
45.9
4.4


(136)

(19)








Variance
8.2


(DAP or DOY)


Mean (DAP)
47.4


Mean (DOY)
189.4






















TABLE 18













Allelic


Pop (♀/♂)

Locus
Position
SNP
μFLDATE (DAP)
Sub















(n)
Locs
(ch)
(cM)




(DAP)





92Y60/92Y32
1
S01008-1
87.09
C_C
G_G
42.9
41.8
1.1


(174)

(7)




S09955-1
58.82
C_C
T_T
41.6
43.2
1.6




(12)




S07851-1
83.76
A_A
G_G
43.6
42.2
1.4




(14)




S11993-1
99.75
A_A
G_G
42.1
42.1
0.1




(17)




S13844-1
85.55
T_T
G_G
43.6
41.7
1.8




(18)




S08034-1
71.47
C_C
G_G
41.8
42.8
1.0




(20)








Variance
10.3


(DAP or DOY)


Mean (DAP)
42.6


Mean (DOY)
183.6






















TABLE 19













Allelic







μFLDATE (DAP;
Sub


Pop (♀/♂)

Locus
Position
SNP
DOY)
(DAP;















(n)
Locs
(ch)
(cM)




DOY)


















91Y90/92Y22
4
S08942-1
80.59
G_G
C_C
46.1;
50.4;
4.3;


(1253)

(4)



178.8
177.7
1.1




S10631-1
94.2
T_T
C_C
44.8;
50.1;
5.3;




(10)



175.5
180.4
4.9




S01574-1
99.5
C_C
A_A
44.6;
50.1;
5.5;




(10)



174.7
180.5
5.7




S04660-1
106.5
T_T
C_C
48.1;
48.1;
0.0;




(18)



180.1
176.7
3.4




S01481-1
89.53
G_G
T_T
49.3;
46.9;
2.4;




(19)



180.8
176.4
4.4




S11320-1
92.18
A_A
T_T
49.5;
46.9;
2.6;




(19)



180.6
176.7
3.9








Variance (DAP; DOY)
60.3; 51.1


Mean (DAP)
48.4


Mean (DOY)
178.1






















TABLE 20













Allelic


Pop (♀/♂)

Locus
Position
SNP
μFLDATE (DAP)
Sub















(n)
Locs
(ch)
(cM)




(DAP)


















91Y90/92Y22
1
S08942-1
80.59
G_G
C_C
41.0
41.9
0.8


(254)

(4)




S10631-1
94.2
T_T
C_C
38.5
43.3
4.8




(10)




S01574-1
99.5
C_C
A_A
37.9
43.2
5.2




(10)




S04660-1
106.5
T_T
C_C
41.6
40.9
0.7




(18)




S01481-1
89.53
G_G
T_T
43.2
39.3
3.9




(19)




S11320-1
92.18
A_A
T_T
43.0
39.9
3.2




(19)








Variance
18.3


(DAP or DOY)


Mean (DAP)
41.2


Mean (DOY)
182.2






















TABLE 21













Allelic


Pop (♀/♂)

Locus
Position
SNP
μFLDATE (DAP)
Sub















(n)
Locs
(ch)
(cM)




(DAP)


















91Y90/92Y22
1
S08942-1
80.59
G_G
C_C
44.3
48.1
3.8


(337)

(4)




S10631-1
94.2
T_T
C_C
41.3
48.4
7.1




(10)




S01574-1
99.5
C_C
A_A
40.8
48.6
7.8




(10)




S04660-1
106.5
T_T
C_C
48.1
44.3
3.8




(18)




S01481-1
89.53
G_G
T_T
48.4
43.6
4.8




(19)




S11320-1
92.18
A_A
T_T
48.4
43.8
4.6




(19)








Variance
28.3


(DAP or DOY)


Mean (DAP)
46.1


Mean (DOY)
178.1






















TABLE 22













Allelic


Pop (♀/♂)

Locus
Position
SNP
μFLDATE (DAP)
Sub















(n)
Locs
(ch)
(cM)




(DAP)


















91Y90/92Y22
1
S08942-1
80.59
G_G
C_C
59.6
58.2
1.4


(334)

(4)




S10631-1
94.2
T_T
C_C
56.8
59.7
2.9




(10)




501574-1
99.5
C_C
A_A
56.6
59.9
3.2




(10)




S04660-1
106.5
T_T
C_C
59.8
57.8
2.0




(18)




S01481-1
89.53
G_G
T_T
59.1
58.2
1.0




(19)




S11320-1
92.18
A_A
T_T
59.6
57.9
1.7




(19)








Variance
6.5


(DAP or DOY)


Mean (DAP)
58.5


Mean (DOY)
169.5






















TABLE 23













Allelic


Pop (♀/♂)

Locus
Position
SNP
μFLDATE (DAP)
Sub















(n)
Locs
(ch)
(cM)




(DAP)


















91Y90/92Y22
1
S08942-1
80.59
G_G
C_C
45.3
46.8
1.4


(328)

(4)




S10631-1
94.2
T_T
C_C
42.4
48.6
6.2




(10)




S01574-1
99.5
C_C
A_A
41.8
48.6
6.8




(10)




S04660-1
106.5
T_T
C_C
46.1
45.6
0.4




(18)




S01481-1
89.53
G_G
T_T
48.1
44.3
3.7




(19)




S11320-1
92.18
A_A
T_T
47.9
44.6
3.3




(19)








Variance
24.0


(DAP or DOY)


Mean (DAP)
45.9


Mean (DOY)
183.9









Example 7

From the analyses of marker loci associated with reproductive stage in soybean populations and varieties, several markers were developed, tested, and confirmed, as summarized in preceding tables. Any methodology can be deployed to use this information, including but not limited to any one or more of sequencing or marker methods.


In one example, sample tissue, including tissue from soybean leaves or seeds can be screened with the markers using a TAQMAN® PCR assay system (Life Technologies, Grand Island, N.Y., USA).












TAQMAN ® Assay Conditions







Reaction Mixture (Total Volume = 5 μl);











Genomic DNA (dried)
16
ng



DDH20
2.42
μl



Klearkall Mastermix
2.5
μl



Forward primer (100 μM)
0.0375
μl



Reverse primer (100 μM)
0.0375
μl



Probe 1 (100 μM)
0.005
μl



Probe 2 (100 μM)
0.005
μl







Reaction Conditions:











94° C.
10 min
1 cycle



40 cycles of the following:





94° C.
30
sec



60° C.
60
sec





Klearkall Mastermix is available from KBioscience Ltd. (Hoddesdon, UK).






A summary of the alleles for markers associated with reproductive growth phenotype in soybean is provided in Table 24. Marker S17691-001-Q001 detects a deletion event, as reported in the tables “D” represents the deletion.












TABLE 24






Genetic

Allele


Marker
(cM)
Physical (bp)
polymorphism


















S01435-1-Q001
33.61
Gm05:30568085
A/T


S01239-1-A
40.49
Gm08:7464336
A/G


S00780-1-A
76.47
Gm08:15841570
A/G


S06925-1-Q1
28.92
Gm11:4674824
C/T


S09951-1-Q1
32.04
Gm11:5231500
G/T


S00170-1-A
45.37
Gm11:7847341
A/T


S04059-1-A
75.42
Gm14:46138053
A/G


S07851-1-Q1
83.76
Gm14:47331319
A/G


S11659-1-Q1
29.24
Gm04:5754268
C/T


S04279-1-A
45.75
Gm04:8295779
A/T


S02211-1-A
54.48
Gm04:39691731
A/G


S08942-1-Q1
80.59
Gm04:44725098
C/G


S05742-1-Q1
4.88
Gm06:410442
G/T


S09155-1-Q1
69.29
Gm06:11659627
A/G


S02037-1-A
89.19
Gm06:15457913
A/G


S13136-1-Q1
94.84
Gm06:16391391
A/G


S17291-001-K001
96.04
Gm06:16499786
C/T


S13139-1-Q1
97.08
Gm06:16593381
C/T


S17292-001-K001
97.84
Gm06:16670047
A/G


S13146-1-Q1
98.23
Gm06:16804435
A/G


S17293-001-K001
100.29
Gm06:17498270
A/G


S17294-001-K001
101.72
Gm06:18203964
C/T


S17581-001-Q008
102.13
Gm06:19743496
G/A


S17691-001-Q001
102.2
Gm06:19986645
D/I


S17701-001-Q001
102.2
Gm06:20007173
G/C


S03703-1-Q1
102.26
Gm06:20084642
C/T


S17297-001-K001
102.43
Gm06:20501491
A/T


S17298-001-K001
102.71
Gm06:21197184
A/C


S17299-001-K001
102.83
Gm06:21500085
C/T


S17300-001-K001
102.93
Gm06:22501610
C/T


S17301-001-K001
102.97
Gm06:22700011
A/G


S17306-001-K001
103.29
Gm06:25700006
A/G


S17310-001-K001
103.3
Gm06:28501458
G/T


S17311-001-K001
103.3
Gm06:28671736
C/T


S17312-001-K001
103.3
Gm06:29499523
G/T


S17313-001-K001
103.31
Gm06:30203054
C/G


S17316-001-K001
103.31
Gm06:31694650
A/G


S17317-001-K001
103.31
Gm06:32503141
A/C


S17318-001-K001
103.32
Gm06:33196184
C/T


S17322-001-K001
103.37
Gm06:35509548
C/G


S17326-001-K001
103.79
Gm06:37712913
A/C


S17327-001-K001
104
Gm06:38467854
C/T


S17328-001-K001
104.25
Gm06:39168136
C/T


S17329-001-K001
104.38
Gm06:39533730
G/T


S10746-1-Q1
104.94
Gm06:40766974
A/G


S17331-001-K001
105.8
Gm06:41476201
C/T


S17332-001-K001
106.19
Gm06:42450296
A/T


S17337-001-K001
113.1
Gm06:47500976
C/T


S13093-1-Q1
113.11
Gm06:47521797
C/T


S12211-1-Q1
116.04
Gm06:48475049
C/T


S04555-1-Q1
132.43
Gm06:49978151
A/G


S08519-1-Q1
8.96
Gm01:759365
C/G


S12876-1-Q1
29.48
Gm02:4893148
C/G


S05937-1-Q1
48.44
Gm02:9714426
A/C


S08575-1-Q1
58.78
Gm02:11502780
A/G


S08669-1-Q1
76.53
Gm02:15446229
C/T


S11212-1-Q1
83.28
Gm02:33158449
G/T


S00543-1-A
103.71
Gm02:45776142
G/T


S01452-1-A
73.34
Gm17:16136646
C/T


S11993-1-Q2
99.75
Gm17:39804515
C/T


S13446-1-Q1
92.65
Gm15:50237460
C/T


S00252-1-A
0
Gm13:235439
A/T


S04060-1-A
36.9
Gm13:20365663
C/G


S02664-1-A
36.96
Gm13:20744030
A/G


S00281-1-A
73.16
Gm13:35174140
C/T


S01109-1-Q002
0.92
Gm18:305113
A/C


S13844-1-Q1
85.55
Gm18:58086324
G/T


S05058-1-Q1
105.85
Gm18:61591142
A/G


S04660-1-A
106.5
Gm18:61831970
C/T


S09955-1-Q1
58.82
Gm12:11512115
C/T


S08034-1-Q1
71.47
Gm20:39051858
C/G


S10293-1-Q1
85.1
Gm20:41216234
A/G


S03813-1-A
30.57
Gm16:4678569
A/G


S02042-1-A
85.53
Gm16:36524407
A/G


S16601-001-Q001
87.73
Gm19:47535046
A/C


S01481-1-A
89.53
Gm19:47826727
G/T


S11309-1-Q1
91.1
Gm19:48252040
A/T


S11320-1-Q1
92.18
Gm19:48638646
A/T


S04040-1-A
100.89
Gm19:50222676
G/T


S00863-1-A
8.09
Gm07:1141099
A/T


S17151-001-K001
11.64
Gm07:1830296
A/G


S17153-001-K001
12.12
Gm07:1923026
A/C


S17154-001-K001
13.97
Gm07:2179883
C/T


S17156-001-K001
15.53
Gm07:2310058
A/G


S17159-001-K001
18.14
Gm07:2679749
A/G


S08590-1-Q1
19.96
Gm07:3009018
A/G


S17242-001-K001
31.68
Gm07:4282676
A/C


S17166-001-Q006
31.87
Gm07:4319368
C/T


S17167-001-Q007
31.99
Gm07:4342479
A/G


S08539-1-Q1
36.74
Gm07:5576650
A/G


S17178-001-K001
40.59
Gm07:6288899
C/T


S17179-001-K001
40.83
Gm07:6340656
A/G


S17180-001-K001
40.85
Gm07:6347675
A/C


S17181-001-K001
41.66
Gm07:6614649
C/G


S17182-001-K001
41.66
Gm07:6616695
A/T


S17183-001-K001
41.69
Gm07:6623333
G/T


S02780-1-Q1
41.85
Gm07:6671535
A/G


S12107-1-Q1
43.16
Gm07:7096376
G/T


S03624-1-Q001
45.02
Gm07:7774056
A/G


S01953-1-A
48.13
Gm07:8674220
C/T


S00111-1-A
79.14
Gm07:35590550
A/G


S04180-1-A
86.05
Gm07:36459825
A/G


S01008-1-B
87.09
Gm07:36638366
C/G


S12862-1-Q1
53.56
Gm03:38491492
C/T


S12867-1-Q002
58.35
Gm03:39583405
A/G


S04966-1-Q1
92.16
Gm03:46209939
A/T


S10631-1-Q1
94.2
Gm10:43974548
C/T


S01574-1-A
99.5
Gm10:44725777
A/C


S16594-001-Q010
99.55
Gm10:44732850
A/T


S02777-1-A
129.25
Gm10:50495033
A/G









Table 25 summarizes exemplary allele polymorphisms and further associates them with early or late phenotype for time to flowering and/or maturity. In some instances, the allele polymorphisms in Table 25 represent the complement of calls provided in Table 24.












TABLE 25








Allele



Genetic

polymorphism


Marker
(cM)
Physical (bp)
(Early/Late)


















S17581-001-Q008
102.13
Gm06:19743496
T/C


S17691-001-Q001
102.2
Gm06:19986645
D/I


S03703-1-Q1
102.26
Gm06:20084642
T/C


S16601-001-Q001
87.73
Gm19:47535046
C/A


S01481-1-A
89.53
Gm19:47826727
T/G


S17166-001-Q006
31.87
Gm07:4319368
T/C


S17167-001-Q007
31.99
Gm07:4342479
A/G


S01574-1-A
99.5
Gm10:44725777
A/C


S16594-001-Q010
99.55
Gm10:44732850
A/T










A summary of exemplary marker sequences is provided in Tables 26 and 27.














TABLE 26







SEQ

SEQ
Region


Locus
Primers (FW/REV)
ID
Probes
ID
SEQ ID NO:




















SO1435-1
GCCTCTACTAGAA
  1
6FAM-cagtacTttcgtcaataa
  3
  5



TCCGTGCATAC







GGAAGTGCTCTTG
  2
VIC-cagtacAttcgtcaataa
  4




GAACACAAT









SO1239-1
tgagaattgatgctcattt
  6
6FAM-acttgttaAcagcattc
  8
 10



aggaa







ccctttgtttcattttccctct
  7
VIC-ttgttaGcagcattc
  9






S00780-1
TGTAGTCCCATTG
 11
6FAM-
 13
 15



CCATATGAGGC

CTCGTTTTAAaCCTTCT





CATTAGGGGTTTC
 12
VIC-
 14




ACCACTGTCCA

CTCGTTTTAAgCCTTC







S06925-1
tgaggggaaattaagaaattgg
 16
6FAM-caccgTgatgctatt
 18
 20



tgaggggaaattaagaaattgg
 17
VIC-catcattttcaccgCgat
 19






S09951-1
tgccgtaagtaacacacacaaa
 21
6FAM-cacttgattAaattcctgt
 23
 25



caagagcacaccatcacctg
 22
VIC-cttgattCaattcct
 24






S00170-1
GCTGATACCGTTT
 26
6FAM-
 28
 30



TGGTGTTTCCA

TGCATCTGCaGACGT





TGCAACATCCCGT
 27
VIC-
 29




GAAAGGATT

TGCATCTGCtGACGTG







S04059-1
caccatcagcaagctttgag
 31
6FAM-cctctgagttAgcctt
 33
 35



gaagggcacttcaacagagc
 32
VIC-ctctgagttGgcctt
 34






S07851-1
cttttccttggacggtacga
 36
6FAM-CactgTacaaatcaa
 38
 40



cgtgtgaatggaagaaagca
 37
VIC-cactgCacaaatc
 39






S11659-1
tctgtcccaatgctcaatca
 41
6FAM-catgatgAcaacctc
 43
 45



cttggaggggaaggtctagc
 42
VIC-atgatgGcaacctcta
 44






S04279-1
aacaactccctctggtgtcc
 46
6FAM-atctccttcTcttcctt
 48
 50



cataggggagtagattatatg
 47
VIC-tctccttcActtcct
 49




gcttt









S02211-1
cctctctcattctcgttcacgat
 51
6FAM-caatttaTcgtaacatcag
 53
 55



gtaa







cttttggttcatttaaattccat
 52
VIC-caatttaCcgtaacatc
 54




ttgct









S08942-1
cgtgatcctacgcctctctt
 56
6FAM-tcacgatcgCagtct
 58
 60



aggtcatgtccacgacgaa
 57
VIC-tcacgatcgGagtct
 59






S05742-1
acgacgtcaagaagttcctttc
 61
6FAM-tccgaaatcAtaatc
 63
 65



ggccgaactcggttctaatc
 62
VIC-ccgaaatcCtaatcc
 64






S09155-1
ctattgccgagaagctcgat
 66
6FAM-caacgtttTgtcatca
 68
 70



tcatcctccgtgagatagcc
 67
VIC-caacgtttCgtcatca
 69






S02037-1
tccatcaacaaagccctttta
 71
6FAM-aatgcttcAagatca
 73
 75



aaaatatctagttgagttggac
 72
VIC-atgcttcGagatcaa
 74




caaga









S13136-1
cgtgcgccctatcagtctat
 76
6FAM-accaccaTgtcgc
 78
 80



gagttgttgcttgcattgga
 77
VIC-accaccaCgtcgc
 79






S17291-001
N/A

GAAGGTGACCAAGTTC
 82
 84





ATGCTTTTCCTTTTGCT







ATTTTTGACTCGG





AGGGCAATAGTTT
 81
GAAGGTCGGAGTCAAC
 83




GAAGATTTGGGAT

GGATTAACTTTTCCTTT





GAA

TGCTATTTTTGACTCGA







S13139-1
aatctttccccgtttcttgg
 85
6FAM-agatcccAttcatg
 87
 89



ttgcagaggcaaatagagctt
 86
VIC-tatatagatcccGttcatg
 88






S17292-001
N/A

GAAGGTGACCAAGTTC
 91
 93





ATGCTCAATGTAATCA







TTTAAGTACATTATCCC







ACA





CRGGACACATTTT
 90
GAAGGTCGGAGTCAAC
 92




TAGCTTACGTAGT

GGATTAATGTAATCAT





TAAA

TTAAGTACATTATCCC







ACG







S13146-1
gtcatcatagccgcaatcaa
 94
6FAM-aagttcatcAaagccat
 96
 98



tccaaatctttgttgagtcgtg
 95
VIC-aagttcatcGaagcca
 97






S17293-001
N/A

GAAGGTGACCAAGTTC
100
102





ATGCTGTGTTTTAACTC







ACTCAGTTTCGAATGT





GCCTAAAGACCA
 99
GAAGGTCGGAGTCAAC
101




ACAATTTGTAAGA

GGATTGTTTTAACTCA





GTAAA

CTCAGTTTCGAATGC







S17294-001
N/A

GAAGGTGACCAAGTTC
104
106





ATGCTAGAAAGTAAGG







AAAATTTCTAATTTTCA







TTGC





CACACAGGAGAC
103
GAAGGTCGGAGTCAAC
105




AAATCAYGTCGAT

GGATTGAGAAAGTAAG





AA

GAAAATTTCTAATTTTC







ATTGT







S17581-001
ACGAATGCAAAA
107
6FAM-ttgaggacgtgtagTtg
109
111



TTGGAAATG







TCTTCCTTCGTCC
108
VIC-ttgaggacgtgtagCtgt
110




GTGTCA









S17691-001
CCTCTTTTCCTTG
112
VIC-cttctcatcattgtggac
114
115



GCTATGTGAT







CAATCTTAACATG
113
N/A





GTTCCAAAACA









S17701-001
GACCCTATTCATC
116
6FAM-tggatttcCtcttctt
118
120



TCTTCCAACA







GATGTCCTAAAGT
117
VIC-atggtggatttcGtc
119




TAGAGGCTTCG









S03703-1
cccaaggactaaccaggattc
121
6FAM-acacaagTcgctacc
123
125



tttattaaatggagtgagaagg
122
VIC-cacaagCcgctacc
124




tgtc









S17297-001
N/A

GAAGGTGACCAAGTTC
127
129





ATGCTCACAACACTAT







TTAATTTATTTCTGAAA







AGCAA





GTAAGAAAGTTTT
126
GAAGGTCGGAGTCAAC
128




TTTGTGTGTAAAC

GGATTCACAACACTAT





TGAT

TTAATTTATTTCTGAAA







AGCAT







S17298-001
N/A

GAAGGTGACCAAGTTC
131
133





ATGCTCTCGCTTAGAG







GAAGAACGTGTA





CTCGCGCTTGAAG
130
GAAGGTCGGAGTCAAC
132




GCATCAATCTT

GGATTCTCGCTTAGAG







GAAGAACGTGTC







S17299-001
N/A

GAAGGTGACCAAGTTC
135
137





ATGCTGACTACCACCA







CGCGTCATAG





GGCCTTTTACATC
134
GAAGGTCGGAGTCAAC
136




GGTTCTAATGACT

GGATTGGACTACCACC





TTT

ACGCGTCATAA







S17300-001
N/A

GAAGGTGACCAAGTTC
139
141





ATGCTCATATAAGTAG







AGATGTCAAATTTTCG







AC





TTGTGAAGGACAC
138
GAAGGTCGGAGTCAAC
140




TCAACTATTCCAC

GGATTAATCATATAAG





TA

TAGAGATGTCAAATTT







TCGAT







S17301-001
N/A

GAAGGTGACCAAGTTC
143
145





ATGCTATACTTTATCCT







GAGTATTTCTCATGAT







CT





CCCTATCACCTGT
142
GAAGGTCGGAGTCAAC
144




CATATACCCCTT

GGATTCTTTATCCTGA







GTATTTCTCATGATCC







S17306-001
N/A

GAAGGTGACCAAGTTC
147
149





ATGCTATTTTTAAGAA







ACATGTTTTTAGGAAA







CTATA





CCCTCATCCTTCT
146
GAAGGTCGGAGTCAAC
148




CCATGGGATTTT

GGATTATTTTTAAGAA







ACATGTTTTTAGGAAA







CTATG







S17310-001
N/A

GAAGGTGACCAAGTTC
151
153





ATGCTGAAAATACGCA







AGGAGCTCTGTTC





TCATTGATGGTGC
150
GAAGGTCGGAGTCAAC
152




CTCTTTATTGCAC

GGATTCGAAAATACGC





TTT

AAGGAGCTCTGTTA







S17311-001
N/A

GAAGGTGACCAAGTTC
155
157





ATGCTAAGTATCCTAT







TACAACCATCAACGG





CGCAGGAGTCATG
154
GAAGGTCGGAGTCAAC
156




GATCTTGTCAAT

GGATTGATAAGTATCC







TATTACAACCATCAAC







GA







S17312-001
N/A

GAAGGTGACCAAGTTC
159
161





ATGCTAAGAAAGAAAA







TCACGCAACATAAATG







TTG





GAAGGTCGGAGTCAAC
158
GAAGACCAACGC
160




GTTCTCTACTTGTT

GGATTAAAAAGAAAG







AAAATCACGCAACATA







AATGTTT







S17313-001
N/A

GAAGGTGACCAAGTTC
163
165





ATGCTGGTACGGCCTC







GATCACACC





AGTCCTTTGAAGA
162
GAAGGTCGGAGTCAAC
164




GGAGGACGTGTA

GGATTGGTACGGCCTC







GATCACACG







S17316-001
N/A

GAAGGTGACCAAGTTC
167
169





ATGCTAAGAGCTTCCA







TTTTCGATTACGAA





TCGGATGTTCGAT
166
GAAGGTCGGAGTCAAC
168




TGTGTCCCATAAT

GGATTAAGAGCTTCCA





ATA

TTTTCGATTACGAG







S17317-001
N/A

GAAGGTGACCAAGTTC
171
173





ATGCTTGAGAAAATCC







CTCCTCCATTTTA





GAGTTGGTGAACT
170
GAAGGTCGGAGTCAAC
172




AATTTTCCCTGTT

GGATTCTTGAGAAAAT





GAT

CCCTCCTCCATTTTC







S17318-001
N/A

GAAGGTGACCAAGTTC
175
177





ATGCTAACAGGAAGGG







AAAACAAAGTGTCG





CCTTGATGCTCTA
174
GAAGGTCGGAGTCAAC
176




TTTCTTTTCTCCCAA

GGATTGAACAGGAAGG







GAAAACAAAGTGTCA







S17322-001
N/A

GAAGGTGACCAAGTTC
179
181





ATGCTATTTTGGGTTTT







TTTTTGTAAAAACAGA







AAGTC





ATGTTGTTTGTGT
178
GAAGGTCGGAGTCAAC
180




AGATTAACATCGG

GGATTTTGGGTTTTTTT





CTTT

TTGTAAAAACAGAAAG







TG







S17326-001
N/A

GAAGGTGACCAAGTTC
183
185





ATGCTAAAATAGCTGA







AATTGCATTTATGGTG







CAA





CACAACACTGCTT
182
GAAGGTCGGAGTCAAC
184




ACAGCAAATTGCA

GGATTATAGCTGAAAT





TAA

TGCATTTATGGTGCAC







S17327-001
N/A

GAAGGTGACCAAGTTC
187
189





ATGCTAAGTAGCAGTT







AAAGAGGACTGGTC





GACCTCATATGAA
186
GAAGGTCGGAGTCAAC
188




AGAATATGTCCAA

GGATTAAAAGTAGCAG





TCTT

TTAAAGAGGACTGGTT







S17328-001
N/A

GAAGGTGACCAAGTTC
191
193





ATGCTATCCACCTTGCT







TTACAATGCATCC





TTCTACAAGGCGA
190
GAAGGTCGGAGTCAAC
192




AGGACCATTTTAT

GGATTCATCCACCTTG





CAT

CTTTACAATGCATCT







S17329-001
N/A

GAAGGTGACCAAGTTC
195
197





ATGCTCCTTTGCTTCTT







GAAGATCATGGC





CTCCAATCATCTT
194
GAAGGTCGGAGTCAAC
196




TCTTCCTTCTCCAT

GGATTGTCCTTTGCTTC





TT

TTGAAGATCATGGA







S10746-1
attgggatcctgatcaacca
198
6FAM-caacaaTgagcctaat
200
202



cccaggcattggtgtttaag
199
VIC-caacaaCgagcctaa
201






S17331-001
N/A

GAAGGTGACCAAGTTC
204
206





ATGCTGGGAAATGAAG







ACAATTAATAACATCG







TG





CGGTTGTCTCTGS
203
GAAGGTCGGAGTCAAC
205




TCTTCTCAGATT

GGATTGGGAAATGAAG







ACAATTAATAACATCG







TA







S17332-001
N/A

GAAGGTGACCAAGTTC
208
210





ATGCTCAAACCTTAGG







ATAGATGACTTCTTGTT





AACCTTACCCTAA
207
GAAGGTCGGAGTCAAC
209




CAACATACAACTA

GGATTCAAACCTTAGG





AGAA

ATAGATGACTTCTTGT







A







S17337-001
N/A

GAAGGTGACCAAGTTC
212
214





ATGCTTGATTGATAAT







TTTTTTTATTATGTACA







TGAC





CCTATTGACCGTG
211
GAAGGTCGGAGTCAAC
213




ATATTAATTAAGA

GGATTGATTGATAATT





CTTT

TTTTTTATTATGTACAT







GAT







S13093-1
catcgagtctccagcaagtg
215
6FAM-attggcacttTtaac
217
219



tgagattcacgaagtgggttc
216
VIC-cacttCtaacatcaatg
218






S12211-1
gaccagaggtagtagattcca
220
6FAM-ctgcaaTgccatact
222
224



aaagt







tgcattaagctcactcagttat
221
VIC-ctgcaaCgccatac
223




gtatta









S04555-1
AAATCGCCACTAG
225
6FAM-cagcacTggatctt
227
229



GCTTGC







CTAGGGTTCTGCA
226
VIC-cagcacCggatct
228




GTTCATCG









S08519-1
acagttcattggccttgaca
230
6FAM-tcagctctgCcaatag
232
234



ggcttcacacttgaggaggt
231
VIC-tcagctctgGcaatag
233






S12876-1
cccgccacaactcttgttat
235
6FAM-cacgcttcCaatct
237
239



gggaggtgtttggcaatatc
236
VIC-aagcacgcttcGaat
238






S05937-1
gcaaaattaaggagaggacc
240
6FAM-catcAgctatgaccatg
242
244



ttg







ctctcttgcaaaatgcacca
241
VIC-catcCgctatgacc
243






S08575-1
aactatgcacttatgctcatgg
245
6FAM-acttcttgcTgaatct
247
249



taa







tggatccaaacatgcgtcta
246
VIC-aacttcttgcCgaatc
248






S08669-1
gtggtgggttggtttttgac
250
6FAM-tcttatgggacatTtc
252
254



tccaatattctcagcctcttcag
251
VIC-tcttatgggacatCtc
253






S11212-1
ctccaagaccttgccttcct
255
6FAM-ccccgTttacttcc
257
259



gatcccaaatgagattaggag
256
VIC-cccgGttacttcc
258




act









S00543-1
GCATGCAATATGA
260
6FAM-
262
264



ACAACTTGACAAC

CACTTATCCAtTGGTTC





CCCTTTTCACATG
261
VIC-
263




AGTATGCATGTC

CACTTATCCAgTGGTTC







S01452-1
caaacaatgaatgatgaatcc
265
6FAM-aagcaaTtggtcacaac
267
269



aa







gcattttgagagccaccaata
266
VIC-aagcaaCtggtcacaa
268






S11993-1
atttgtgagtgctgcggatt
270
6FAM-ccagcacaatTgat
272
274



tgaacatgaacgtgctaaacg
271
VIC-ccagcacaatCga
273






S13446-1
atcccaggcttctaatgtgg
275
6FAM-cccaccacActca
277
279



ggctgcgctactttcgtact
276
VIC-cccaccacGctc
278






S00252-1
ATGCATGCAGCTG
280
6FAM-
282
284



GGCAATAAT

CGGTCTCtTGGTACTAT





GATGCCACCGATG
281
VIC-
283




AAGAAGCAC

CGGTCTCaTGGTACTAT







S04060-1
ctcttgcagcggattcagtc
285
6FAM-cgacttcaCtcacc
287
289



ctcgccgatttcctcatct
286
VIC-acttcaGtcaccgagat
288






S02664-1
GTTTTGGTTTCCTT
290
6FAM-
292
294



AGGATGAACT

CTTTCCATCTTaTTCG





ATGTGCAGAGGTC
291
VIC-
293




CCATTCT

CTTTCCATCTTgTTCG







S00281-1
GGCCGAGCAAAC
295
6FAM-
297
299



AACAAGAAAA

CATAGTGaACCTCTC





TCCAAACTCCTCA
296
VIC-
298




CAAGCCTTCA

CCATAGTGgACCTCT







S01109-1
AGTAGTACTTCAT
300
6FAM-ccaccaccTctgaaa
302
304



CCCTGACACCA







AGGAGTATAACCT
301
VIC-accaccGctgaaaa
303




TGGTTTAAAGCTG









S13844-1
tgatggaaagccgaaaaaga
305
6FAM-tcccttaAgtagtcttt
307
309



ctgagcagccctcatatgttt
306
VIC-atcccttaCgtagtctt
308






S05058-1
aaatgatacgcaattttgactc
310
6FAM-atttgcAgttcattgtac
312
314



ag







tgtgttatgcctaccaatcaaa
311
VIC-atttgcGgttcattgta
313




ct









S04660-1
tcggccttcgtcatagaagt
315
6FAM-catctacAtccttcc
317
319



tccttcaatttccccatatcc
316
VIC-catctacGtccttcc
318






S09955-1
gatcgggatgatttttggaa
320
6FAM-cacacttgaTactcca
322
324



cttttcatgatccaaccagaca
321
VIC-cacacttgaCactcca
323






S08034-1
tcctttctaaaccctgctgtg
325
6FAM-caatccaCaggagcat
327
329



ttggtctctttcttagtttcat
326
VIC-caatccaGaggagcat
328




ctca









S10293-1
gggcatccagactttatctatg
330
6FAM-tttcTacagtcgatctc
332
334



a







acgatttaatgcacgacgagt
331
VIC-tttcCacagtcgatc
333






S03813-1
atttagcgtacatgtcaactaa
335
6FAM-cctaacTagaatttc
337
339



cga







tgcaaatgctttgaatctgg
336
VIC-cctaacCagaattt
338






S02042-1
gccatcttccatttctgcaacc
340
6FAM-ctccttgaggTtttc
342
344



agaagcgttggctatgcacg
341
VIC-ctccttgaggCttt
343




ag









S16601-001
TTCACACATGTAC
345
6FAM-cagcttcaAaacatt
347
349



TAGGCTTTGG







CCACCTTTCACAC
346
VIC-cagateaCaacatt
348




AGCTTGA









S01481-1
tttcaggctggctgtttctt
350
6FAM-cagtcacTttatggtcc
352
354



actctgggtgccaaagtcaa
351
VIC-cagtcacGttatggtc
353






S11309-1
gaccggtctgggacaatg
355
6FAM-atttcAtggttctcg
357
359



cacttaatcaagttgcccaag
356
VIC-caatttcTtggttctc
358




aa









S11320-1
gcccaaagttcaaaagcaat
360
6FAM-aaacaAacgatctcaac
362
364



tcggatgcgaatatgaagtg
361
VIC-aaacaTacgatctcaac
363






S04040-1
cccagctgctgaggagaa
365
6FAM-aaggtttcccTctagtg
367
369



ggattgaaaaacaattggag
366
VIC-aaggtttcccGctagt
368




ga









S00863-1
CAGCATCACACAC
370
6FAM-
373
372



GCTATAAGACCA

TCAACACTCACAaGAT





CATGACTTTTTCA
371
VIC-
373




TACTAAGTTGGAC

TCAACACTCACAtGAT





ACCA









S17151-001
N/A

GAAGGTGACCAAGTTC
376
378





ATGCTGGAAAAAGAAG







AAAAAATTCCACTAAT







GTGA





CRAGTCTCAGTCA
375
GAAGGTCGGAGTCAAC
377




ATCTGTGACTCTTT

GGATTGAAAAAGAAG







AAAAAATTCCACTAAT







GTGG







S17153-001
N/A

GAAGGTGACCAAGTTC
380
382





ATGCTGCCCTTAATTG







CTCAAATTTCCACTA





CTGACGTGGAGTG
379
GAAGGTCGGAGTCAAC
381




TGACAATGCAAT

GGATTGCCCTTAATTG







CTCAAATTTCCACTC







S17154-001
N/A

GAAGGTGACCAAGTTC
384
386





ATGCTAAATGTTTTTCT







CGTGCTTGATTGC





GAACCACATTTCT
383
GAAGGTCGGAGTCAAC
385




AAGTTAAAGCGA

GGATTMTAAATGTTTT





CTTAA

TCTCGTGCTTGATTGT







S17156-001
N/A

GAAGGTGACCAAGTTC
388
390





ATGCTCAACAACTAAT







TGACCCTGCAGG





CCCTTCCAATGAA
387
GAAGGTCGGAGTCAAC
389




ATAAAGCACTTGG

GGATTAACAACTAATT





AT

GACCCTGCAGG







S17159-001
N/A

GAAGGTGACCAAGTTC
392
394





ATGCTGGAAGACACGT







GGTCCACCT





TYCAGCCCAACAA
391
GAAGGTCGGAGTCAAC
393




ATCTCAAATGGGA

GGATTGGAAGACACGT





T

GGTCCACCC







S08590-1
cccttgaaagacgaccaaaa
395
6FAM-cagatcAgttgtcattt
397
399



acttatccgcagccgtacac
396
VIC-cagatcGgttgtcatt
398






S17242-001
N/A

GAAGGTGACCAAGTTC
401
403





ATGCTTGGGGAATAAA







CATCGTGCTTTATAATT







A





CAGAGTGCCTTGA
400
GAAGGTCGGAGTCAAC
402




CGTAGTGACATA

GGATTGGGGAATAAAC







ATCGTGCTTTATAATTC







S17166-001
CCGCAAACTGTAG
404
6FAM-caagacaTgcagcaga
406
408



TACAAATCAA







GGGTTGTAGAAA
405
VIC-caagacaCgcagcag
407




GTAACTTGGGAAG









S17167-001
GTTTTCACATGTA
409
6FAM-taactgtgctTttttaaaa
411
413



ATTTTCAAAACAA







A







TGTCAGTGATGGT
410
VIC-taactgtgctCttttaa
412




GAAAATGATAG









S08539-1
cacttctgtaaagagtcaaca
414
6FAM-agagctttgaagaAtt
416
418



agagg







tgaatacaccttgagtccaaa
415
VIC-tagagctttgaagaGttt
417




gaa









S17178-001
N/A

GAAGGTGACCAAGTTC
420
422





ATGCTACCACCATTCG







GCTAAAGTCAATC





GAATTGATATTTC
419
GAAGGTCGGAGTCAAC
421




AACCATGGATGCA

GGATTCACCACCATTC





TCAT

GGCTAAAGTCAATT







S17179-001
N/A

GAAGGTGACCAAGTTC
424
426





ATGCTAAAAATATTTT







CTAACTCTAAAAGCAA







ACTGGA





CCAGTTTACTTAG
423
GAAGGTCGGAGTCAAC
425




TTAGGTGCCCAAA

GGATTAATATTTTCTA





TTA

ACTCTAAAAGCAAACT







GGG







S17180-001
N/A

GAAGGTGACCAAGTTC
428
430





ATGCTGAAATGATAAA







ACCTAGTAAGCTTTCA







GTT





CAGTGCATTTCCC
427
GAAGGTCGGAGTCAAC
429




ATAGAAAGTTATT

GGATTAAATGATAAAA





TGTT

CCTAGTAAGCTTTCAG







TG







S17181-001
N/A

GAAGGTGACCAAGTTC
432
434





ATGCTGTAATCACTAA







AATTACACACTTAAAT







TAC





GGGCCAATTTTGT
431
GAAGGTCGGAGTCAAC
433




ATTACATCTTTCC

GGATTGTAATCACTAA





AGAA

AATTACACACTTAAAT







TAG







S17182-001
N/A

GAAGGTGACCAAGTTC
436
438





ATGCTGTAATAGGTCA







TAAATGTTGATGGAAT







ATTCT





CTCAGATATACAT
435
GAAGGTCGGAGTCAAC
437




AGATGAGAGGTG

GGATTGTAATAGGTCA





ACAA

TAAATGTTGATGGAAT







ATTCA







S17183-001
N/A

GAAGGTGACCAAGTTC
440
442





ATGCTGATCGTGCGGT







GGATGTGAAG





ATRCGTGGCCACC
439
GAAGGTCGGAGTCAAC
441




ATTTACCTGTATTA

GGATTTGATCGTGCGG







TGGATGTGAAT







S02780-1
attttccagactattgccttta
443
6FAM-actctggAtaacctg
445
447



cctt







agaatacttgactgtataggat
444
VIC-actctggGtaacctg
446




gcaaac









S12107-1
cctcctcctcaaactgttgc
448
6FAM-caatcggctccAtc
450
452



gtggcaaagtgcgaacaata
449
VIC-caatcggctccCtc
451






S03624-1
TAGAATAATCACT
453
6FAM-agcatcattagtgTcacat
455
457



ACAATAACAGAT







GATCTTG







CATTACATGCATA
454
VIC-agcatcattagtgCcacat
456




ACCTCTCATCA









S01953-1
GGAGTGTACTTCT
458
6FAM-ttccttctTcacttgat
460
462



TTATGAAAAACGG







TGA







GTGTCGGGCCACT
459
VIC-tccttctCcacttgat
461




AATTTTGGAGCCT







TT









S00111-1
ACTGATTCAAGAT
463
6FAM-
465
467



ACGATCAAGTTTC

CCTAATTgCGTTTACC





CTTATCATTT







TTGGTTTTGGTGA
464
VIC-





ATAACTGGAAAA

CCTAATTaCGTTTACCC
466




GTGTGT









S04180-1
cattcaccatttatgaattttg
468
6FAM-cattgacActgttcct
470
472



atcc







aaatgaaaacccagaataatg
469
VIC-cattgacGctgttcc
471




tgc









S01008-1
atcccttgctttaactagattgt
473
6FAM-caattCctctgtaagtc
475
477



tattcatgt







atgcactggattgtgaagaga
474
VIC-caattGctctgtaagtc
476




atataagc









S12862-1
ggttcgagggttgtgtatcc
478
6FAM-catttatcAgattcgatc
480
482



gattgcacccatatcgacct
479
VIC-acatttatcGgattcga
481






S12867-1
CACACGTTAAAAC
483
6FAM-accacatgtaactAtt
485
487



TCTTGTTCAGC







TCCTAGAATAAAT
484
VIC-accacatgtaactGtt
486




ATGATCCCTTCTC







AT









S04966-1
atacccagcagcagtcacca
488
6FAM-catttgggtatgcTgtg
490
492



ttgtgtggccttacctttca
489
VIC-catttgggtatgcAgtg
491






S10631-1
tttgatgcaagatctgtcgaa
493
6FAM-ccaactcAgatctt
495
497



agccggtttacttggaatgtt
494
VIC-ccaactcGgatctt
496






S01574-1
tgaagcaactaggaaagctg
498
6FAM-aaggcatcTttatctc
500
502



aa







acgacccaatttgcttgtct
499
VIC-aaggcatcGttatct
501






S16594-001
CACTACAGCCTCC
503
6FAM-catgtcttatgTaaatat
505
507



CGTGCT







CCTCTGAAGAATA
504
VIC-catgtcttatgAaaata
506




GCTTCCACTG









S02777-1
acatatcgaagtgcaaatacgg
508
6FAM-ttgttatcttccActttag
510
512



tcgacttgaaatggaaactgaa
509
VIC-ttgttatcttccGcttta
511



















Table 26 provides the genomic region comprising the


polymorphism associated with a reproductive growth


phenotype in soybean.









Locus
Genomic region
SEQ ID





S01435-1
TGTTGATGAGGGGGCCATATACCATCAGGAACATGCCAATGC
  5



TTATTAGAGAATGGAAACYTGATTTTAACCTGAAGCAAGACA




TGCTACGGACTCTTCCTATTTGGGTACAACTCCCTCAACTGCC




ATTGCATTTATGGGGTGGAAAAAGCCTAGGCAAAATTGGAAG




TGCTCTTGGAACACAATTGGTTATTGACGAA[A/T]GTACTGCA




AATAAACTTCGGGTCTCGTATGCACDGATTCTAGTAGAGGCA




GATGTCACGCCAGAACTGAGAAACGAAATTACTATAAAGGAC




AATGAGGGGCGGAGGATCACTCAAAAAKTCGAATATGAGTG




GAAACCAATGTTTTGTGATAAGTGCCAAAAGTTTGGCCAYAA




ATGTGGTGAAGTAAAGCCAAGGAAG






S01239-1
ATTTTTATAGTTATATTTCTCAAAATTATATTCAAGAATCAGA
 10



AAACAGAAAAATAATGTTATAAATTTSATGCATATCTTCCATT




AAAACAGGTTAGATCCATAGTTGTTGCTAGTCAATAACCTTAT




GGTTAGCAGTGCCATCCCTTDCAACAATGAGAATTGATGCTC




ATTTAGGAAGTCAACACTTGTACTTGTTA[A/G]CAGCATTCYG




GAAAGAAACAAGAGGGAAAATGAAACAAAGGGTGTTTTTCA




AAAAATATTGGCTGATCATGATTTTTTGATCATGATTTTTACR




TGTTCTGTTAAGACTAAAGTACATTAATTTCACTGTGTTGGCA




CCTGAATAGTTATTGAGCATTATGTGAGGTGGCAATTAAGTA




TACAGTTTTGTCAYGACTTTT






S00780-1
ACAATGAAGAAGGAAGTACTTTGCAGGTATTGCAATAAGAAG
 15



TTCAGTTGTTACCAAGCCTTGGGTGGACATCATAATTCTCACA




AGGCGGAAAGAGCAGCAGAAATACATAGCAAGGCTTCTGCTT




GTTACAAAACATATGGTTATGGGTTTTGTGGCAAAACATTAG




GGGTTTCACCACTGTCCATGACTCGTTTTAA[A/G]CCTTCTATT




ATTGTTGGCCTCATATGGCAATGGGACTACATGACTGACTATC




ATCATGTTGCATGGCCAAGGCACCAGATTTTGAATCCTCCTCA




ACCCACCATGTATCAATTCATGGCTGAAGGGAGTGGATCTCA




CCAACACCACAAATTATATTARCCTTTGAGTTCTTCTATAGTC




AGAGGAGGACCAACAAATTC






S06925-1
AGTCCATTGAACATTCCTCATATTGTTCCATGCACTTTGTGTG
 20



AAACTGCAGTGGCACCGGTTCTTTTGGGGGAAGTAGATCAGC




TTCAAAGATTGCAGCTTTTCATTTTGCTTATTTTCCATATTTTA




TGAGGGGAAATTAAGAAATTGGTTGTTAAATAGAGAAACAA




AGATTTGTCTCCACTGCATCATTTTCACCG[C/T]GATGCTATTG




CTATATCTTGTTCCATATTAGCTGTGAATCCTTGTCACTTATG




GGGGATATTAGGCTTACCCGGGTAGAGCAAGGTCAAACAAA




GATTAGAAATGTTCCAATTGCTGTCACCCCTGAAGGATTTTGG




TGTTGCCMTACTCCTGTTGGGTTTCAGAAAAGCCTCAAACCTC




AAAACCCTTTGAATAAACTC






S09951-1
CGAGCCCTTGCTATAAAAACGGGACTGAAGCTCATTTCAACG
 25



GGGAAAATTCCCAAATCTGTTTTAGCTCGCCCCAACTGGCCAT




ACACGTTGCTCCCACAGCTCAAGAGCACACCATCACCTGATA




TAGCATATAGTAAAAAACTTAAGTTTTGAAAACGCATGCATA




CATCTTAATGATCTAACTTGCTACAGGAATT[G/T]AATCAAGT




GTTTGGTGTTTGTGTGTGTTACTTACGGCAAAGAATCAGAGA




GTGATCAAGGCCACACGCAATGTCCAGAACATGAACACCGTG




TAATTCTTCGACCAAGCGGGGYTCCCATACTATTGGCATGTTA




TTCTCCTGYTCCATTCCTTGCAAGACTCTCTTTTCAGCAGCTTC




AWGGTCCKCGAGAGCTGTGAG






S00170-1
TSACGAGTAGAATTTCNANTAGAATTTCAAGTAATTTTCGGAC
 30



TGAATCAAATAATCCAAACCAAAGAGTAGATTACAAGCCAGG




TAAATTTTCAAGGATCCAAATGTGGTAGGACCAGGTCACTGG




GATACACAATGCATCATATATATATGTACGTGCAACATCCCG




TGAAAGGATTTTGCCTAGGTCGTTGCMCGTC[A/T]GCAGATGC




ATTTTATTTGTACCTAAGAATAATACACTCGAGTACACAGCTA




TTTATTTCAGTCTTGAGCACTTGAAAACATCATGTTAAGTCCA




CATAAATTCATTAAAGACATTGGAAACACCAAAACGGTATCA




GCAACTGAGGCTATCCAAAGCCAGAGATCCAAAATTACAAAG




GCAAAATGATAAAATACGGAAT






S04059-1
AAATTCCAAACGATTATTATTCTGGGAAGAGTCGAAGCACAA
 35



TACTCATTAACAAGAAAATTCGGTTGTACCCATAAAATACCG




ATTCCTACCCGCACTTAGATTGATCAGTACTTGTAGTCCTCAA




CATGGAGGCTCTCTGAGGCACCATCAGCAAGCTTTGAGTTAC




CCTTGTAGGTTCCCAGAGTTGCCTCTGAGTT[A/G]GCCTTGGCT




CTTACCAAAAGGGCTTCCTGAGCCTTCTTCACATTCTCCTCTT




TTCCACTCCATGCCTTAAGGGTGCTCTGTTGAAGTGCCCTTCC




AAAGGAGAAAGAGAGTGACCATGGCTTCTTTCCATTGACCTG




GTTAATGGCATTGAGGTTAACGGATGCCTCCTCCTCACTCTGC




CCACCAGACAAGAAAACGAT






S07851-1
TGTAAAAGGTTTACGTAACCATACTGCAATTTRCAACCACAT
 40



GCTGCAGTCGTATCAGCTATATTTTCTTGTAACTGWGATTAAT




TATAATCGCAATCTAAATTCATTATTAGAGATAAATGACCAC




CATGCATCCTGATTTATGCGTGTGAATGGAAGAAAGCATATA




CAGATGTTGACATATCCAAGCATTGATTTGT[A/G]CAGTGATC




TCCAAATATTTCGTACCGTCCAAGGAAAAGATCAGGCTCTAG




CCACATATCTTCCGAGCCACATACATGAACTGAGCTTGAATC




ATATACCATAATTAGCATAAAAGGGATGAATTAATGCAGAGG




AGTTTAATTATAAAGCTTTATTTTTCGCTTCARAGTTTCAGCA




AGATTTTATATATATATATATAT






S11659-1
TAAAAATGATTTAATTATTATTAATTTAACTRTAGTTGAATGA
 45



TTGAATCTTGAACTAATACCTTCTTCTTCATTGATTGAATGGA




AAATCTAATTTTATAAACATTGATTWGCCTCATTTGTCGAGGS




CCACTAATGAAATGTGAGATTTCCTTGGAGGGGAAGGTCTWG




CTCTAAAAAGCCCAGCTAGCTAGAGGTTG[C/T]CATCATGTAC




GCAATCTTAAATGATTGAGCATTGGGACAGAGCTTGCCATGT




ACTTTACTACCAATGCTCATATTTCCCMTGTTGATTGTGTCTC




CCTTTCTATCTTTATATCAACTTTCCAAGTTGTTGACCATGTCC




ATGTACAAAGGATCATAGCTGCTCTTTTCTTTTTTCTTTCTGTT




GTGCTTCTCACCTTACC






S04279-1
TAGATTAACTGCAAGGAGACACATTTCATCTTGAATTTTCTAT
 50



GTAACTTTGTATTACAACAAAGCCTTGTCCCGCTAGGTGAGGT




CAGTTATATGGATCACACGATGCCATTTGACTTGGTTGAAGG




CCAAATCTTAWGAGATATTATTTACCATGAGATCCCTCTTAA




CAACTCCCTCTGGTGTCCTTGATCTCCTTC[A/T]CTTCCTTTT




CACAAAACTAAAAGCCATATAATCTACTCCCCTATGTAGCAGTG




TAATACATCCTGGATTTTCTGTGAAAAGTTATACATTTTTTCA




GAAAATTGAAGGTCCTATTTATTTTCATAATGGCCACATGTCT




ATATGATACACYCTTAGCCCATGTATATATAAAAAATATGGG




CTGGGAAAGAAATGGCACA






S02211-1
GAGCTCTGCAAACAGATCTAGGAGGAGAGAGAGCGCACYGA
 55



GTTTCGTCTTCTTCAGGAGAAAGCTAGCCTCGTTTCGTAATTT




CCCCTTTCCNATTCAATTTTATTTTTTTAGGGTTTTCAATTTGT




ACTGTAGTGTAAGTGAATTTGGAAAGATGCTTTTGGTTCATTT




AAATTCCATTTGCTCGAACTGATGTTACG[A/G]TAAATTGCTTT




TCTTTTTTACATCGTGAACGAGAATGAGAGAGGAATGGYGTG




GCGACGTTGGCGTCACAGAGAAGGAAGAAGAAGAAGTGACG




TTGATGAAGAAGAAGAGTAAGAGAAATGAGAGGGTTGGTGC




CGCAAAGAAGGAATATGAAGAAGAATAAGAGAACGAAGAGG




GAGCGGTGGTGCTCCACTGTGCAT






S08942-1
TTAAATAATTTTGTTAGATCTCTCTTATTTTTAAATATTTTATT
 60



TAAATATTTATTAAATTAATATAATTATGTAAATATTCGGTTT




CTCTCCTTGTTTTTTTTTCTTCTATTTTTTATTTTTTTTTACTTCC




CTCTCCCGTGATCCTACGCCTCTCTTCTTTCTTTTTCCTCCTTTT




TTCTTTCCCTCCATCACGATCG[C/G]AGTCTCCTTTCACCTTCC




CCTTCGTCGTGGACATGACCTCCCTCCCCCTTCCTTTCCGTCA




CGGCCTCCTCCCCATTCTCCTCCGTCGCGAGCCCCCTTTACTC




CTTCCGAAACTCTATCACGACCTCCTTCCCCATGTGTCAATAT




GTATATTTTGTTTTTATGGTTTGTTTTTATTTGTTTCTTTTTAAT




CATTCCAT






S05742-1
GAGAGGACTCCCAGGTTTGGGCTGGGACGATTTTCTGGGCTT
 65



CCAAACGCAGATAACTCTCTTTGGCAACAACTCCGCCACGGT




CTTCCCGAAAACGGCGTCGTCCTGATGGATCAGATGCTCCCC




CAGCTGTTCCTCCAGGTACTTGTCGAATTCGGGAGGGTCCTCG




TGGCCGAACTCGGTTCTAATCTCCAGGATTA[G/T]GATTTCGG




ACTGCGTGTCGGAAAGGAACTTCTTGACGTCGTTGAGGACGA




GGTCAACGCCGTAGGTGAGGAGGATACCGTGGCAGACGCGG




CGGTGTTCTTGGACGCGGATGTCGAGGACGCGGGTGCCGAGG




GAGAGCTGGCGGTAGATGGAGAGGGATTGGCACTGGGCGAA




GGGGCGAGTGAGGAGAGGGATGCCAA






S09155-1
TACTCATGGATCTTGTCCTAAACCTGATATGAATGATTCAAGC
 70



CCAACACGGGYGACAAGACTGATTTCGAGAGTGTGGACTGTA




GCCAAACAGGAAGTGTTTGGTCWTGAACTTCCTATGGTTCAT




ACTCATTTTTCATGCAATCTGCAATACGATCATCCTCCGTGAG




ATAGCCAGGCAGAACAACAGGATTGATGAC[A/G]AAACGTTG




CAATTTATGCGCCTTGATCACTGGTTCGACATGCWGTCGTCA




GTGAAGGTAGTTTGAATCATCGAGCTTCTCGGCAATAGTGTTT




GGGAACAACTGAGATACAGGATTCGAAGGTACGGAAGCCAT




GGATACACAGATCTTGAATGATAACTACAAAGGATAGGCTCT




CGATACCATATTTGATAAAACTGA






S02037-1
GAAGGATAGCCCTGAYAGGAAGGGTGTCTTTACACGCTTTCC
 75



AAATCATGTTTTTGGCCTAGGGGATTGCTTTTGTTTTCCACAT




CRCCTTTCAAAAGCTCCTGGAATTTCCATTRGACAAGGCATAG




GAAGAATTATCCATCAACAAAGCCCTTTTATACCCAGATTTG




ACAACATACCTTCCATGTTGTGAATGCTTC[A/G]AGATCAAAT




CATTACTAGTCTTGGTCCAACTCAACTAGATATTTTGAATAAT




TTGAGAGATATGTTATAGGAATAAAGAGTTTATGGAAGTGTT




GTTCCAAGATTTGGACCCTGGCTCAATAAGGTCCTTAACCTTC




AAAATAGGATCCATTGTAGAACCATTTGGAGGATCGAGAKAG




TGCCTYGCCAGATTTAGAATY






S13136-1
TTAACTTTGGTGGATTTTTAATTTTTTTAATTTGCTTTTCAAAT
 80



TCCAATTTGTGATATTCCAATTTGTATGTGTGAGGTTGCTTGT




GTTTGATTGTGTTGAATTGAGTTGTYGCTTGCATTGGATGCAG




TTGATAGGATGGTGAAGTGTGAGAAGTGGATTCGGGATGATG




AAKATCACTTGGAGGGGTCTAAGGCGAC[A/G]TGGTGGTTGA




ATAGACTGATAGGGCGCACGAAGAAAGTAACTGTTGACTGGC




CATTCCCGTTTTCTGAGGGGAAGCTTTTTGTTCTTACTGTTAGT




GCTGGGCTGGAGGGGTATCRTGTTTCTGTTGATGGGAGGCAT




GTGACCTCTTTTCCTTATGGCACTGTAAGTKATATATATCTTT




CTCCTCGAAGTTGCTAACC






S17291-
AAGACCCTTATGCACACCTAGCAACCTACATTGAGATTGTAA
 84


001
TACAACCAAGATTTCCGGTGTGCCAGAGGATGCAATTAGGTT




GAGTTTGTTTTCATTTTCACTGTCTGGAGAAGCTAAGAGATGG




CTACTCTCATTTAAGGGCAATAGTTTGAAGATTTGGGATGAA




GTTATTGAAAAATTCTTGAAGAAATATTTTC[C/T]CGAGTCAA




AAATAGCAAAAGGAAAAGTTGTCATCTCTTTTTTTCACCAATT




CCTAGATGAATCCTTGAGTGAAGTTCTAGAAAGATTCCGTAG




CTTGCTACGAAAAACTCTGACTCATGGATTCCCAGAGCCGAT




TCAACTTAATATCTTTATTGATGGGTTAAGGTCAYAGTCAAAG




CAGTTTCTTGATGCTTCTGCTT






S13139-1
GCTGAATGATATGATTCTAATAACTGTGGTTTAGACTTTACAC
 89



TTTGTTCTATTTCCATTTACTATTGTTTTTTTGTTCAAATCAGT




TCCGAATTAGTGGATGCTGTCAAAGGTAGTGGTGATGCCATA




CACAAAAAGGAAGAGACTCATAGAATKGCAGAGGCAAATAG




AGCTTTTGCACATTTTCATTAATTCATGAA[C/T]GGGATCTATA




TAGACAGACCCATATAGAGAGTATTTTGAAAATTGTAATCTG




ACAATTAATCTATTACCCTATTACTTCCAAGAAACGGGGAAA




GATTTGCCTTGTTTGGTACTTACACCATAAATATCTTTTTAGG




AAAAATTCTGCTTTGGTTCTTTTACATCTGAGAAGTGGATATT




TGTGTTTTTTGACAATATTT






S17292-
TATAAATTTTCTTGTATCATGTTCAATTCTTATTGATAAAAAA
 93


001
AAATACCTCTCATCTCTATTTACCATABCCAAGTTGAAGTAW




GGGGCTGTGCTAAATCTTATTTMTAGAGAATCAAGTATGTAT




TAATTGAATCAACTATCCATCAATAATTTCTTACCGTCTTTAA




CAATGTAATCATTTAAGTACATTATCCCAC[A/G]TTCTTTATTC




AAATCATTTCATTCTTTTTCACATAACTACTTAATTATCCTATT




TAACTACGTAAGCTAAAAATGTGTCCYGTCAAATAATCATTTT




CATTATTATGGTTATTGGTTAAGACACCGACACAACACAGGG




TTTAAATCTAGTTATGCAAAAAATAAAAATATTATTATTGCTT




CACTCTTAAACTGACTTC






S13146-1
TGCTGCTAGTTATGTTAAATAGGTGATTAGGAAGTATTTGGA
 98



GAAAAAGGACTCAAAAATAGGCCAAAAAYTGATGAAGTTGG




ACTCTAACTATTCATCATGGCTATGATGAGTCATCATAGCCGC




AATCAAACATAGGCATCATCAAAGTCGTGATCCTTTAATCAT




AGCCCAATGACAGAAAAGTTATGAAGTTCATC[A/G]AAGCCA




TGATCTTTTGATCACGACTCAACAAAGATTTGGATTTGAAGG




ATCATCATGAMTTTGATGAATCCATCCTAGCCGTGATGAGGA




CACGTTGGCAGTCACGTAACATATTTATATAAATAGCCTTTTT




TAGACCCTAGGTTTCTAGTCTTTTATTCTTTTKCAGTTTTGAGA




AGTTCTGGGAGGCAAGAGTGCTA






S17293-
TAGAAAACACATGACCAAATAAAACCATTAACCCTAATTCCT
102


001
AAACAATATCTCTAATATATAAAGACCAACGATTTATAAAAG




TAAAAATAACATTCAAAATTAGGTGAATAAAAACAATTTAAT




ATAAAATTTAAATTATTAAACCTTAAACAATATCTCTAAAGCC




TAAAGACCAACAATTTGTAAGAGTAAAAACA[A/G]ATTCGA




AACTGAGTGAGTTAAAACACATGAACAAATAAAACCAATTTA




ATATAAAATTTAAATGATTAAAACCTAAACCCTAAATCTTAA




AACTTAATCTTAGCATAAAATCACTTAAATCAATTATTAAAAT




CTAACCCTAACTCCCTAAAAAACGTGTTTGATGATTGGGTGA




AGCCACCCAATTTGGCGCCACCAA






S17294-
AAAAATAAAACATAAAAAAGGATATATAATAAGTTGAAAAG
106


001
TTAATAAGATAAAAAAATAAACACACTTGCTAAAGTTAAATC




AACAACACATAATAATAATAATAATAATAATAATAAYAATAA




TAATAATAATAAAATAAATTAATTAATTAATTAAATACAAAA




AGAGAAAGTAAGGAAAATTTCTAATTTTCATTG[C/T]ATTATC




GACRTGATTTGTCTCYTGTGTGAATCTCAGCATTAAAGTTGAT




AGAGTATTTTCAATTACAATAAATAAAANAATTCAGAGTATA




ATTTGWTTTCACCCATAAATATAAARAGAAATAACTAAAATA




CACAGAARATCAGAAATATATTATGTAAATAAAAATGCADGA




AGCAATCAVCAAGATAAAAATARAA






S17581-
AACAAACAAATACAAATCCTATTTAAAAACATTTTTTAAAAC
111


001
ATAAATAACAAATTTTGCAAAAAAAAAATTAAAAACGTTCAT




ACAKAGGAAGTTACACTTACGGATGAACTTCACCAGTACRAA




TGCAAAATTGGAAATGCGCAAATGCCATTAACGGAGACGTGA




AGCTTACCTCGACGRTGGAAGACCAAAKCACA[A/G]CTACAC




GTCCTCAATGCCTATGGTGAATCACCCTATGTGACACGGACG




AAGGAAGAAGAAGCTCGATCGGYGAYGAGAGGAGAAGAAG




RAGGAGGTCGAAGGCGCTGCGGAAGGAAGAAGGAAGYGTAT




GAAAATAAGCTGGTGCGCGASTTTTAAATTTTAAGTGAAGGG




AATTTTCGCCCATTCACTTAAAATGTTGG






S17691-
TACATTGGTTTAGACATTTGTGACTCTAGCTATGATCATTGTG
115


001
TGATTGATTTGTACACAAGTTGAATAGTTAGCATGATCTTCCT




TGCTAGTGATTTCACTGACATTAGTCATGCATATTTGTGAAGA




TTTGAGCTTGAACAATAAGGTTTTATTACACTATATATCTATG




[CCTCTTTTCCTTGGCTATGTGATTAAGCTTCTCATCATTGTGGA




CTCTAGAATTTGTTTTGGAACCATGTTAAGATTG]TGTACTAGT




TTGCATTAATGAAGATGATCAAGGCACATAGGAAAATTCTTT




CTGGCCCTTGAATTAGTTGAGAGYTGTTGCCCCTTATTAGCCA




AATTTGAGCCTAACACTCTTGTTATTTGGTACCTTTGCATTTGT




TGAAATATTATAT






S17701-
GCCTTCACTTCCATTTCACAAATACACAAAAAAAAAAAAAAA
120


001
TGCTAGTAGTGWAAACACTCAAAACACTCAAATTAAGCCCTT




TCAACCTTTCTTTCTTATGAGTTTACAATCTCAAAGCCCATCA




AAGTTCACGACCCTATTCATCTCTTCCAACATGAGCAACCCTT




CAGATGAAAGGGAGCAGTGTCAAAAGAAGA[C/G]GAAATCCA




CCATATGCGAAGCCTCTAACTTTAGGACATCAAGGAGAAGAT




TCTGCAGCAACAACAAAAATGAAGAGGAGATGAACAATAAG




GGAGTTTCAACAACACTGAAGCTTTACGATGATCCTTGGAAG




ATCAAGAAGACGCTAACCGATAGCGATTTGGGAATCCTAAGT




AGACTCTTGCTGGCTGCAGATTTGG






S03703-1
GTTGGAGGATCATAAACCACTTTTTTTTGCTAACAATGGTATT
125



GGTACAAAGAAGCCCTGCCMGAAGCGGTGACTAATCTTCGTC




RAAGACTATGAGCATACAAKAGATGAGTGTACGTATTCCCCT




CCCAACRTGATTTATTCATACCCAAGGACTAACCAGGATTCA




AACYATGAATCATTTGATTAAGCGACAMAAG[C/T]CGCTACC




ACTTGTGTCAACCGTTGTTRGTATCATAAACCACATTTATAAG




CTTAATTAGACACCTTCTCACTCCATTTAATAAATTATTTTGA




ATATTACTTTTTATTAATATGTTGGTGTGAAAATAAGTCAATT




GGTCAGTCGTGTCATCTTATTACCAACAAGTGATTTCCTTTAG




GCGACTAACTCAAGAAAGAAA






S17297-
AKAGTACCAAACCATTTTTTTATACTTTCAAATGTTTCTTAAT
129


001
GCTYAAATATATTAATTCAACAAAATAAAAAATAATTATTAW




TAAGTAATAATTTTACAACAATATTTAATTTATTATTATACAG




ATATAACATATACAABTRAAAAGAAATAAATTAATAATTTCA




CAACACTATTTAATTTATTTCTGAAAAGCA[A/T]TAAACAACA




TTTTCACAACAATATTTAATTTATAATATTAAACATATCAGTT




TACACACAAAAAAACTTTCTTACATATGTATTTGATAGTTACA




ATATAATATTTTTTTTCTAAAAAAAACTTACTTTATTATTAGTT




GTATTTGCTAAACAAATATTTGAATCACGTAACTAAAAAGAA




AAGAATTTGTATCTGTCGC






S17298- 
AGTTCTTGTTGCTATATATTCGTTATCCTTAATTAACCACATA
133


001
CGTGAAATTTAAAGATGCCATCACAAGCAGAGCTAAGCATGA




TGGATTACAAGCCCTACAGCTACTCHACACTGCTGAAATCAT




TTTTAGATCAAACTGAAACTGATCAGACCTACAAGCTTGAAG




AGTTCCTCTCTCGCTTAGAGGAAGAACGTGT[A/C]AAGATTGA




TGCCTTCAAGCGCGAGCTTCCTCTCTGCATGCAACTCCTCACC




AACGGTACAAGTTTCAATCAATCATCATCATGGTTTCACCAA




AGAAACATATCAAACGTAGTTGATGATATTCCAAATTCCAAT




GAACCAATTAAAACATGGAATGTCCTAAACCCTAAAGTTTCA




TCAATACCCCATGATGAAAATAT






S17299-
CACTATAACAAAATTGACTTGTTTTTTTTTTAAATAACAAAAC
137


001
TGACTTATACTAAGTAGGTTATATTTTGCTTATAAARAGAAGT




AGCTTATATTTTAATCTTTCAACACATAAAACATTGTCAATAA




ATAGTAGAGGTGRCTTACACTACTAAAAAAAAAGGCCTTTTA




CATCGGTTCTAATGACTTTTCTACATCAA[C/T]TATGACGCGTG




GTGGTAGTCCAAGTTGTCVAATAACGACATCGGTTGAAGGA




CCGTCTTTGAAGAACATTGRTACGAAGACGAGCATGGTACCA




AACTCTTCTTAGAATGGGAATTGTTCTATATCGGTTGTGTAGG




TACAACAAATGTAGAATGTTAGTTTTCTACATCGGTTCTKAGG




GTGAAACCGATGTAGAATG






S17300-
GTCGGATGCTCTTATTTTACTCTTATTATTTTCCAGTATTTCGT
141


001
TTCTGGCTATCCATATCAAGGAGATGCTAAATTTTAGAAGAA




TAGATATTGATATTATTAGTAATTATACTGGATGGTTATTTGG




CTGATGAAATAGTCGGATACCCCCTCCTTGATTAAAAAATAA




TCATATAAGTAGAGATGTCAAATTTTCGA[C/T]TAGTGGAATA




GTTGAGTGTCCTTCACAACCCACTAAAAGACAATCTCAGACA




TCTAGCCACCAGAGTGTCTGAATACTTCCTAGAACATAAATG




TCAGAYGGCAAGACAAATATAGACCTTGACCTTTTGGTTGGT




CGGATGCCCGGATATGCATTTGGCCACCCGAATAATCAAATA




CTCAATAAAGAGTAAACTCGAC






S17301-
AATAGTATGCTATTCAAAAGTAGGTTATCGAAATGTGTTTGA
145


001
ATGACTCATGTTCGCAAGGAAAATATCAGTACAAAAGACCTC




AATTTACCATACAATTTGATAAGGGGAACAACTTAYAGAAAA




CATTATCGGACAAGAAAATTTGGATTAGTAAAGAATAACCCT




ATCACCTGTCATATACCCCTTTATAGATAGCA[A/G]GATCATG




AGAAATACTCRGGATAAAGTATAGTCTAAGGAAACAACTTTA




TCTCTTAGCTTGGCAATATCTAACTATTTAACTATGCTATTTGT




AACTAAATTGTGCCCTAACGRATCTCAAGGTCTTGACATTTCT




CTCAACAATGTYGTCTCGAACTAATCCATACATAGCCTTAAA




CTACTCACCATTTRAGGTGTCT






S17306- 
GGATAAAGAAAATAAAAACATTTTTTTTTCTTTCTTTCTCTTTC
149


001
TTTTCATTAAAGGCTATGTTTGACAACTAGCCGGAAAGCTAG




CTAGAAATTAATGTTTTAAGAAAATGTAACTTCAAACAAAGT




TACTAAAAAAGTTAAAACTTAATTTTTCATGTTTGATATGTAT




TTTTAAGAAACATGTTTTTAGGAAACTAT[A/G]AAACTATGAT




TCTTAGGAATAATTTTTAACTCACATTCTTGCAAAAACAAAAA




ATCCCATGGAGAAGGATGAGGGGTAGAATCTTACTTTTTAAA




GTTTAATTTTTTGCTTCACTATAATAAAAACAGGTGCTACTCT




TATTAAGTTATTCAATGTGGTAAATTTTAAAGTTATTGATAAA




AATATCACGTAAATTTTTT






S17310-
CGATGATCGCAAAGTTTGGCTACGACAATAGCTTGAGTGAGG
153


001
GAAGGTGGTTGGAAGGCCAGTACCTCATGGCGCAATTCTGGT




GTAAGGCCGGAAATGGAGCAACTCATCAGGAATGTTGGAGC




AAGGCCCACAATGCGATTAGCCAAGTGTTCGAATTCCGTGAG




GTATTCATTGATGGTGCCTCTTTATTGCACTTT[G/T]AACAGAG




CTCCTTGCGTATTTTCGTAGAAGGACGAGGAAAACTGAGACT




CTAAGGCCTAAAGCATAGTTGACCATGACATGAAGAAGCCGT




TGCGGGACATCCATGGGTACCACGAGAACGTTGGGCCCTCCA




TGTAAAAGGAGGCCACGGTCAAGYGTTCATGGTTCAAGAGAC




CCTAATAATCAAAGAATTGTGATAT






S17311-
AAAATTTGAATTCATCGTGGATTGGAAAGTATCTAGGGTATT
157


001
CACAAGACCAAATGGAGTCGTGAAGTGCTCATAATGACCTTC




GTKAGTGTGAAAGGAAATTTTGGGAATGTTAGATTCCCTTAT




GCGGATCTGATGAAATCTAGATTTTAAATCAAGCTTCGAGTA




GACGCAGGAGTMATGGATCTTGTCAATGAGCC[C/T]CGTTGAT




GGTTGTAATAGGATACTTATCTAGGACAAGGACCTTATTGAG




GGCCCTAAAGTCCACACATATCCTTTATGATTCATCTTTCTTC




ACTAAGATGATAGGGCTCGAAAATGGACTAATTGAGTGACGG




ATGATGTCCTTTGTGAGAAGCTCTTGCACTTGTCGCTTAATCT




CGTCCTTGTGATGGTAAGCATTT






S17312- 
TTGTTTGCAGYCGACAAGTGTACTGGATCGCACAAGTAGTAT
161


001
AAAACGATAAGAACCAAGTATCAAACTCTTGGGGAACTTGTG




TTATCTATCAAGCTATTTCGRTAAATAGGTGTCTGGTATGAAA




AGATGATTGTGGTTATGAACAAGTATGTAAACTATCTATGCA




AAAAGAAAGAAAATCACGCAACATAAATGTT[G/T]TGTAAAA




ACAAGTAGAGAACGCGTTGGTCTTCCTAATWGGTTCCTGATG




CTAAAACGGATGTTCTCTATCTAACAATGCTCATGTATTCCTA




TGTTGTCTCCTGGACTGTTAGACCCCGATTCCTCATGATAGCC




TAGCGTAATCCTGATCAAGTCTCATCCGCAGATTCCTCTTGTA




AGACTAAACTCATTCAGGACCG






S17313-
AATATTAGTAGTTTYGTATTCCATTTTATTTGTTCTTCTCTTTA
165


001
ATTACCAAACAACCAACCCCCCCCCCCMYCGTTACTGTTACT




GCAAGTATATTATGAACATTTGGCTTGTCACTGCTCGTTGGGA




AACGACCTAGGATCACTTCCTAGTTACTGCATTTTCATGTTTA




TTTGATTCGGGTACGGCCTCGATCACAC[C/G]CCCTCGCCTTC




AGAGGACTACACGTCCTCCTCTTCAAAGGACTATACGTCCTCT




TCTTCAGAGGACCACACRTCCTCCCCTTCAGAGGACTTCACGT




CCTTGCCATCAGAGGACTACAYGTCCTCACCTTCAGAGGGAT




ACACATCCTCACCTTCATAGGATTACACGTCCTCCCCTTCASA




GGGCTGCACGCCCTCGCC






S17316-
TTCTRTTTTCAATAACGAGCGTCTCGATATATTACGVGACTCA
169


001
ATCGGAGATCYGTGTAAAAAGTTATTGTCGTTTGATTTTTCTC




AGAGCTTCAGTTTTCAATTCCGAGCGTCTCGATATACTACGGG




ACACAATCRGACATCCGASTTAAAATTTATTGTCGTTTGATAT




TTCTAAGAGCTTCCATTTTCGATTACGA[A/G]GATTTTGATATA




TTAYGGGACACAATCGAACATCCGAGTAAAAAGTTATTTCGT




TTGATTTTTCTCAGAGCTTCAGTTTTCWATTTCGAGCGTCTCG




ATATACCACGGGACACMATCARACATMCKAGTCAAAAGTTA




TTGTCGTTYRAATTTGCTAAGAGCTTCTGTTTTCAATTACGAG




CRCCAGCCCCACGTCATNN






S17317-
TCAATTATTTCAGCATGAAATACAAAARGATCTTCAGATGGG
173


001
TGTTTCATAGCATCAAGAATATTAAAATGAACAGTTATATCA




CCAAACTCCATAGATAGTGTGCCTGCATATACATCTATCTTAG




TTCTAGCAGTTTTCATAAAAGGTCTGCCTAGAATGATGGGAA




CTGATCCTTGAGAAAATCCCTCCTCCATTTT[A/C]AAAATATA




AAAATCAACAGGGAAAATTAGTTCACCAACTCTAACTAAGAC




ATCCTCTATGAAACCAGCAGGATAGGCAACACTTCTATTAGC




TAAATGAATTACCACATCAGTTGRCTGCAAAGGACCAAGAGA




TAGAGAATTAAAAATAGACAGAGGCATAACACTAACAGAAG




CTCCTAAATCYAGCATGGCATTGTC






S17318-
AATCTTAAATAGATAGTTAGAGTTTTTACATCAAGAGTGCTCA
177


001
GTGGAAAAATTCTCTAACAATGAAGTGTTTAGCCCTCCATTA




GCARGGAGGGCTCAATACAAGGTTGAAACAAGATAGAAATT




GAGTGGTGAAGTGAATGTGTGAAGAAAGTAGCTTCCTTCAGC




CTTGATGCTCTATTTCTTTTCTCCCAACCTGC[C/T]GACACTTT




GTTTTCCCTTCCTGTTCTATTTTTAATGACTTTTGGGATTCTCG




GATTATGAATGCGCACTCAGCCAGCATGTCTCGCTGAGTGAG




AGTTAGTGATTAGGCTCTTAGCGAGCTTTGACACGCTAAGCG




CGAGAAGCGACAAAGGCTTCGCTGGGCGGGCTGGTTGCGTGC




TTAGCACGTTGCTCTCTGAATT






S17322-
TTGAATATTAATTATTGATAGTTATTAATAAATTATTTATTCA
181


001
TAGTTATCAATTGATTTTTTACAATTGATAGTTTACTAGCTAG




CTTTCTGCTAAAAACTGTTTGAAAGCAAAATGCAAATGCTAT




ATGCTGTGTTGTGTGGTCTGATTTGAAATTTACAGGTTAAATT




TTGGGTTTTTTTTTGTAAAAACAGAAAGT[C/G]TATTTAAAAA




AAATCCTAATAACAACATCGATTTTTTTATAAAAAAAAGCCG




ATGTTAATCTACACAAACAACATTGGTTTTTTGGAAAAATCG




ATGTTAATATCCAAAARCGTTAACATCGRTTTCTGTGAAAAAC




CGATGTTAACATAGAAAATGTTAACATCGGTTTTCTATAGTTC




ACATCGGTTTTTGACTGAAA






S17326-
GGGCAYGGTAAACAGCTGGTTAGTGAACTAGATTCTTGTTCT
185


001
TTCTTTTCAAAGTGTTTCAAGATATCCTGAACTAAYGTAATTT




GATGCTCCCTGTAACTCCCGTAGTCCCGCTGTATGBTGTGTTG




AATTGCTTGCTCAGGCAGCTTGAAGGAGCACATTGCTAAGGT




TAAAATAGCTGAAATTGCATTTATGGTGCA[A/C]TTATGCAAT




TTGCTGTAAGCAGTGTTGTGGTAGTAATGTTCTAAATCTTGAA




AGTGTTGTTTCCTAGGTTTATAGCATCTATTTAAGGACTCATG




AGAAATCCCAGTTTATTGGAACATGTTTGTCTCGCTGACATCT




ATCTGCTGTAGCATTCAACTAGTCTGTGTTTTGGTAACTGTGT




GGACATGCCATTCAATCCC






S17327-
ATAGTGGATGTAACTAGAGTCTAACAGAGAGACTATGGTGGT
189


001
TATAGGCAGTCTTCTTCNGCCATGTAAAGATAATACCAGTCTA




ATTGCTCCATAGTGAAGATGAGTGTATCCTTGGTGTTGCTAAC




TTCTGATCAGTTGTTTAGGAATCTCAATATTAACATATTGCTC




CTCAAAAGTAGCAGTTAAAGAGGACTGGT[C/T]CATTCTTAAA




GATTGGACATATTCTTTCATATGAGGTCTTCTAGTGGTGATCA




AGTTGGTAATAGATCTTAAAGAGTAACGATGTCTTTTAAATAT




ATGGTAAGGGCTCAACAAGGGATTTAGTGACTGAGAGATTTG




AGCATCTTCTGGAATATATGAATATTCTACAAGATTCTCTATT




TTTCTGAAAGAGTTTTGGA






S17328-
RTCTCTACCAAGAGATTCAGCAAGATCCACGTGTTTTGGAGTC
193


001
CATAGATTCAATCHCATTTGTTGAAACTCCTTTGCATGTTGCT




GCATCTCTTGGTCATTTTGAGTTTGCTAYTRAGATCATGACAC




TGAAACCTTMACTTGCTGTGAAACTAAATCCAGAAGGCTTCA




CTCCCATCMACCTTGCTTTACAATGCATC[C/T]ATGATAAAAT




GGTCCTTCRCCTTGTAGAAATGARCAAAGATCTCGTCCGAGTC




AAAGGGAGTGAAGGCTTCACTCCACTGCATTTTGCAAGTCAA




CAAWGTAAAACTGAGCTTTTKGATAAGTTCCTCAAGGCTTGT




CCAGATTCCATTGAGGATGTGACTACCAGAAGTGARACCGCA




CTACATATTGCAGTGAAACAT






S17329-
ATCATTTGAGAATTATACTTCMAAGTTCAGACCTCATTTGAG
197


001
GCACAAAATTTCKTGCTCCTTCTCTCCYTCTCCCTCCACTCAT




CTTCYCCTTCCTTCRAGCTCTTATCCAYGGCTTCCTGTGGTGG




TGAGCTTYTTCTTGACTCATCTTCTCCTTGAAGTGGCRTCTCC




AATCATCTTTCTTCCTTCTCCATTTYGCT[G/T]CCATGATCTTC




ACAAGCTCCACATAGAGCTACATCACTTAGCAACTCTCCAAT




GGTCAATACCTGGATTACTCTAATATCTTCCTAACATTCCAAC




TRCAAAAGCAATGCCAGACCTTGTGCATACTTGAGCATACAT




AAGGCTTCCAACAACTAAAGC






S10746-1
TGGTTGATTCGAAGAACAATTDGGTTTTGATTAAYGTGATGA
202



AGGTGTTTGCTAAGTTGGCTCCCTTGGAACCTAGGTTGGGGA




AGATTGTTGAGCCTGTTTGTGACCADATGAGGCGCTCTGGGG




CCCAGGCATTGGTGTTTAAGTGTGTTAGGACTGTGCTCACTAG




CTTGAGTGATTATCATTYTGCTATTAGGCTC[A/G]TTGTTGAGA




AGGCTAGGGATTTGTTGGTTGATCAGGATCCCAATCTTAGAT




ATCTTGGTCTGTAGGCGCTTTTGGTTGCCACTCATAAGCACTT




GTGGGTGGTGATAGAGAATAKGGAAGTGGTGGTTAAGTCGTT




GAGTGATGATGATTTGACTATCAAGATCYTGTYAGTRCGATT




KTTGATGGGCATATATGGTGTC






S17331-
TTCATTGGAATGGAATATAACAAAGTAATTAGATTAGATAAG
206


001
AARAATAGTTGGAAATGAGACGTTAGTGTGGTGTGCGAGGCG




AGGCCATGTGCTCCAATGCGGCGGTTATTTAAATATGTCGTAT




TGTTTAGKTACACACATTAACGTCAAAGTTTCAAACTATATGC




GGTTGTCTCYGSTCTTCTCAGATTCTCTCC[C/T]ACGATGTTAT




TAATTGTCTTCATTTCCCATCTCTATTCTCTATTTGATCACACC




GTTAACATGTTCCCATTCCATCTCATTGACAATACAAAATAAA




TTATTTATGCACTGAAATTAATATCTTAACACACATTTTTATTT




TTTTGGTAACGTCACACATTTTTATTTCATTGTAAATTATCAG




GTGTAATAAATTTAWT






S17332-
TCCAAAATTTTAAYAGTTACGATGAACARACTAAGCGCAACA
210


001
GGCGCGYTTAGCACGTTCATCGCTATTTCCAAACAAAACCAC




AGGGGTYTTCACCCGTTTTAGCCACATGGCCCCTAATGGGCTT




CTAAGTTACCTAAAATCCTATATTGACTAACCCTAAAACTAAT




AACCTTACCCTAACAACATACAACTAAGAA[A/T]ACAAGAAG




TCATCTATCCTAAGGTTTGAAGAATGAAAAATGGAAATAGAA




AAGTACTCACTTACTTGGATTGTTCTTGAAATGAAGCAAAGA




AGATGYAGACAAGCAGTACACACACAGCAAAAATACACACT




TGCTYAGGGTTCACAAATGTAGAAGCTGAAGGTATTTGGGGT




AACACCCAAGATCCTTAGCCTTTGT






S17337-
TAGTCTTGTTATTTTTTAATTGAGACAATTTATTYCCAATTTTA
214


001
AAAAGTTTATAATTTTARTCTCCTATTTTTTAAATTAGACGTTT




CGTCTTTCACTTTTAAAAAAATCAATAATTTTAATCTTTATGT




CCAATTTCAAACGTTGATCTATACACTTTTGTAAATGTTGATT




GATAATTTTTTTTATTATGTACATGA[C/T]AAATATTTTTTTTAT




AATTATTTAATTGTTATCAAGCTTAATTTATTAAAAGAATTAA




AAAAAGTCTTAATTAATATCACGGTCAATAGGTTTTAAATTG




ATCAAGGAGATTAAAATTATAAATTTTTTTTTAAAAATAGAG




GACGAAATGTTACAATTAAAAAAAATAAGGAGACTAAAATT




GTATATTTTTTWAAATG






S13093-1
TAAACACATGAATTTTTTTTATCGACAAATATTAATCATTAAT
219



TTGTTAGTAAGAGGATMGAACTGCGCSATATTTTTCTTGTTCC




GTTAAATTANACACATGAATTAATATGAAGGGAAAATTAAAC




AACAACATGATACATACCTCAGCATGCACAACATGAAGCATC




GAGTCTCCAGMARGTGAGGAATTGGCACTT[C/T]TAACATCAA




TGTTCTCTTCATGCAGCATTCGAATGATCTCAGAGAAAATGA




ACTGGTGGTCTAATCCACAAGTAAGAACAACTTGTAAAGAAG




AACCCACTTCGTGAATCTCAAGTTGTGGCGATTTTGGGAAAC




CTGCAGAAGTTGCAGCAAAATTGGTACTATTATTGGAGAAGC




AATCACGGGATCTCTTTCTAATT






S12211-1
TACTAGACACTCACTCATTGGATTATGAGTATTGGTATTAAAT
224



GTGACCATCACTTACAACATTTAAACTTATGATAGTATTAAAT




GTGACCACCACTTTCCTTGGTCTTGATGATTTGTCCTATATTTC




TTATATATAGGACCAGAGGTAGTAGATTCCAAAAGTTTATGC




TACCACAATATTACTTGTAAAGCTGCAA[C/T]GCCATACTAAA




CCATAATACATAACTGAGTGAGCTTAATGCAAATTGCTTGTTC




ACCAGAAATAAATAGAAGATTCAGGCACGCGGTACAACAGG




ATAATGGAGTCAAAACACAAAACTAAAGTTATTTATAGACTA




CCATGTATTTTATTAAATGACCACTAATTTGTGATATAGGCCA




TTAAAAAACAATTTCATCAA






S04555-1
CTTCAATGGCATGGCCGTGGAAAGAAACAGAGCTTAGATTCT
229



CTGTTTTTAATTCTCCAACGAGGAAGCTCCGATTCGAAAATTG




CCTCCGCTAGGGTTCTGCAGTTCATCGCCGTGGACGCGGATG




CGAAGATYTCAATCGYCGAGAAGCAAGGCGTGGTGGCCGAG




TTGCTGAAATCGGCCGCACCAGAGAAAGATCC[A/G]GTGCTG




ATCGAGGCCGCGCTGGCAAGCCTAGTGGMGATTTCGGTGCCG




AAGCGGAACAAACTGAAGMTGGTGAACCTCGGAGCGGTGAA




GGCGATGAAGAGGCTGTTGAAGGAGGCGAATTTGGGCGCGG




TGGAGAAGGTGCTGAAAGAGAATGAGAATGGAAGAGTGGAC




GACAATGAGAATGGAAAACTGAGGGTAGA






S08519-1
TTTGAAAGAARAAAGAAAGTGCTCACTGCTACCAATATACTA
234



ATACCGAACCATCCAACCAAATTATCTTTCGTCTATACTTTTT




AGGCTTCACACTTGAGGAGGTGTGAACTGTATGGCCAAATTC




TATCAACAGACCAATCAAATATTAACCCATAAATGGCTCACC




ATGTCCAATCAGGCTCATGGCTGATCTATTG[C/G]CAGAGCTG




ACTCAATGTCAAGGCCAATGAACTGTTGTGCACTGATAGCAG




GAAGACACTAGAGCTGTGAAGAATTGGCAGGCCAACTAGTCT




TGGCGGCCCAACDTAACAGTCTCTTGATCCTTCTCATGGATCT




AGCTAAAGTGTCATTGGCCAGAACAGTTAAAGAATGGCACAC




TTGTTAAATAGGTGTGACTAGTC






S12876-1
GAAAGGTCTTCCCCTGGTTCATTTCCTTGCTTCTTGGCTGACT
239



CACGAGGACTTCAATCGTTTTCTGCGAAGCTCTTAGGGTCGTG




GCTCGAATTGGGATCAGGGGATTCTCTCTTTCTCCGATGATCT




CCAAAATTGGAACCGGGAGGTGTTTGGCAATATCTTCCGAAA




TAGGGTCTCCTTAAGCAAATTCAGAGATT[C/G]GAAGCGTGCT




TGGGCTCCTCCTTTTCTGATGATCTCGTTTACAGATAACAAGA




GTTGTGGCGGGAATAGGAGCAAGTGCTGATACAGGAGGAAC




TATTATGGTTACAAAAATCTTTTTGTTACAGCAGGGGATTTCG




TCCATTTCTATTTATCTAACTCTTTCCCTGAGTTGGACATGTTG




GCTGCCAGGCCAGCTTGGT






S05937-1
GAAGGGGGCTGGGTTGGAGTAACACAAGGGGAACTCATAGT
244



CAGGCTTGATAGACCATGCCCAAAGGAGTAAGGAGAAGAAT




TAGAATYCAWGTAGAGGAAATTATTGAGAAGCAAACAGGAA




ATGGCAACAATTGAATTGTTCCCATGTCCCAATGGGCAAAGT




CCAGCAAAATTAAGGAGAGGACCTTGATAATCATC[A/C]GCT




ATGACCATGTGCACTGGTGCATTTTGCAAGAGAGCCAACTCA




GCCAAGGGATCTTCACAATATAACCATAAAGGCTCTAATGCT




CTGTTTCATGACGAACTGGAGTCGGAGGTGGGCTGCAGGGTA




AGTTTGAGTTAAGTCACCAACAATACCATGAAAAACAAAGGC




GCTAGGGGAGCAAGGGCGTTCACATGCTT






S08575-1
GCAATTTGAGTATAATTAGCTGCTTCTTTGGACATATGTTTGT
249



ACTGTGTGTTTAGTAGTGCTTTCACCAGGTCCAATGTGCATCA




AAACAGAAGCAACTAAAACTAGCTTCCACATTTTTTTAGATG




ATATGAGGTGATTTAAGCTTCAAACATGCATATTTGGAGTGG




ATCCAAACATGCGTCTAGTCTAAGAGATTC[A/G]GCAAGAAGT




TCAAAGAGATGAAGCTCTAAATTTATTATTTTTGTAATATTCA




GAAATTAAGCTTATTACCATGAGCATAAGTGCATAGTTACAA




CAATTTACTGAGACCTCTTTCATTATGGTTGCTCATAAATGGA




ATAACATTTTCATTTTTAATTATATCATGTTATTCTCWACATC




TTCCGATTGCTTAGTTTGAA






S08669-1
CCTCCATTCATCTAGATAAAMAGTTGAAGTTTAGCACAAGGT
254



ATGTTATGCTTGTACATTGTCCACACTTCAAGCCCAAAACGTC




TTGCATGAGGTGGTGGGTTGGTTTTTGACATATCATAATATGT




GTATCTTGTGCTAACTGATCAGAAACTTCATTACATTATCTGT




TTTTTCTGGGTCATTTTCTTATGGGACAT[C/T]TCCTCTAKGTC




CTCAGATCTGAAGAGGCTGAGAATATTGGATATGTGATTCCT




ACAACTGTTGTATCTCATTTTTTGACCGATTATGAAAGGAATG




GCAGGTATACTGGTAAGAAACTCTTCTAGAATATGGTTATATT




TGATAGATTTGGCATGTCACTATGTCTTATTTGAGTAAGCACA




CTGGATTGTGTATTTTTT






S11212-1
TAGTCTTATAAGAACTTCAAGACTTGTTTCTTTAAGGTGACAA
259



TAAAKTCGATTACTGGGAATAACTTATTCTTTGATTCAAAGAA




ATCCTTGTTTCCTTTATATAGGCTKCCATATCTGTGTYAGTAT




GATGAATACCTTAAGGATTTTTTTTASCAAAAGGAGCTCCAAG




ACCTTGCCTTCCTCGAGACCCTCCCCCG[G/T]TTACTTCCTGTT




AAACAAATATTTTCCCTCCTTAAGTCTCCTAATCTCATTTGGG




ATCTTGAGGGTTAGTTGTTTTTCTTTTGTGCACATBTCATTTTT




TGCCCAAACTTTCTGATTTATTATTTTTGACTTGTTTCAGGTAT




AACGACTCAAGCACGCGCCAACATGACTGCTTTCTAACTTGC




CCACAAAAACTGTAT






S00543-1
CTTCAATTGATAAAGCATTTGCAAGCTGTGAATTGAAGTTGC
264



AAAACCACAACTTCATGGAAATCCTTCAAGAAAATAAACTAG




AAACTATGAATATGTAGTAGTAGCAGTGTTAGTCAGAATAAG




TAGCATGCAATATGAACAACTTGACAACACACTATAAACATA




AATGATAAATAACTGACTGTTCCACTTATCCA[G/T]TGGTTCA




TTAATATCAAATATCAAACATCTTTGACAATTATTAGACATGC




ATACTCATGTGAAAAGGGAAAGGTAATTTTGATGTTGAAAAT




ATRCAGAATATGTATGTATACTACATACCATGGTTACTAATTA




CTATTTACTATCTACGGGATTGTAGGCTACAGCTACTATTGTT




ATACTCCACCTCTAGCTGAAAC






S01452-1
TTCAAACCAACTTAGSAGCTTGAAGCTCAAGAATAGGAAGAT
269



AGGTCCCCAAAATGGAATTGAGTGTRAAGTAGAAACTGAAAA




TACATTTGATGGTGTTTTGGATCCTTCTGTTGTATGAGCATAA




GGATGCAGTCAAACAATGAATGATGAATCCAATAGCTATAAG




AGGAWACAAACATGTGGCATATGTGAAGCAA[C/T]TGGTCAC




AACAGACGAAAATGTATTGGTGGCTCTCAAAATGCACAACAT




GCAGTTGGTGGGTTTGGTATTCCTTCAAGTCAGCAAACATAC




AATGCTCCTAAACCTACAGTTGAGTATAATTATCATCTGGTAT




ATAATTGCTTACTTTAGCCTCATTAATTGTAAATGGTTGTTAT




TTAATCAATAGTTACTTAAGTAC






S11993-1
CATCTCATTCTGAATCTTGCGCCGTTTCCCTCTCCCACTCGCC
274



AGGTACGTCATGTMGTTTTTGCTTCCCCGTTGTTGCGTCGATA




CGACTTGTCGTTTAGCACGTTCATGTTCATGTTCGGTTCGTGT




GTGTTGCAGTGAGGTGATTTGATTTGATTTGTGAGTGCTGCGG




ANTTTTTTTTTTCCATTTCCAGCACAAT[C/T]GATTCGTCGGTA




CAACTTGTCGTTTAGCACGTTCATGTTCATGTTTGATTCGTGT




GTTGCATTGRTTTGATAGTGTTGCGGAATTTTTTAGAAGTGTG




AATGTTCGTTCATGCATGAGCGGCTCTTAAAGTTKCCTTGCGG




ATTCGATTGCGATATATTGAGACTGCGATGGCCTCAGCCGTC




GTGAATTTCTTGAACGC






S13446-1
GGGGTTTATAAGRCCTTAGACTTTCGAACTACAACAGCTAGC
279



ATCTATGGTGTGATTCTCCGAAGTTTAGTTTTTGGGGTGTGAT




TCTCCTAACCGAACTAGTCAAACAACTATTGCACAACCAGCC




TGCATGGGCACGGGGCTGCGCTACTTTCGTACTCAAGCCTTCT




GATACTGAATCCTAGATTATTCAATCTGAG[C/T]GTGGTGGGA




TGTTAAGATCCAATTTCAAGGTATGGTACTTATGTCCCACATT




AGAAGCYTGGGATTCTAGAGTAGGGTTTATAAGGCCTTAGGC




TTTCCAACTACAACAACTAGMATCTATGGCATGATTCTCCAA




AGGTTAGCTTTTGGGGTGTGATTCTCCCAACAAAACTAGTCA




AACGGCTAGTGCACAACCAACC






S00252-1
TTTGTAAGAGAACCTAATTTTTGACTATAATGTGCTTGAATTT
284



GATACATATATCTTATTTAAGGAAGATCCAAACTCATATCAA




CCACATGTTGATTATATAACACATAATTAAATAATTAAGTGG




ATGGTATGAATTAGTTTTGGTGATGGCACATGTATGCATGCA




GCTGGGCAATAATGGATGGGGAAGCGGTCTC[A/T]TGGTACTA




TGATTCAAGCTCCCCGGACGGCACCGGTGCTTCTTCATCGGTG




GCATCTAAGAACATTGTCTCCGAGAGGAATAGAAGGAAGAA




GCTCAACGATAGGCTTTTGGCACTTAGAGCAGTGGTCCCCAA




CATTACCAAGGTACTCCATCACCTTAATTAATTAAACTAGCAA




TTATTATTGTTCATCATATATTT






S04060-1
GCCGCCGGAACTGCTTCGGACTCCCTCACTCTTGGAGCGACTC
289



AGGTCCTTCCATAATCTTTCTCTCCACAAACATGTCCAACCCG




AGCCAGAGCCAGAGCCAGAACCTGAACCTGAAAAGCCGGAA




CTGGTTCGGAGTCCCTCGCTCTTGCAGCGGATTCAGTCCATAA




ACTTCTCCCATCTCTACAGATCCGACTTCA[C/G]TCACCGAGA




TGATGAGGATCCSGATTCGGGTTCGGATCCGGGTCGCGGTTC




GGGAAAGGCGGCGGAGATGAGGAAATCGGCGAGCGTGAGAG




GGGGTTTGACGGATAGCGAGTGGGAGGAGGTCGAGAAGCGG




AGGCCGCAGACGGCGAGGCCGGTTGAAACGACGACGTCGTG




GAGAGAAGACGAAGAAGTAGACGCCA






S02664-1
AATCGTTTACAGTTGTGAAAAAACTGCATTGGTCCTTTAATTT
294



AATTTATAAAATGATAAATATATCTTTAGAATTTAGTTTAATR




AATTCTAAGGATGAGATTTAGAACTGCTGCACATTGCAGTTC




ATTTTTAAACTGCAGAGGTCCCATTCTCTGTAAAAAAAAGAA




TTTTCTTCCTCTGCTATTCCTTTCYATCTT[A/G]TTCGTTCATT




TCTCAACTAGTTCATCCTAAGGAAACCAAAACTACTAATATAT




GAAATGAAGGACACTATATAACTAAAGAGACATATGWCGGA




CCATTTTTAAATATATAAAACTCTATTAGAACTGCTAAAGTGA




AGATCCTTATTCTTTGCCTACAAATTTACTTACGTACAATACG




AAGGAGGAACTAAAGTTTAT






S00281-1
AATTTTCTTGCTACCATACCCAAATGGATTGGGAGGTCCTACT
299



TTTTCCTTTTCATTGAGTGACATAGAGAAGAATTTGAAGGCTT




CATATTCCAATTCGGATATAGCTTCCATGGAGACACCATGATT




GATGACTTTGAAGAATCCAAACTCCTCACAAGCCTTCACTAT




AAGGGTCTTTGCATCAGGTTTGGAGAGGT[C/T]CACTATGGGA




ATTGTTGAGGAAAATTTGGTTGGCATGCAGTTCTTAATGTAGG




AGTATTGTTCTGTTGTTGCTTTGGACAACAACACCATTTTTCTT




GTTGTTTGCTCGGCCGTTRTTCTGTTTTGTGGTGTTAAGAAGT




GGAGTGAAAGATAGGGAGAAGGTACGTGAGAGAGGAACAGA




GATATTTGAAAAGCTTTTG






S01109-1
GATTCTGGTGACCCTKCTCTCGGTTCTCTCTTCTGCATCGTGT
304



GCACGAATGGTTGGGGGGAAGACGGAGATCCCTGAAGTGAG




AAAAAACAGGCAAGTGCAAGAGCTTGGAAGGTTCGCGGTGG




AGGAGTATAACCTTGGTTTAAAGCTGTTGAAGAACAACAACG




TCGACAATGGGAGAGAACAGTTGAACTTTTCAG[A/C]GGTGGT




GGAGGCGCAGCAACAAGTGGTGTCAGGGATGAAGTACTACTT




GAAGATCTCTGCTACTCATAATGGTGTTCACGAAATGTTCAM




CTCTGTGGTGGTGGTCAAGCCATGGCTTCATTCCAAGCAGCTC




CTCCATTTTGCGCCTGCATCATCATCCACCACCACCACCACCA




CCACCATGCATCCAGTAGTACGTA






S13844-1
GCCACCGTGTTTTTTAAGATCTGTGCTCATTAAGAAAAACAA
309



AGCAACTTGMTGAAACCTTTTATCCACATACATATATGGTTA




GTTAACCTTAATCCCCATTGCTCAAGCAGATATTAAATATTCT




TTGTGAGCACTGAGCAGCCCTCATATGTTTATGTACTGAAAG




ATCAATATTACTTGTTAGTGATAAAGACTAC[G/T]TAAGGGAT




AAGAATGAACATAGCTGCAGGAATATTCTTGGTTTTTTTTAGT




ACTGCACAATTAATTCTGTATTTATGTCTCTCTTTAGTCTTTTT




CGGCTTTCCATCATGCATATATCTAATATTTACTTTAAATTTAT




TGGTATCTTTTTTTTTTTACTCTTCCTGAATTTTATATTTCATAC




ATTCTTTTAATTAAAA






S05058-1
CCCTCAATACAGAGCCACTGGGCAGATACTCATCCATTTGAA
314



GTTCTCCCGACATTAATAGTGGATCTGTGAGTTTCCACCATTT




GAGTTGTGCTATATTATCACCATTTTTATCTTTTGACATGCTAT




TATTTGTAAATCAACCAAAAATGATACGCAATTTTGACTCAG




AATTGTTTAGACCAATTACTAATATTTGC[A/G]GTTCATTGTAC




TCCAATATTTGATAAGTTTGATTGGTAGGCATAACACATTAAT




CAATGAAATGGGGTGTAAAATACAACTAGATTATAGGGACAT




GTAACTTTCAAAGTGTTTTGAGTTAACCTGCTGTGACACTGAT




CAGCTGAACATGTCCTTTTTCTAGAAACTAGAAATACAATTGC




TTAATGTCAAAAACAAGA






S04660-1
ATTGGAAGGAATAAAGTTGGGGTTTTGGAAGCAATGGAGTGG
319



AATTCATTCCATTCTAGTTTAACAAATTCAAACAATGGAACAT




ATCAAAATTCCATTCCATCCTACTCCATTCCTTCAATTTCCCC




ATATCCAATCACATTTCTCTGTTAATAGTTTGGGTGCAAAAAG




ATAGATCAAAGAAGTAGATACAGGAAGGA[C/T]GTAGATGCA




GAAATGAACTTCTATGACGAAGGCYGAGGCAGGCGGCAACT




AAGTGAGGGATATGCCTTAGTAACAAAACAAGGAATTAAAA




CTTGGTTTATTTTATTGGTAAAACATTGGCAACATTTTTCTCA




GCCATGTCCATGTAAATGTGCATTTGTAATAAAAGAGTTTGGT




GTAGTGGAGCATGGTTATTGTAA






S09955-1
GGGTCAGATTAGAGAGTGAGATAMAGTGAGAGGGACTCATT
324



TGAGAGGAAAAAATAGTTAAAAATCATTGAGAGAGAAAGGA




GAGGRAAAATCATTRTGATTTTCGCATACCCACTAGAGAGCA




TTTTTCATATTGAAACARCAGATTGGTTCACCGTTGGATCGGG




ATGATTTTTGGAAATMTGGTTCAGCACACTTGA[C/T]ACTCCA




AGTTGTCTGGTTGGATCATGAAAAGATATCTGGAGAGAGAGA




TAAGTKCTTCATATTCTCTGTTCTATATTTTRGGATTTCCTCTT




CTTGTCTCTATTGTATCAACTCAGGGTCTGTTTTGATTTGGCTG




TTTGTAGCACATTTTAGTGTACTTGTTGGAGGCTCTCTTGTAT




CTTTATTGATTATAGTGGAGT






S08034-1
TTGAACCAATCAGATGAAAGAGGTTGAAACTTTGCAAGACAA
329



TGGCGAAGAATTGCTATTCCAACCACGCCTTCAKCAACATCA




GTCAAGAGGCTGCACAATGCTTGCCMTCTGTATGTAAGAGAT




TCCTTTCTAAACCCTGCTGTGTATGCTAAAAATGGAACTATCC




AAGATCCTACAAACCAAAATGAGGCAATCCA[C/G]AGGAGCA




TTACCTGCAGAACCCAACTTAGTTTGCTCTTCTAGATGAGATG




AAACTAAGAAAGAGACCAATCAAATAAACTATATAGTTTCTG




AATATTTTTCAATTCCATCCATTCACAAGTTCTTAATTGAAGY




AGACTATAACAAATAGCCTTACTGTCAAATCAATAAAAAAAT




TATAATAAGTAACCAACTTTTAG






S10293-1
TTTTTGTATTAGAATCATGAAAKTGTGACTGAGATTTTGTGTA
334



AATGATAAATTGAATATGTATTGAATTGTAAGATACATGTGT




ATTGAGATGTTGTGTGCATTGAGTTGTAAGCTATGAACCGTAC




AATCACACAACTTTAAGACCCTTTAAGGGRGAHGATTTAATG




CACGACGAGTATTGTGATGAGATCGACTGT[A/G]GAAACCCC




ACGAGTTTAATCACTTTKAGGCARGACRAGTTAAATTTATTTT




GAAAATAATTGAAGAGTCGTGTGTTTTGTATAATTCATAGAT




AAAGTCTGGATGCCCAACGAAGTTTTTTACTGACATGATACC




ATATTGCATATATGATTGAGTCTTAGTATATTTGTTGCATAAC




GCTTGTGTATTGATCGATATTG






S03813-1
GTGCTCATCAYGTGTTGTGCATGGAATGGCAGAGTTGAAGAA
339



TCTCTTGAACTTTTCAGGGAGTTACAGTTTACTAGATTTGACC




GGAGGCAGTTCCCTTTTGCTACCTTGTTGAGCATTGCTGCAAA




TGCTTTGAAWCTGGAAATGGGTAGGCAAATCCATTCCCAGGC




TATTGTAACAGAAGCCATTTCAGAAATTCT[A/G]GTTAGGAAT




TCGTTAGTTGACATGTACGCTAAATGTGACAAATTTGGGGAA




GCAAATAGGATTTTTGCAGATCTGGCACATCAAAGTTCAGTT




CCATGGACAGCCTTGATCTCGGGTTATGTTCAGAAGGGACTC




CATGAAGATGGCCTAAAGCTATTCGTTGAGATGCAAAGAGCC




AAAATAGGTGCTGACTCGBCCAC






S02042-1
CCTTGTGTTCTCTAAGAACTATGCATCTTCTTCGTTTTGCTTAG
344



ATGAACTTGTTAAAATCATGGAGTGTGTTAAGGCAAAGGGTC




GGTTGATTTTTCCCATTTTTTATGATGTGGATCCTTGTCATGTG




CGGCATCAGTCTGGGAGTTATGGAGAAGCGTTGGCTATGCAC




GAGGAAAGGTTCACAAGTAGCAAGGAAA[A/G]CCTCAAGGAG




AACATGGAGAGKTTGCAGAAATGGAAGATGGCTCTTAACCAA




GCAGCTGATGTGTCTGGCAAGCATTACAAACTTGGGTATAGT




ACCCCTCTTCACGAGATTTTCCAATACAATCACGTGTTCATGG




TCCCGATCAATCTCCACGTGACATAGTCAAGGTCAAGATTGG




TCGGGACCATGATCACTTGGT






S16601-
CAATCCCAAYAGCCTGCTYAAACATAGAAATAAAGGAATTTT
349


001
ATTTGAAAATTACTTATTTTTCAGCCTTTTGAAAGAAGTTTTG




AAAAAAAAACAAACTATTATTTCTTAAAGGTAATCTCGTACC




AAACATAGTTGYATTGTATCTGAWTTCACACATGTACTAGGC




TTTGGCAATGCTCACAGTCCAAGCAGCTTCA[A/C]AACATTTA




CACCCTGAAGCATGTGGCAAGTCAAGCTGTGTGAAAGGTGGA




ATAGCAGTGATGTTCTACACATCACTGTGCTTGTTGGCATTGG




GAATGGGAGGGGTGAGAGGATCCATGACTGCATTTGGAGCTG




ACCAATTTGATGAGAAGGATCCAACTGAGGCAAAAGCCCTTG




CAAGTTTTTTCAATTGGCTTTTG






S01481-1
AGCAATAATTCTATGGCTTTTACTTTATTTTTTAGTATAACTA
354



AAAAAAAAAGAAAAAAAGCCAGAGGCTACACCAGCATACTT




GACCAGAGATTTAACTTAAGCAATAATCATGAGATAAATGGT




TTCATCTGTCCTATATAGCAGCTYAAGCTTTCAGGCTGGCTGT




TTCTTTGACATGACCATAAAGCTTCAGTCAC[G/T]TTATGGTCC




AAAGTTTGACTTTGGCACCCAGAGTAGAAATGAGATCGTTTA




TCCTTATCTAACATGCAGTTTTAAATTCAGTAGTCCTTTRWAT




TCATATTATATATAGCACCAACAAWGGCCATGACATAGGAGA




TGGGAAAATACAAAAAATGGTGAAAGTCTATARCAGCMTAA




AATGGATTCATTACCTTCTTTCT






S11309-1
GTTTAGTCAAGAAAAACAAAAAAAAAAGTAGTAAAAAATGT
359



TTTTAATAAAGTGAGAGTGGAAATTATTAATCGTGTAGATTTA




AAAATAGTTTTAGTTATCATGAAGAAGTAACATATATGGATA




GAAAGTTAAATAGAACTGGATCGGCGTATATATTGGGCTGGA




CCGGTCTGGGACAATGATTGGGCTCCAATTTC[A/T]TGGTTCT




CGTTCTTCTWGGGCAACTTGATTAAGTGATCTAACTTGTGGAT




AAAAAAAGAGAAAATAAATAAAAATAAAATTAACAATTAAA




TGTAAAATTAAAAAGTGTAAAAGATATAATATGCGTTTTTATT




TCTCTTCCATAGAATTTTGGTGTATATAATGGCGACAATAAGG




TTCAAACCTAAGTCCTTTCTCTT






S11320-1
ACTATAGTTTTTATTTTATTGCGGTTGTAATATACATGTTTTGG
364



TTAATTTTTAGATATCTGCTCTGAGGAATTAAGTGTTTCTCAG




TCTTTTGAACTGGATGGTGTAATCTCACTTTTGAATCGGATGC




GAATATGAAGTGGTATTTGGATTATTTTTAATGGGTTGTGAAT




GAAAATAACGTTACCTGTTGAGATCGT[A/T]TGTTTTATAGCG




ATAGTGTTTCATAGTAGTGTAAGCTGGCTTACATTGCTTTTGA




ACTTTGGGCGGTCAACTGATGGTTCTATKTGTCTCGTATGTAT




ATGGTCGATCCTTTGCTGTTAATGCGGCGTGTGCCTTTGGTAT




GTTGGTTTTYGGGTGCTGCAATTTGTAGTTTCTTGGCAATCTC




GTCGATGGTACTTCAA






S04040-1
AACTGCAAAGGTTCAAGTAGATACYTATTGGCCAACCTTATT
369



TGCTAAACTTGCTGAGAAGAAAAATCTTGGCGATTTGATAGC




CAATGCAGCAGGCGGTGGTGCACCAGTTGCTGTTGCAGCTGC




CCCTGTTGCTGCCTCWGGTGGTGGTGGTGCTGCTGCTGCCGC




CCCAGCTGCTGAGGAGAAAAAGAAGGTTTCCC[G/T]CTAGTG




AATTTGTTGTTCCTCCAATTGTTTTTCAATCCTGCTTTATGCTA




GTTAATGTGTATCTAATATGAATCTGTGTGTTTCTATTCTATA




GGAGGAACCTGAAGAAGAGAGTGATGATGATATGGGATTTG




GCTTGTTCGATTAGGGACATTCTCAATATGATTTGGTTAAATT




TTGTGGTTCTTTACCTTTAAGTT






S00863-1
TGTGCTCCTAGAGGAATATTTTGTGTAGACTTTCTATTATCTTT
374



TATTTTTTCATTTTTTAAAATTCAAATGTTAACAATTCAAATA




AAGAGAGAAACTAAAATTTCATAAAAGAGAATACGTGTATTA




ATTYTATTTTTGGTTGACATGACTTTTTCATACTAAGTTGGAC




ACCAATTGTGTGTGAGACTTATACCATC[A/T]TGTGAGTGTTG




AACATTCTTAGTGTATAACACTGATAATATAAAGGGRTAGAC




ACTTTTGGTCTTATAGCKTGTGTGATGCTGATTAATAATTAAC




AAAATATTTTCTTTTTTGWGTGTGGATGATATATGTGAATAAC




ACTTAAGTCCTTTAATAACTTTGCTCACGTCCACTTGTCATAA




ACTTATTAATATATNAAA






S17151- 
ACGTTACCCTTGCACTGCAATGCGTCATTAGATTTTATTTTAT
378


001
TTTNTTTTATCTRAAATAACTCAYAATAAATTTTACAAGTTTA




GCTACGGAGAAGATAACTAGATAAAGGAGCGATTGATGTAC




ATTTTGAGGGTGTATGTAGGTTGGAAAARGAGAGGCATGAGG




GGGAAAAAGAAGAAAAAATTCCACTAATGTG[A/G]TATGAAA




AAAAGAGTCACAGATTGACTGAGACTYGTCCCAAACAAGCAT




GTTAATCCTTGCAAATGCGTAGACATAAACATTTTTTTAGTTA




ATTACCTTTTTCATCTCTAGAGCTACAACAACTTTCTCATTTA




ATTTTTATAGTTTAAACATCTCATTTTAGCTCTTATAAGTATAT




AAAAAATTTAATCTTTTTTAT






S17153-
TAAAGTTYAAGAAGACACATGTTAWTTAAACATATTAGTTTA
382


001
AAAWGTAAATTACACTAATTATCCCTAAAGTTTTGAGAAATT




ACACAAATTTCCCCTACTTTTATCTACTCCTACACTAACCCCC




TAATTTTTTGAAAATATATATTGATACACCTTATATACAACAA




TAACTGCCCTTAATTGCTCAAATTTCCACT[A/C]TGAGCCTCTT




TATCAAATCTCATATTGCATTGTCACACTCCACGTCAGTTGCA




ATCACTCACACCCCTCCTATATAATACATCTTCACTCTTTGCA




TCCTCACCCTAAACCAAACCAAATCGAAGACAAAACAATTTT




AACATCAATCTCAAGGGTATYTTTCTCTCTCTTTTCTCTTTCCT




TATTTAGTTTAATATATA






S17154-
AAATTTTGCATATGGAGGAAATTGAATTGCTYTATCCAAACA
386


001
AAGAATTTACAATGRAAGAAATTTGAATTGATTTATCCAAAC




CAAGTATTTGAAAAATGAAAGAAATTAAAATCAAAACAATTC




AAATTTTAGACATTTTAAATTCTTTGAAATTTCTGATYCCAAC




ACAAGATAAATGTTTTTCTCGTGCTTGATTG[C/T]GATGGTCAT




TTCCCACCGRTAAGGTTGGAATAACAATATTTGTTGAAATTTT




AAGTCGCTTTAACTTAGAAATGTGGTTCTAGCAAGTGTTAATT




TACCTTCCTTTGACAATTCATATAATATTTAAGATTGCTAATG




AATGAGAAACAAAGTACTTTCGTTTTGCATTTTTTTTTCACAA




GAAGTCAAAGAACCTTTTT






S17156-
GGTTATTTCTACTAAAGTTCTCGATCTAACGGCTTATTTAATT
390


001
TTTTATTAGGAAAGGGAGGACAAATTCATTTCAAGAAAGCTA




TAATTTTATTTGTTGACCATCATTAAAAGAAAAGAAAAATTA




AGGCATACTAAATTTACAATTTAATTAAGGAAAAACTCAAGA




ATGCCCTTCCAATGAAATAAAGCACTTGGAT[A/G]CCTGCAGG




GTCAATTAGTTGTTGAAATCAGAARAACATTCTGAAAGCATC




AACAACTTTCTCAGGCCTGTCAAAARTACAAAGGGTATATTC




TTAAGAGGTTGAACAAATCATTTTACTATTCACTGAAATCCTA




TGTTAACTAAAGTTACTTACCAGTCCTCTTGTGGCATATGACC




AGCTCCTTCTATCAATTTAAGC






S17159-
AAATTAGTGAAAKAAATCTTATTTTAAAGAAGGTAGATGTAT
394


001
ATATGCGTACGTGTACCGACATTCACAAGCAATTAATTCAAA




TCAATAATTGAAATAACGTGGGGAAGTGCTCTTATGTTTTTTG




AATAACATTGAAAAGAAACAGCGGCAATTTAAACTTYAAAGT




CTCCAGCCYAACAAATCTCAAATGGGATCAT[A/G]GGTGGAC




CACGTGTCTTCCASAATATAGARTGTTGCTAGGTGCACCCARC




ATTCTTTAAAAATGAYAAAATTATCCCTGYYAATTTCTCCCCT




TACCTTACGGATCAAATTGATCCGTAAAATACTTACGGATCA




ACTTGATCCGTAAGGDATATTTTTGTCTTTTCGTGGTTAGTGC




TRGATGCACCAGCAATAATACT






S08590-1
CTTTCATGTAAGAAGTCATTTGATATTAACAATGAAGTTATTT
399



ATCTTCTCTTGACGCTGATGGACTACTTGTCTATTTCCAGCTA




TGAAGTTATTTATTAATTTGACTTCTCATTAACGCATTTTSTGT




TCCTAATTRGTTTAGAACAACTAAAGACCCTTGAAAGACGAC




CAAAATTGTTTGTCTTGTGTTGCAGATC[A/G]GTTGTCATTTGG




CCCGCGTGTACGGCTGCGGATAAGTTATTTATCGAAATAAGT




GTCAGATCAAAGGACGATCTACGTCCCTTAAAAATTTCAATG




ACAACAAACACATTATAAGAATTTATTTATATTTTAAATTAAG




CATCGCCTTTCATCCTAACAAATGTATTTTTAACGCAGATTAT




TCGTCYATAACATTATTT






S17242-
GTATGAAAATGATTAGTGCTGTGACCTGTGGAMTTTTCCTAA
403


001
CTAAATACATTTCTTTCACAATTGATGACGTTACAAAGAAAGT




GAACTACAACTAATGCATATAATGTGTCTTTGTATGACCGTAT




CCAGGCATAACAAAACCATTTAAAGTTCAAGATACGCAATTA




CTTGGGGAATAAACATCGTGCTTTATAATT[A/C]TTGTTGTTAT




GTCACTACGTCAAGGCACTCTGCTATCACGCCATTGTATTTTA




ATTTATCGCAATAATCAGTCTTAAATGTTTTCGAAAATAAGAT




ATTGTTTTATGATATAAATTTTTTGCACTAAAGTCGAATAATG




TCTCTTTCTTTGTAAGCTTTCTTGTTATTGGGCACATGTATGCG




TTTACAAGGAGAAAATG






S17166-
ATGCATATGTGTGTGTGTGTATAAATGGGTTTTTAAAAAATGT
408


001
TGTCAACAAATAAAAAAAAGGTAATTTCATTGAATTTTATATT




AAGCTAACAATTTATTGCTGCAATTTTTATTTTGGCTCGATCA




TATTAAGCTTAATTAAGTCCGCAAACTGTAGTACAAATCAATT




TGGACCAACACATGTCCTCCACAAGACA[C/T]GCAGCAGAAG




CCCATTTAATCAAATGACAAAAGGTAAATGCTAAATAAACTT




YCCAAGTTACTTTCTACAACCCCCTTTTCGTTTTGATTTACCTT




TATTCCAAACTYACCTTTTATCTTCTTCAACTCCCTTTATAGTT




TTATTATATCABTCATGGGCACACTCCCTCTTCTCACAGTCGT




ACCAGTCATATATGCAC






S17167-
GAAATTTTAACTAAATACATAAGACTTKTTTAGGGTGGTTATA
413


001
AGTTTTTTATTTTTGGTTTTCACATGTAATTTTCAAAACAAAA




CATTRTTTGATAAAGATAGAGTTCAAATAATTTTTAATTTAAT




TTTAAAATACTTTTACATTCATTTTTTAAACTAAAAAAAAAGA




TCTTAAAATTTGATTTTTAGTTTTAAAA[A/G]AGCACAGTTACC




CTACATTTGAAAATGACCCATTTTGTTATTGTTACCACTATTG




TTGAYACCACCAACATCACTATCATTTTCACCATCACTGACAC




TACAACCATCACAACCAATATCACCATTATCATTGTCAATCAC




AACTATCATCACCACCCGCCATTAACATTGGCATCACTGTTGT




TATTAYTGTCGTCATC






S08539-1
TGTTAAATYTATTTTTTTTAGTTAAATATATAAATACTCACTCT
418



TATTYTTTTTCTTGTRTAACTNTTTTACNAATGTTATCTTTCAC




TTHTGTAAAGAGTCAACAAGARGTTATGCATATTTTTCATGAG




AGATTAACATGYTTTRAGTATCATGCATYCAAGATCAWTGRT




TGATCATCATTGTTAGAGCTTTGAAGA[A/G]TTTCTTATTYTTT




GGACTCAARGTGTATTCAATTCAATAATCCGTTCATTYAAGAT




TATTTTTAAATATATTTTGATGATCATAATACAATTACAAAAC




CAARACRCTAAAAAATAATTTATTTAAATAATAAAAAWAATT




TATCCAAATGATTYCTAACATATATGTTGATGATCACAATACA




ATTGTAACACAAGRTC






S17178-
ATCGCATTTAGTGTTCCTTTGCCATCGTTAATGAAGTTTTCCA
422


001
GTAGTGYTGCTTATTCGTTTTCCTTTTTTGGGAAATTTTTKATA




ATATGTCCATWGAATCCTYAACACTTGCAAATTAGATAAAGG




CCTCCATACTCAACATTGAATCTATGATTGTGAAACCAAACCT




GTCCCACCACCATTCGGCTAAAGTCAAT[C/T]TTTATACACAC




CTTGACWAACTTTTTTTATGATGCATCCATGGTTGAAATATCA




ATTCTCAYGGGTCGTTGTTCTGCTGAAGCTAAAGCTACAAGA




ACACTTTCATCATAATACTCTAACTCTAAAAACGAGATCTTTA




TCCAAACTAAGGTATTGTTAACCTTCATATTCAATGGTTTGAA




ATCCAACGACCGTAGTCT






S17179-
AGGGAATATGCATGATAACGAAGCAGGACTCGAKCATTTTCT
426


001
TCTTCTTGGTCAACATATATATGGGGCCTAGCTAATTAACTTC




TTAAATTAATTAGATTATGTCTACAAATTTATTTCAATTGTAK




ATTTAWATTAAAAAATCATTTTTSAATCAAGTTCAAATTAAA




AAATATTTTCTAACTCTAAAAGCAAACTGG[A/G]ACATAATTT




GGGCACCTAACTAAGTAAACTGGGTTAGTGAGGTTTATCTCA




YCGATGTGGGYGTATTTTTTTGYAATAAGTTTCATTTTTGGCT




TATTKTTCAATAATATAGAAAAAAATGTGATACGATAAATTA




TTTTTGGTAATATTTAACCCAACTTTTTTNGTTTTTGTTTTTTTT




CCTTAAACACTCTTTATGT






S17180-
TATACAGATACTACTCATTTTTAACTTTAATTAGAAATAGTAT
430


001
CAAAACCACGTATTAAACTGTATCCAAGTTTATCTTTTAAGAA




ATTATTCGTTTTTATTTGGTTTGTTAACTAGTACTATTAATTTT




GTTCAGTGCATTTCCCATAGAAAGTTATTTGTTCTTTCTATTTT




GAATTTGATTGCAAGATATTCAACTT[A/C]ACTGAAAGCTTAC




TAGGTTTTATCATTTCTTCTAGTTTTATTATACAAATCTTTATA




ATACTTTTTRCAAKTTTTTTTTTTCTCATTTTATCCTATCTTGTC




TCAGATTTTTTTCTTATTTTTCTCACATTGTAAKAATTGTAAAA




AAAGAAAGGCGTACTTTACTCAGCGCAAAKAAATTAATCATT




AATTCATTATAG






S17181-
TATTATTAGGCTTTTCACATTTAAGGACTGGTAAAAATRTGAC
434


001
TAGTTGACTGATATTAGTGTATTGTTATTTCTTATCTAATTTTT




TATATGYAATTTTGAAATTTATATTGATACACTCACATATATC




CCAKCATATCGATTATTGATATACCGTATCACTATTTATAKGT




AATCACTAAAATTACACACTTAAATTA[C/G]CACTAACTTTAG




RGCACATTATTTTCTGGAAAGATGTAATACAAAATTGGCCCA




TTAGCTCTTTTGAGTTTTGACCCTAAACCTTAAACACATTCCG




TTTCTGTATAGTCTGTGGTCTATGATTTTGATGTKTTCATTTGT




TTTATGTGCAACTAATATTAAACAAAAACACTTGAAAATCGA




TAAGCACAGAAGGTATC






S17182-
AAAATTAATTGGTGAACCATATATCATCTCTAGAAATTATATT
438


001
TAGAAAGACCAAACTCATCCTCATGCTCCTGAAGAAGAACAG




AAAGAGCTTTGGTTATCTCTTCTGCTGCCGGAATAAACTAGGT




TTGGAGCTCTACCATAGCTTTGGACTCAGATATACATAGATG




AGAGGTGACAAGTGAAAACACCCTATTTTA[A/T]GAATATTCC




ATCAACATTTATGACCTATTACACTTACATATCTCTTTTTTCTC




TCTTTCTCTAAGCMTTGGATAGCATGCATGCATGGAGTGGTC




AATGCAACATTTTCCTATAATATTGTTACATCTTTATCTCAAA




CAACCTTTGTAGCAATGTTCCTATAAATAACCCCTGTCTCTTC




AACTCTCACAGTGACTTTG






S17183-
AAAAACTTATAAGGTATTTTTATTATTTAAATGAKTAATACCA
442


001
CCGACTTWGGCATAATTACATGACTAATTTTGCCGTTACTTGA




AATGAAGACGAGAGAACTTATAGCGTGGAATCCGTGGGAGC




ACAATGTTTGTGGGGCATGGGAGATCTGGAGCCGTTCATTGA




CGATGTGGTTTGATCGTGCGGTGGATGTGAA[G/T]TGCCGACA




GGTAATACAGGTAAATGGTGGCCACGHATGCATGTAAAAAA




ATGAATGAAGTTTATCTGTTTATACATTGAATGATGAAAATG




GTGGTGGAGGAAGTCTTATTCTTCTTCGGTTGGTGGGTCCCMC




TTAGATTTTGGTTGAGGTGRCACCATCTTTGAAAAGAGATTTC




GGAAGATAGCTAGAATAAGTGAA






S02780-1
GGCATGGAAGGGCTACATTTTCTGTTCTTCTTTTTCAGGTTCC
447



TTTGTAACTTACCATCTATCTAGAAACTGCAGGATTCTCTTGT




AAAATAAAATATTAAATGATATAATACTTGAGATATGTAGTT




GGCTAAAYTTCATCTTATATGAGGCATTTGCTTCAATTTTCCA




GACTATTGCCTTTACCTTCTAGACTCTGG[A/G]TAACCTGAACT




GCATATCMTATAGGGGCAAAAGTATGTTATTTTGTCAGCATA




TAAACATGTTTGCATCCTATACAGTCAAGTATTCTACACAGAT




ACTATAGAAAGTAGAAAGAATAGTGGTRCTTTTCACTTGTTTC




TGTTGAAAACTGAATACAAAGATATAGAGAGAGTAGAGAGA




AAAGGGAGATAAGGTTTCTC






S12107-1
GAGAAGGTGATAATAATAAATATGAAAATGACTTGAGATTYT
452



GACTCTTGACTCCTAACCAAAATTTKAAAGCTTTTTTACACAG




GGAGTTCCACTTTGAATTCCCCATCCTTGAAAGAAGTGGGGTT




CACCMAATGCTACAGCACGAAACTTTTCCATCTTGGTGRCAA




AGTGCGAACAATATTGAAGGTGACAATAGA[G/T]GGAGYCGA 




TTGGGAAGGTGATATAAATTCTGCGCCCATGCAACAGTTTGA




GGAGGAGGCAGCCTGAGTKAGTTTAATTTGGTGACAATTCCC




CTCGACTTGTATGTAAGTACATACATTGATAATTATTTCTCAT




GGACATGCAATTAATATATGCTGATCAACTGCTACTAACTGA




GNGAGAGAAGGCATTAAATATCT






S03624-1
CTCTTGGTGCAAAAAAAWNTACACTATAGAAATCATGGTAGG
457



TATGAMTTTTAAGGTAGTTATTGTACAGGCCAATAAACTTAC




CATCGATGTATAGACTATAGACKATTTTCTCTTTATATATGAG




CTGTTTATCCATACTTTTTTTTCTCAAAAACATTACATGCATA




ACCTCTCATCATGTAATCATTTAATATGTG[A/G]CACTAATGA




TGCTAACTTGAGAGAAATTTACCTCTAATCTTATTTGCAGATG




CATCTACTTCTTCATGCTCCACCGCAAGATCATCTGTTATTGT




AGTGATTATTCTAATCTCAGGGCAATCATCTTTACAAGAAAC




ATCAAGACTATCTGCTTCTTCTGGTCCTAACTGTTCATCCCGT




TCAGAAGAATAATGAATTAA






S01953-1
TTATTGAAAAATATAGACTTATTCCGAAAGTATATCTCCAAT
462



WCTGTGAATCATAGTTCCAGAAATACATTTCTAGAATAGATA




TATGTAATTCTGGAAAGACATTTCCAAAAAGCAAAAGGAGTG




TACTTCTTTATGAAAAACGGTGAAGGGTATGAGGGTGTDCTT




AGGAAACTGGTGTAAATATCAAAATTCCTTCT[C/T]CACTTKA




TTGATTAAAAAAGAGGCACAAATCAAAGCAACAAAGGCTCC




AAAATTAGTGGCCCGACACGATAGATAAAAGGGAATTGCTAT




ATCCAGTTCCTCTTTTTGCTAATACACTCCCATTTTAATTTTTA




TTTTCAAAAGTACCCCTAATAAATACACTCCCTGTACCATGAC




ATCATCATCATCCAACCTACGAG






S00111-1
TTTGGCCCAATCCAATCCTCATAATCAACCTCTCTCTCTGACT
467



GTTCACTCTGTGTTTGGAGGTGGAAACTTAGGGTATGATTTCT




GATTCCCCTTCCTTCTCGCTCGTTCTCTGCTTTAATCCATCAAC




TATCCTTCTWACTCATATTTCTTCACTGATTCAAGATACGATC




AAGTTTCCTTATCATTTGCTCCTAATT[A/G]CGTTTACCCTATA




CACACTTTTCCAGTTATTCACCRAAACCAACAAATAACAAATT




TCAGGTTGTTGGAGAAATTGTTCTGTTGGGGGGATAYGATGT




CGATGGAGAAGGAAGCTTCAAGCTCCACACCTACGCGCAAAT




TGTCGTGCACTGCATGCTTCGACGCTGTCTATTTCTGCTACTG




TAATACCCATTGCCCTA






S04180-1
GATGATAAAACTTGGGGAGAATTTTGTCAAATCCTCCCCAGC
472



GCATTTCTTAGTCCATGCTAGCAGATTTACTTTTTTTTTTTTCC




ATTTCTTCTGGTTGTGTAATTCAGGGATTAATYGATAATTAGA




CGACATGATTATGTAAATATATGTGCATTCACCATTTATGAAT




TTTGATCCTTATGTAGTCTTACATTGAC[A/G]CTGTTCCTTTCT




CTWTATAATTGTGGCACATTATTCTGGGTTTTCATTTTTCAGT




ATTTGATTTCTCTCWACATCATGCACTGAGCACCTGCGTTAGG




CTGAAAGAAAATAAATTAATAATGTTTTGATTCATAAMCTAC




AGAATTCAAGCTTTCCTTTAGTATYAACTTATTGAGTAGCTAA




TGGCMAAATTGGAATTG






S01008-1
TATGTCAMCTTCACCTTGGGCAGRAAAGAAATTCCGTACTTG
477



GACTTAAAGAATTTTCATGTTAAGCTATAACAATCAGAGAAA




GATATTAATGAAGCAGCAAGCACATAATATGGAGATATGTGA




GTTGCACCTTCAATCTTGGAGGACAAATCATGCACTGGATTGT




GAAGAGAATATAAGCATATAGACTTACAGWG[C/G]AATTGTT




ACAYGTAGCAAAACTACATGAATAACAATCTAGTTAAAGCAA




GGGATGCATACTAACAAACATCAAACTCTTATCACCTCATCT




AGTCCCACGGTGGATCTAGTTTGAAACATGTAAGCAGTCTTA




AAGCAAAATAGCAGGCATATSGTATCTATCTCAAACAGAAGT




GGATAKAACAGTAAACCACATGCCA






S12862-1
GGGACCTAATTTGAAACATTCAAGTAAAATATATTGTTTAATC
482



TACCTATTACATCAAGGTGATGACCTTTTTAACCTAACCTACT




ACTCACATCCTCACCAATATTCAAGGATTGAGTCATTATGAC




WTATCAATATGATACACGATTGCACCCATATCGACCTTATTG




TGTGTTCCATACTCTGAATATGATCGAATC[C/T]GATAAATGTT




CATCGAAACAACCGGATMAATTACAGCTTGAGGATACACAA




CCCTCGAACCTTAAGTATAATGTACGTAAAGGCTAAGGCTTG




AGGATCGGGTTASAATCCGAGATGTAATTCATTTTCGTTTTAG




TTCATGTTGACTGATACACATGTCATCCTGCACTTGTCACATG




GGGTGTCCTACGTGGTTTACT






S12867-1
ATTGTAATATAATTGAAANAAAACATGCATTCATGATATGTA
487



TTACCGGCATTCCAACCATGGCGCGCGGATGATGAAAAATGG




TTGTAACTTCAATCAGACTTGTATTCACAATTAAGCAAAACTG




AAACCCAAACACACGTTAAAACTCTTGTTCAGCTCGAGCTTA




TARCATTAMGAGATCATGACCACATGTAACT[A/G]TTATTATA




ACACACACATACACACATGAGAAGGGATCATATTTATTCTAG




GATCAAATATACATGTGTGGGACCACTTGAACACAAAGTTAT




GTAGARTAGTTGTTTCATGCCATTGCTAATCAGGACTTCTCAC




TGCCAATCTATGTGTCTCATTCTCTCTAATMTCTCTGTCATTTT




GTGTCTCATTCACTAGGATGA






S04966-1
CTCATCACAGCATGCTTATGGCGTTGTCACACWAAAGCATTG
492



AAGATAGATGCAGATAAGGATGTTCGAATGATGGTCGCCGTC




AACGCACGTGCTAAGTTCAATCCTCCTTTACCTGTTGGTTATT




ACGGTAATGCCATTGCATACCCAGCAGCAGTCACCACAGCAG




GGAAGCTTTGTGGAAATCCATTTGGGTATGC[A/T]GTGGAATT




AATAAATAAAGTGAAAGGTAAGGCCACACAAGAGTAWATGC




ATTCTGTGGCAGATCTATTGGCTATTAAGGGACGATACATAC




CAAGAATGGTGAGGTCTCTTACTGTGTCAGATTTGAGAGGTTT




TGATCCCAGACAAATTGATTTTGGGTGGGGCCATGCTCTCTAT




GCTGGACCAGCTCAAGGAGGCCT






S10631-1
AACCGTTATGGTTGGGGATCGTTGCTACCAAAACCAAAATCT
497



CGGGACGGATCAATGTCCCAATGCTTGGTATGTCTGATGCCA




CTTTTTGATTCTATGTCTCCTCAATTTTTTTTTTCTGATTCCGTC




TATGATTGYTAGATTAGATCAAAAGAATTAGCCGGTTTACTT




GGAATGTTTGAATCATTTCTAATAAGATC[C/T]GAGTTGGAAG




AAAAGACACATGTCTAGAAATTCGACAGATCTTGCATCAAAT




AGAAGGGAAAATAATTAATCATTTTCATAATTTTTTTTAGTAT




TTACGTCCTAATTAGTAGTCAATATGATAATTCTACCAATGAG




GTTTATAGGTCATGTAAGTAACTGTTTATTATTCAGAAAAATA




GGAATACRTAACATAAAAT






S01574-1
ATCATAATTATTGCAAAAACAATATCTGGTCTAGTGTGGGCT
502



AGATAAATAAGTTTTCCCTATCRGTCTTTGATATCAAGCCTAT




TTGAGCTCTCCCATTCCTATCCAATGATTTTGCTTGATGAGCT




CTCCATATGTTTTACGACCCAATTTGCTTGTCTCTCTAAGAAG




ATMAATGGCATATTTTCTTTGAGAGATAA[A/C]GATGCCTTTT




TTGGAGCAGGCAACTTCTATTTTCAGGATTTTTTTCAGCTTTC




CTAGTTGCTTCATTTCAAATTGAGTTGTCAACCTTTCCCTTAG




GATTTGTTTTCAACTTCATCATTTCCTGCAACAAACATATCAT




CTACTTAGACCAAAAGGATTGCCAATTTACCTGTCTGAAAAT




GCTTTATAGAGTGTGGTCA






S16594-
CAGAGGTTGAATGGAGAATTTGCACCATTTGGGAAGCTGCTT
507


001
ATGGCCTGATTCCTACAAGTTCCTCAGCTGTTGATCTTCCGGA




AATTATAGTTGCAACCCCACTACAGCCTCCCGTGCTGTCATGG




AATTTGTACATACCCCTATTGAAGGTCCTGGAATATCTTCCTC




GTGGAAGCCCATCAGAGGCATGTCTTATG[A/T]AAATATTTGC




TGCTACAGTGGAAGCTATTCTTCAGAGGACATTTCCACCTGA




GTCCACTAGAGAACAAAACAGAAAATCAAAATACCTAGCTG




GCATAGGCTTTGGCTCTGCCTCAAAAAACCTGGCCGTGGCAG




AACTTCGTACAATGGTTCATTCACTCTTCTTAGAATCATGTGC




ATCTGTAGAGCTTGCTTCACGC






S02777-1
CTTTGAAAATATAAAGTAAAATATTTTTATGTGAATCTATATA
512



TATAAAATCTAATATCACATTCACACCCKAAAATTTATCTCAT




GCGAACATGCTTACAAAAGCATTGAATTGGAAAMAAACATA




TCGAAGTGCAAATACGGTATATCATACTAAATATCAGTTATA




TTTCCTTAATTTTAAAAGTTTGTTATCTTCC[A/G]CTTTAGACT




ATATATTCATCATTTTCCAAAATTTCAGTTTCCATTTCAAGTC




GAGTTTGATTCAATTCAGCTGTTTAGCGATDTTGAAGTGGAA




ACAGTCAGTAGATTTAGTGTACTGATGAGGTTGAACACAAGT




TAGGATATTACGTCTTGCATGTGAATTTGTTGGTCAATTACA




CTTGCTTCCATTCACWAAATT









The SNP markers identified in these studies could be useful, for example, for detecting and/or selecting soybean plants with a preferred reproductive growth phenotype. The physical position of each SNP is provided in Table 24 based upon the JGI Glymal assembly (Schmutz et al. (2010) Nature 463:178-183). Any marker capable of detecting a polymorphism at one of these physical positions, or a marker associated, linked, or closely linked thereto, could also be useful, for example, for detecting and/or selecting soybean plants with an altered reproductive growth phenotype. In some examples, the SNP allele present in the first parental line could be used as a favorable allele to detect or select plants with altered time to R1, such as a shorter time to floral initiation. In other examples, the SNP allele present in the second parent line could be used as an allele to detect or select plants for unaltered time to R1.


These SNP markers could also be used to determine a preferred or non-preferred haplotype. In certain examples, a favorable haplotype would include any combinations of two or more of the alleles provided in Tables 24 and 25. In addition to the markers listed in the tables (e.g., Tables 26-27), other closely linked markers could also be useful for detecting and/or selecting soybean plants with a preferred reproductive growth phenotype, for example exemplary markers provided in FIGS. 1-19. Further, chromosome intervals containing the markers provided herein could also be used.

Claims
  • 1. A method of producing a soybean plant or soybean germplasm with a late maturity phenotype, the method comprising: (a) isolating nucleic acids from a genome of a first soybean plant or soybean germplasm;(b) detecting in the first soybean plant or soybean germplasm at least one favorable allele of one or more marker locus selected from the group consisting of: i) at least marker locus S17166-001;ii) at least marker S17166-001-Q006;iii) at least one marker within a region of SEQ ID NO: 408;iv) at least one marker within 5 cM of Gm07:4319368; andv) at least a polynucleotide comprising a polymorphism at a genomic position of Gm07:4319368,wherein the at least one favorable allele is a C at Gm07:4319368 for marker S17166-001;(c) selecting said first soybean plant or soybean germplasm, or selecting a progeny of said first soybean plant or soybean germplasm, wherein the plant, germplasm or progeny thereof comprise at least one favorable allele associated with the late maturity phenotype; and(d) crossing said selected first soybean plant or soybean germplasm with a second soybean plant or soybean germplasm, thus producing a soybean plant or soybean germplasm with a marker profile associated with the late maturity phenotype, wherein said produced soybean plant or soybean germplasm has a marker profile comprising at least a C at Gm07:4319368.
  • 2. The method of claim 1, wherein said detecting comprises detection of a haplotype comprising at least two or more markers selected from the group consisting of S17166-001-Q006 and S16594-001-Q10.
  • 3. The method of claim 1, wherein said detecting comprises detection of a haplotype comprising at least three or more markers selected from the group consisting of S16601-001-Q001, S17166-001-Q006, and S16594-001-Q010.
  • 4. The method of claim 1, wherein the detecting comprises sequencing at least one of said marker loci.
  • 5. The method of claim 1, wherein the detecting comprises amplifying the marker locus or a portion of the marker locus and detecting the resulting amplified marker amplicon.
  • 6. The method of claim 5, wherein the amplifying comprises: a) admixing an amplification primer or amplification primer pair with a nucleic acid isolated from the first soybean plant or germplasm, wherein the primer or primer pair is complementary or partially complementary to at least a portion of the marker locus and is capable of initiating DNA polymerization by a DNA polymerase using the soybean nucleic acid as a template; andb) extending the primer or primer pair in a DNA polymerization reaction comprising a DNA polymerase and a template nucleic acid to generate at least one amplicon.
  • 7. The method of claim 6, wherein the admixing of step a) further comprises admixing at least one nucleic acid probe.
  • 8. The method of claim 6, wherein the amplifying comprises PCR analysis.
  • 9. The method of claim 1, wherein the detecting comprises sequencing.
  • 10. The method of claim 1, wherein the second soybean plant or soybean germplasm comprises an exotic soybean strain or an elite soybean strain.
PCT Information
Filing Document Filing Date Country Kind
PCT/US2014/021517 3/7/2014 WO 00
Publishing Document Publishing Date Country Kind
WO2014/149920 9/25/2014 WO A
US Referenced Citations (7)
Number Name Date Kind
6455758 Johnson Sep 2002 B1
8329982 Kyle et al. Dec 2012 B2
8847006 Jenkinson et al. Sep 2014 B2
20080256660 Jenkinson et al. Oct 2008 A1
20100122372 Sebastian et al. May 2010 A1
20110191693 Arlindo Harada et al. Aug 2011 A1
20120324598 Lu et al. Dec 2012 A1
Non-Patent Literature Citations (29)
Entry
Xin et al., 2008, Euphytica 162: 155-165.
Cregan et al., 1999, Crop Science 39: 1464-1490.
Elroy Cober et al., A New Locus for Early Maturity in Soybean, Crop Science, 2010, pp. 524-527, vol. 50.
P. B. Cregan et al., An Integrated Genetic Linkage Map of the Soybean Genome, Crop Science, 1999, pp. 1464-1490, vol. 39.
R. H. Ellis et al., Effects of Photoperiod and Maturity Genes on Plant Growth, Partitioning, Radiation Use Efficiency, and Yield in Soyabean [Glycine max (L.) Merrill] ‘Clark’, Annals of Botany, 2000, pp. 335-343, vol. 85.
Larry Heatherly, Soybean maturity group, planting date and development related, Delta Farm Press, Oct. 14, 2005.
Kunihiko Komatsu et al., Identification of QTL controlling post-flowering period in soybean, Breeding Science, 2012, pp. 646-652, vol. 61.
Chad Lee et al., Predicting Soybean First Flowering Date, University of Kentucky—College of Agriculture, 2005, AGR-184, 2 pages.
Wenxin Li et al., QTL Mapping for Major Agronomic Traits across Two Years in Soybean (Glycine max L. Merr.), J. Crop Sci. Biotech. Sep. 2008, pp. 171-190, vol. 11(3).
Weixian Liu et al., QTL identification flowering time at three different latitudes reveals homeologous genomic regions that control flowering in soybean, Theor Appl Genet, 2011, pp. 545-553, vol. 123.
L. M. Mansur et al., Interval mapping of quantitative trait loci for reproductive, morphological, and seed traits of soybean (Clycine max L.), Theor Appl Genet, 1993, pp. 907-913, vol. 86.
Hisakazu Matsumura et al., AFLP Mapping of Soybean Maturity Gene E4, Journal of Heredity, 2008, pp. 193-197, vol. 99(2).
D. A. McWilliams et al., Soybean Growth and Management Quick Guide A-1174, North Dakota State University, Jun. 1999, 8 pages.
C. D. Messina et al., A Gene-Based Model to Simulate Soybean Development and Yield Responses to Environment, Crop Science, 2006, pp. 456-466, vol. 46.
Stephen J. Molnar et al, Simple sequence repeat (SSR) markers linked to E1, E3, E4, and E7 maturity genes in soybean, Genome, 2003, pp. 1024-1036, vol. 46.
Praveen K. Pallikonda, Impact of E-genes on Soybean (Glycine max L. [Merr]) Development, Senescence and Yield, 2006, University of Kentucky Master's Theses.
V. E. Rozenzweig et al., Prospects of exploiting of photoperiod sensitivity gene E7 in early soybean breeding and revealing of its sources with SSR-markers, Institute of Genetics and Cytology, National Acad. Sci. of Belarus; Soya-North Co. Ltd.
Jin Hee Shin et al., Molecular markers for the E2 and E3 genes controlling flowering and maturity in soybean, Mol. Breeding, 2012, pp. 1793-1798, vol. 30.
I. M. Tasma et al., Mapping genetic loci for flowering time, maturity, and photoperiod insensitivity in soybean, Molecular Breeding, 2001, pp. 25-35, vol. 8.
D. Wang et al., identification of putative QTL that underlie yield in interspecific soybean backcross populations, Theor Appl Genet, 2004, pp. 458-467, vol. 108.
Satoshi Watanabe et al., Map-Based Cloning of the Gene Associated With the Soybean Maturity Locus E3, Genetics, Aug. 2009, 1251-1262, vol. 182.
Satoshi Watanabe et al., Genetic and molecular bases of photoperiod responses of flowering in soybean, Breeding Science, 2012, pp. 531-543, vol. 61.
Zhengjun Xia et al, Positional cloning and characterization reveal the molecular basis for soybean maturity locus E1 that regulates photoperiodic flowering, PNAS, May 22, 2012, E2155-E2164.
Zhengjun Xia et al., Molecular identification of genes controlling flowering time, maturity, and photoperiod response in soybean, Plant Syst Evol, 2012, pp. 1217-1227, vol. 298.
Da-Wei Xin et al., Analysis of quantitative trait loci underlying the period of reproductive growth stages in soybean (Glycine max [L.] Merr.), Euphytica, 2008, pp. 155-165, vol. 162.
Tetsuya Yamada et al., Effects on flowering and seed yield of dominant alleles at maturity loci E2 and E3 in a Japanese cultivar, Enrei, Breeding Science, 2012, pp. 653-660, vol. 61.
Naoki Yamanaka et al., An Informative Linkage Map of Soybean Reveals QTLs for Flowering Time, Leaflet Morphology and Regions of Segregation Distortion, DNA Research, 2001, pp. 61-72, vol. 8.
U.S. Appl. No. 15/125,738, filed Mar. 11, 2015, Status—Allowed.
International Search Report and Written Opinion—PCT/US2014/021517—dated Jun. 2, 2014.
Related Publications (1)
Number Date Country
20160024599 A1 Jan 2016 US
Provisional Applications (1)
Number Date Country
61781826 Mar 2013 US