Claims
- 1. A computer-based method for determining whether or not a first peptide sequence database obtained by in silico tryptic digestion of a second peptide sequence database contains one or more peptide sequences that correspond to an experimental peptide prepared y tryptic digestion of a polypeptide, the method comprising executing the following steps by one or more automated computer algorithms without the intervention of an operator:
(a) analyzing a first fragmentation spectrum obtained by mass spectrometry of the experimental peptide to generate a first peak list comprising an assigned mass value for each of a plurality of peaks detected in the first fragmentation spectrum; (b) interpreting the first peak list by a computer-mediated spectral read to construct one or more tripeptide search sequences (X) and deriving from the first peak list the following mass data: a mass (M1) of a sequence flanking the N-terminus of X, a mass (M2) of a sequence flanking the C-terminus of X, and a total mass, wherein each search sequence and associated mass data together constitute a search string (M1-X-M2), the one or more tripeptide search sequences being constructed by a computer-mediated process comprising performing the following steps in accordance with previously ordained criteria:
(i) interpreting the first peak list to deduce one or more tripeptide sequences within the experimental peptide; (ii) selecting at least one deduced sequence by vectorial quality ranking; (iii) permuting the deduced sequence or sequences to obtain a set of one or more permuted tripeptide sequences; and (iv) constraining the set of one or more permuted sequences to obtain the one or more search sequences; (c) searching the first database with at least one search string to determine whether the first database contains one or more candidate sequences that include a search sequence of a search string and are compatible with the mass data associated with that search string; and (d) performing a computer-mediated back-read that tests the candidate sequences, if any, against a second peak list derived from the first fragmentation spectrum that contains at least one peak absent from the first peak list and determining whether one or more candidate sequences fit the data in the peak list according to one or more matching criteria, the back-read comprising:
(i) for each candidate sequence,
(1) identifying one or more amino acids flanking the search sequence (X) that is included in the candidate sequence; (2) generating a list of theoretical m/z values of at least one suite of ions for the identified flanking amino acids; (3) comparing the theoretical m/z values or corresponding assigned mass values with observed values in the first peak list or a second peak list derived from the first fragmentation spectrum and recording any matches that support the flanking amino acids; and (ii) scoring the supported flanking amino acids and determining whether a candidate sequences satisfies the matching criteria, wherein upon satisfaction of the matching criteria, the candidate sequences, if any, that satisfy the matching criteria are identified as corresponding sequences.
- 2. A computer-readable medium comprising instructions for causing a computer to perform the method according to claim 1.
- 3. A computer comprising instructions for performing the method according to claim 1.
- 4. A peptide or nucleic acid database comprising information obtained by performing the method according to claim 1.
- 5. A computer-readable file comprising information obtained by performing the method according to claim 1.
- 6. A display comprising information obtained by performing the method according to claim 1.
Priority Claims (1)
Number |
Date |
Country |
Kind |
0022136.6 |
Sep 2000 |
GB |
|
Parent Case Info
[0001] This application claims the benefit of UK Application No. 0022,136.6, filed Sep. 8, 2000, and U.S. Provisional Application No. 60/232,273, filed Sep. 13, 2000; the contents of each of the foregoing are incorporated by reference herein in their entirety.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60232273 |
Sep 2000 |
US |