Claims
- 1. A method of analyzing a digested protein sample, comprising:
(a) generating an MS data set for the digested sample; (b) selecting at least one peptide represented in the MS data set; (c) generating an MS/MS data set for said at least one peptide selected in step (b); (d) searching a first level protein database to find candidate proteins containing said at least one peptide selected in step (b); (e) comparing MS data from the digested sample with one or more of the candidate proteins only, to find a match therebetween.
- 2. A method as defined in claim 1 wherein step (d) includes the step of:
(f) preparing a second level database containing only the candidate proteins, and searching said second database to find candidate proteins containing said at least one peptide selected in step (b).
- 3. A method as defined in claim 1 wherein step (e) includes the step of:
(g) narrowing the search field in the first level protein database to search only the candidate proteins.
- 4. A method as defined in claim 1 wherein the first level database includes digest data for each of the candidate proteins.
- 5. A method as defined in claim 1, further comprising the step of:
(h) generating in silico digest data for at least one of the candidate proteins.
- 6. A method as defined in claim 4 or 5, wherein step (e) includes the step of:
(i) selecting a peptide from the MS data set; (j) searching the digest data for the candidate proteins to identify the selected peptide therein; and (k) recording a match when the selected peptide is found in a candidate protein.
- 7. A method as defined in claim 1 wherein step (e) includes obtaining another MS data set for the protein sample.
- 8. A method as defined in claim 6 wherein steps (i), (j) and (k) are repeated until a sufficient number of selected peptides are identified in a candidate protein to declare a match.
- 9. A method as defined in claim 6 wherein, when a selected peptide of step (i) is not found in any one candidate protein, further comprising the steps of:
(l) generating an MS/MS data set for the selected peptide; and (m) searching a first level protein database to find candidate proteins according to the selected peptide.
- 10. A method as defined in claim 1 wherein step (e) is conducted on an online database.
- 11. A method as defined in claim 1 wherein steps (a) to (d) are carried out in a tandem mass spectrometer (MS/MS).
- 12. A method as defined in claim 1 wherein steps (a) to (d) are carried out in a mass spectrometer capable of generating MS and MS/MS data.
- 13. A method as defined in claim 12 wherein steps (a) to (d) are carried out on an ion trap mass spectrometer.
- 14. A method of analyzing a digested protein sample, comprising the steps of:
(a) generating an MS data set for the digested sample; (b) selecting a first peptide represented in the data set; (c) generating an MS/MS data set for the first selected peptide; (d) searching at least one first level protein database to find at least one candidate protein which, by a predetermined measure, is identified to contain the first selected peptide; (e) preparing a second level database containing only the candidate proteins of step (d); (f) selecting a second peptide; and (g) searching the second level database to find candidates which are identified to contain the selected second peptide; and
wherein, if more than one candidate protein is identified in step (g), further comprising the steps of: (h) selecting an nth peptide; (i) searching the second level database to find candidates which are identified to contain the selected nth peptide; and (j) incrementing n and repeating steps (h), (i), if necessary, until a single candidate protein is identified.
- 15. A method as defined in claim 14 wherein the step (c) includes the step of narrowing the search field in the first level database.
- 16. A protein analysis system, comprising:
(a) an MS unit for generating MS data on a digested protein sample; (b) a selector unit for selecting a first peptide from the digested protein sample; (c) an MS/MS unit for generating MS/MS data for the first peptide; and (d) an identification unit for identifying the protein sample, the identification unit comprising:
(I) a search station operable in a first phase for searching at least one first level database to identify candidate proteins containing the first peptide; (II) a memory station for storing at least one second level database containing only the candidate proteins; (III) the search station being operable in a second phase to find a single target candidate protein by comparing the MS data from the digested protein sample with MS data for the candidate proteins.
- 17. A system as defined in claim 16 wherein the MS/MS unit is in tandem with the MS unit.
- 18. A protein analysis system, comprising:
(a) an MS unit for generating mass spectrum data on a digested protein sample; (b) selection means for selecting a peptide from the digested protein sample; (c) an MS/MS unit for generating mass spectrum data for the selected peptide; and (d) an identification unit for identifying the protein sample, the identification unit comprising a general purpose computer programmed to carry out the steps of: (e) searching at least one first level database to identify candidate proteins containing the first peptide; (f) storing at least one second level database containing only the candidate proteins; (g) searching the second level database to identify a single target candidate protein by comparing the MS data from the digested protein sample with MS data for the candidate proteins.
- 19. A computer program product recorded on a computer-readable medium and including the computer executable steps of:
(a) initiating a computer data input to receive MS data of a digested protein sample; (b) selecting one peptide from the MS data; (c) initiating a computer data input to receive MS/MS data of the selected peptide; (d) initiating a search of a protein database to find candidate proteins which, by some measure of confidence, contain the selected peptide; (e) comparing the peptides of the digested protein with the candidate proteins in order to identify a candidate sharing a sufficient predetermined number of peptides to declare a match; and (f) generating an output to report the match.
- 20. A method of protein analysis, comprising the steps of:
(a) selecting a peptide from MS data for a digested protein sample; (b) recording MS/MS data for the selected peptide; (c) initiating a search of a protein database to find candidate proteins which, by some measure of confidence, contain the selected peptide; and (d) iteratively comparing the peptides of the digested protein with the candidate proteins in order to identify a candidate sharing a sufficient predetermined number of peptides to declare a match.
- 21. A method of protein analysis, comprising:
(a) preparing a sample comprising at least one unknown protein; (b) adding to the sample at least one bait molecule; (c) subjecting the baited sample to the method of claim 20, wherein before step (c), the method includes the step of building a binding protein database according to proteins known to bind with said bait molecule or a consequential molecule thereof.
- 22. A method as defined in claim 21 wherein step (c) includes the steps of:
(d) assembling a list of proteins known to bind with said bait molecule or a consequential molecule thereof; (e) conducting an in silico digestion of the list of proteins to form said binding protein database.
- 23. A method of protein analysis, comprising:
(a) preparing a list of known proteins and conducting an in silico digestion of the list of proteins to form a peptide database; (b) providing a digested protein sample; (c) recording MS data for the digested protein sample; (d) selecting a first peptide from the digested protein sample; (e) recording MS/MS data for the first selected peptide; (f) initiating a search in the peptide database to find candidate proteins which, by a predetermined confidence value, contain the first selected peptide; (g) selecting a second peptide; and (h) comparing the MS data of the second peptide with the candidate proteins in order to find candidate proteins which contain both the first and second selected peptides.
- 24. A method as defined in claim 23 wherein, when more than one match has been found in step (h), further comprising the step of:
(i) selecting another peptide and repeating step (h).
- 25. A method as defined in claim 23 wherein, when a match is not found in step (h), further comprising the steps of:
(j) recording MS/MS data for the second selected peptide; (k) initiating a search in the peptide database to find candidate proteins which, by a predetermined confidence value, contain the second selected peptide.
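For illustration only, the two-level search recited in claims 14 and 23 might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the protein names and sequences in `TOY_DB` are hypothetical, the function names are invented for this example, digestion is assumed to be tryptic with no missed cleavages, the first peptide is assumed to have already been sequenced from its MS/MS data, and later peptides are matched by monoisotopic mass within an assumed tolerance of 0.01 Da.

```python
import re

# Standard monoisotopic residue masses in daltons, rounded to 5 decimals.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # mass of H2O added to the summed residue masses


def tryptic_digest(sequence):
    """In silico digest: cleave after K or R unless the next residue is P."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", sequence) if p]


def peptide_mass(peptide):
    """Monoisotopic mass of a neutral, unmodified peptide."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER


def first_level_search(database, sequenced_peptide):
    """Return the proteins whose sequence contains the MS/MS-sequenced peptide.

    The returned dict plays the role of the second level database.
    """
    return {name: seq for name, seq in database.items()
            if sequenced_peptide in seq}


def narrow_candidates(second_level, observed_masses, tol=0.01):
    """Apply further peptide masses until a single candidate remains.

    Assumption of this sketch: a mass that matches no candidate is skipped,
    rather than triggering a new MS/MS acquisition.
    """
    candidates = dict(second_level)
    for mass in observed_masses:
        if len(candidates) <= 1:
            break
        hits = {
            name: seq for name, seq in candidates.items()
            if any(abs(peptide_mass(p) - mass) <= tol
                   for p in tryptic_digest(seq))
        }
        if hits:
            candidates = hits
    return candidates


# Hypothetical first level database (names and sequences are made up).
TOY_DB = {
    "PROT_A": "MKTAYIAKQRGASLK",
    "PROT_B": "MKTAYIAKWWPEK",
    "PROT_C": "GGSLKDEFR",
}

# First peptide, assumed sequenced from MS/MS data.
second_level_db = first_level_search(TOY_DB, "TAYIAK")
# Further peptides, represented only by their MS masses.
result = narrow_candidates(second_level_db, [665.375, 474.280])
print(sorted(result))
```

In this toy example the first search retains two of the three proteins, and the subsequent mass comparisons are made only against that reduced set, which is the point of preparing the second level database.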
REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority to U.S. Provisional Application 60/297,574, filed on Jun. 12, 2001, the entire content of which is hereby incorporated by reference.
Provisional Applications (1)
| Number | Date | Country |
| --- | --- | --- |
| 60297574 | Jun 2001 | US |