Claims
- 1. A method for obtaining a predicted protein-protein interaction map across organisms, comprising:
(a) creating an intermediary domain cluster interaction map; (b) searching for similarities between the cluster for each selected interacting domain cluster and in a target organism; (c) creating a correspondence between the intermediary domain cluster interaction map and the target organism from the similarities; and (d) predicting a target protein-protein interaction map along the correspondence.
- 2. The method of claim 1 further comprising (e) building a profile for each selected interacting domain cluster from the intermediary domain cluster interaction map.
- 3. The method of claim I wherein the clustering is non-transitive and non-exclusive.
- 4. The method of claim I wherein the intermediary domain cluster interaction map is generated from at least one of a connectivity link (I-link) and a sequence similarity link (S-link) taken from at least one of a source organism map, a protein expression profile and an art annotation.
- 5. The method of claim 4 wherein said S-link clusters and I-link clusters resulting from step (a) are similarity and interaction cliques, respectively.
- 6. The method of claim 5 further comprising (f) further analyzing the clusters of similarity and interaction cliques to find interacting domain profile pairs (IDPP).
- 7. The method of claim 6 wherein said pairs of similarity and interaction cliques (n-SIC) are defined as (SIC1; SIC2), SIC1={ID1,1, . . . , ID1,n1} and SIC2={ID2.1, . . . , ID2,n2}, and defines an IDPP if the number of (ID1,i, ID2,j) pairs connected in the source interaction map divided by n1n2 (the total number of possible ID pairs between SIC1 and SIC2) is superior or equal to a threshold T of between about 50% and 100%.
- 8. The method of claim 2 wherein the profile is built when each sequence and interaction clique contains more than one member from a multiple sequence alignment of interacting domain sequences.
- 9. The method of claim 8 wherein the sequence alignment is a previously computed pairwise comparison if n=2 or if n>2 said sequence alignment is computed as a multiple sequence alignment.
- 10. The method of claim 8 wherein a Hidden Markov profile is built from said sequence alignment.
- 11. The method of claim 9 wherein a Hidden Markov profile is built from said sequence alignment.
- 12. The method of claim 1 wherein said searching of (b) is performed by using a single interacting domain sequence if n=1, or by using an interacting domain profile if n>1.
- 13. The method of claim 1 wherein the correspondence in (c) is performed by associating to each n-similarity and interacting cliques (n-SIC) a set of target protein domains similar to said n-SIC profile.
- 14. The method of claim 1 wherein a predicted biological score (PBS®) is provided with the predicted target protein-protein interaction map.
- 15. The method of claim 2 wherein the profile of interacting domains is a flexible sequence pattern correlated to physically interacting structures.
- 16. The method of claim 15 wherein the flexible sequence pattern represents new binding motifs.
- 17. A protein-protein interaction map obtained by the method of claim 1.
- 18. A record of the protein-protein interaction map of claim 17 in electronic, paper or digital form.
- 19. A method of predicting a target organism protein interaction map from a source organism protein interaction map, comprising:
(i) comparing each target organism protein sequence with each source organism protein; and (ii) transporting an interacting property of two source organism proteins along two target organism proteins showing significant similarities with said two interacting source proteins.
- 20. A method for predicting a target organism protein interaction map from a source organism protein interaction map, comprising comparing each target organism protein sequence with each interacting domain of a source organism protein specifically involved in an interaction.
Parent Case Info
[0001] The present invention claims priority from U.S. provisional application serial No. 60/277,021, the contents of which are incorporated herein by reference.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60277021 |
Mar 2001 |
US |