Claims
- 1. A method for identifying a motif or a combination of motifs having a Boolean state of predetermined mutations in a set of sequences comprising:
a) aligning a set of sequences of ordered motifs represented by a single-character code, b) comparing a reference sequence with the set of sequences aligned in step (a), c) identifying motifs not having mutated simultaneously or motifs having mutated simultaneously at least once on at least one sequence of the set and not having mutated on another sequence of said set.
- 2. The method according to claim 1, wherein the motif or the combination of motifs is a nucleotide or a combination of nucleotides and a subset of sequences is selected from sequences in a databank of nucleic acids.
- 3. The method according to claim 1, wherein the motif or the combination of motifs is an amino acid or a combination of amino acids and a subset of sequences is selected from sequences in a databank of polypeptides and/or proteins.
- 4. The method according to claim 1, wherein the reference sequence is a wild sequence.
- 5. The method according to claim 1, wherein the reference sequence is a sequence comprising in a position i a motif present in position i in a predetermined number of sequences of step (a).
- 6. The method according to claim 1, wherein step (b) comprises:
forming a first numerical matrix A of dimensions N×M in which N designates a number of sequences and M designates a number of motifs of one sequence of said alignment, with value Ai,j being equal to a first value A1 when the motif of position i of sequence j is mutated in relation to a motif of position i of the reference sequence and equal to a second value A2 in the other cases, forming two analysis matrices B, C of mutations in which this matrix is:
a matrix B of unmutated couples, of couples which do not mutate simultaneously, of dimension M×M, value Bi,k=Bk,i being equal:
to a first value B1 when Ai,j=Ak,j=A1 irrespective of the value of j ranging from 0 to N. to a second value B2 in other cases; a matrix C of mutated couples of dimension M×M, value Ck,i=Ci,k being equal:
to a second value C1 when Ai,j=Ak,j irrespective of the value of j ranging from 0 to N, to a first value C2 in the other cases; determining for a set E of positions a coefficient RE whose value is R1 when values Bi,k are equal to a second value B2, irrespective of the values of i and k belonging to set E of said positions, in which i j, determining for a set F of positions, a coefficient RF, the value of which is R1 when values Ci,k are equal to second value C2, irrespective of the values of i and k belonging to set F of said position in which i j.
- 7. The method according to claim 6, wherein positions of the sets E and/or F are designated by the user.
- 8. The method according to claim 6, wherein step (b) comprises a test step including generating a totality of combinations of possible positions, determining for each of said combinations the value of coefficients RR or RF, and retaining the combination corresponding to a largest set of positions coefficient RE or RF of which corresponds to said second value.
- 9. The method according to claim 1, wherein the set of sequences comprises sequences of motifs of pathogenic organisms having a high level of mutability.
- 10. The method according to claim 1, wherein the set of sequences comprises sequences of motifs of genes implicated in human, animal or plant pathologies having a high level of mutability.
- 11. An influenza vaccine comprising the motif or combination of motifs according to claim 1.
- 12. An HIV vaccine comprising the motif or combination of motifs according to claim 1.
- 13. A hepatitis C vaccine comprising the motif or combination of motifs according to claim 1.
- 14. A pharmaceutical composition for treatment of influenza comprising a therapeutically effective amount of the motif or combination of motifs according to claim 1.
- 15. A pharmaceutical composition for treatment of HIV comprising a therapeutically effective amount of the motif or combination of motifs according to claim 1.
- 16. A pharmaceutical composition for treatment of hepatitis C comprising a therapeutically effective amount of the motif or combination of motifs according to claim 1.
- 17. A method of treating influenza comprising administering a therapeutically effective amount of the pharmaceutical composition according to claim 14 to a patient in need thereof.
- 18. A method of treating HIV comprising administering a therapeutically effective amount of the pharmaceutical composition according to claim 15 to a patient in need thereof.
- 19. A method of treating hepatitis C comprising administering a therapeutically effective amount of the pharmaceutical composition according to claim 16 to a patient in need thereof.
- 20. The method according to claim 1, wherein the set of sequences of step (a) comprises all polypeptide sequences of different variants of a protease of human immunodeficiency virus.
- 21. The method according to claim 1, wherein the set of sequences of step (a) comprises all polypeptide sequences of different variants of a reverse transcriptase of human immunodeficiency virus.
- 22. The method according to claim 1, wherein the set of sequences of step (a) comprises all polypeptide sequences of different variants of integrase of human immunodeficiency virus.
- 23. The method according to claim 1, wherein the set of sequences of step (a) comprises all sequences of motifs of different variants of a gene or protein of neuraminidase of flu virus.
- 24. The method according to claim 1, wherein the set of sequences of step (a) comprises all sequences of motifs of different variants of a gene or protein of hemagglutinin of flu virus.
- 25. The method according to claim 1, wherein the set of sequences of step (a) comprises all sequences of motifs of different variants of a gene and/or protein of hepatitis C virus.
- 26. The method according to claim 1, wherein the set of sequences of step (a) comprises all sequences of motifs of different variants of a gene or protein HspA of bacterium Helicobacter pilori.
- 27. The method according to claim 1, wherein the subset of sequences of motifs selected in step (a) comprises all sequences of different variants of a gene or protein of type HA adhesin of bacterium Escherichia coli.
- 28. The method according to claim 1, further comprising, after step (c), a step (d) of comparing motifs identified in step (c) with known drug resistances to observed mutations.
- 29. The method according to claim 1, further comprising, after step (c), a step (e) of comparing motifs identified in step (c) with motifs of sequences implicated in a catalytic site and/or in sites linked by noncompetitive inhibitors.
- 30. A method of preparing an oligonucleotide sequence comprising:
identifying a nucleotide sequence according to claim 1, and synthesizing said sequence.
- 31. A method of preparing a polypeptide sequence comprising:
identifying an amino acid sequence according to claim 1, and synthesizing said sequence.
Priority Claims (1)
Number |
Date |
Country |
Kind |
01/07808 |
Jun 2001 |
FR |
|
RELATED APPLICATION
[0001] This is a continuation of International Application No. PCT/FR02/02068, with an international filing date of Jun. 14, 2002, which is based on French Patent Application No. 01/07808, filed Jun. 14, 2001.
Continuations (1)
|
Number |
Date |
Country |
Parent |
PCT/FR02/02068 |
Jun 2002 |
US |
Child |
10734023 |
Dec 2003 |
US |