Claims
- 1. A method of representing a carbohydrate structure as a linear sequence of characters, the method being performed by a data processor, the method comprising:
(a) representing saccharide units of the carbohydrate structure with character codes, each character code defining:
(i) an identity of a saccharide unit; and
(ii) a presence or absence of a chemical modification to said saccharide unit and optionally a type of said chemical modification; and
(b) assembling said character codes into a linear string of characters being uniquely representative of the carbohydrate structure.
- 2. The method of claim 1, wherein step (b) includes adding to said linear string of characters additional characters defining a type of connection between adjacent saccharide units.
- 3. The method of claim 1, wherein step (b) includes adding to said linear string of characters additional characters defining at least one branch of the carbohydrate structure.
- 4. The method of claim 3, wherein said additional characters define a structure and/or a hierarchy of said at least one branch.
- 5. The method of claim 2, wherein said linear string of characters includes letters defining said saccharide units and syntax defining said type of connection between adjacent saccharide units.
- 6. The method of claim 3, wherein said at least one branch is defined by a start point character and an end point character.
- 7. The method of claim 1, wherein said character code is a one letter code.
- 8. The method of claim 1, wherein step (b) includes adding to said linear string of characters additional characters identifying uncertainties within the carbohydrate structure.
- 9. A method of representing a branched carbohydrate structure as a linear sequence of characters, the method being performed by a data processor, the method comprising:
(a) representing saccharide units of the branched carbohydrate structure with character codes, each of said character codes defining an identity of a specific saccharide unit;
(b) defining at least one branch of the branched carbohydrate with at least one branch specific character; and
(c) assembling said character codes and said at least one branch specific character into a linear string of characters being uniquely representative of the branched carbohydrate structure.
- 10. The method of claim 9, wherein step (b) is effected by defining a branch start point with a first branch specific character and branch end point with a second branch specific character.
- 11. The method of claim 9, wherein said at least one branch specific character defines a structure and/or a hierarchy of said at least one branch.
- 12. The method of claim 9, further comprising the step of defining a type of connection between said saccharide units of the branched carbohydrate structure prior to step (c).
- 13. The method of claim 12, wherein said linear string of characters includes letters defining said saccharide units and syntax defining said type of said connection between said saccharide units.
- 14. The method of claim 9, further comprising the step of adding to said linear string of characters additional characters identifying uncertainties within the branched carbohydrate structure.
- 15. A method of representing a partially characterized carbohydrate structure as a linear sequence of characters, the method being performed by a data processor, the method comprising:
(a) representing saccharide units of the carbohydrate structure with first character codes, each of said first character codes defining an identity of a specific saccharide unit;
(b) representing uncertainties within the carbohydrate structure with second character codes, each of said second character codes identifying a specific structural uncertainty; and
(c) assembling said first character codes and said second character codes into a linear string of characters being uniquely representative of the partially characterized carbohydrate structure.
- 16. The method of claim 15, further comprising the step of adding to said linear string of characters additional characters defining a type of connection between adjacent saccharide units.
- 17. The method of claim 15, further comprising the step of adding to said linear string of characters additional characters defining at least one branch of the carbohydrate structure.
- 18. The method of claim 17, wherein said additional characters define a structure and/or a hierarchy of said at least one branch.
- 19. The method of claim 16, wherein said linear string of characters includes letters defining said saccharide units and syntax defining said type of connection between adjacent saccharide units.
- 20. The method of claim 17, wherein said at least one branch is defined by a start point character and an end point character.
- 21. The method of claim 15, wherein said first and/or second character code is a one letter code.
- 22. A system for representing a carbohydrate structure as a linear sequence of characters, the system comprising:
(a) a representor designed for representing saccharide units of the carbohydrate structure with character codes, each character code defining:
(i) an identity of a saccharide unit; and
(ii) a presence or absence of a chemical modification to said saccharide unit and optionally a type of said chemical modification; and
(b) an assembler for assembling said character codes into a linear string of characters being uniquely representative of the carbohydrate structure.
- 23. The system of claim 22, wherein said representor is further designed for adding to said linear string of characters additional characters defining a type of connection between adjacent saccharide units.
- 24. The system of claim 22, wherein said representor is further designed for adding to said linear string of characters additional characters defining at least one branch of the carbohydrate structure.
- 25. The system of claim 24, wherein said additional characters define a structure and/or a hierarchy of said at least one branch.
- 26. The system of claim 23, wherein said representor utilizes letters to define said saccharide units and syntax to define said type of connection between adjacent saccharide units.
- 27. The system of claim 24, wherein said representor defines said at least one branch with a start point character and an end point character.
- 28. The system of claim 22, wherein said representor is further designed for adding to said linear string of characters additional characters identifying uncertainties within the carbohydrate structure.
- 29. A system for representing a branched carbohydrate structure as a linear sequence of characters, the system comprising:
(a) a representor designed for:
(i) representing saccharide units of the branched carbohydrate structure with character codes, each of said character codes defining an identity of a specific saccharide unit; and
(ii) defining at least one branch of the branched carbohydrate with at least one branch specific character; and
(b) an assembler designed for assembling said character codes and said at least one branch specific character to generate a linear string of characters being uniquely representative of the branched carbohydrate structure.
- 30. The system of claim 29, wherein said representor is designed for defining a branch start point with a first branch specific character and branch end point with a second branch specific character.
- 31. The system of claim 30, wherein said at least one branch specific character defines a structure and/or a hierarchy of said at least one branch.
- 32. The system of claim 29, wherein said representor is further designed for defining a type of connection between said saccharide units of the branched carbohydrate structure.
- 33. The system of claim 32, wherein said representor is designed for defining said saccharide units with letters and defining said type of said connection between said saccharide units with syntax.
- 34. The system of claim 29, wherein said representor is further designed for adding to said linear string of characters additional characters identifying uncertainties within the branched carbohydrate structure.
- 35. A system for representing a partially characterized carbohydrate structure as a linear sequence of characters, the system comprising:
(a) a representor designed for:
(i) representing saccharide units of the carbohydrate structure with first character codes, each of said first character codes defining an identity of a specific saccharide unit;
(ii) representing uncertainties within the carbohydrate structure with second character codes, each of said second character codes identifying a specific structural uncertainty; and
(b) an assembler designed for assembling said first character codes and said second character codes into a linear string of characters being uniquely representative of the partially characterized carbohydrate structure.
- 36. The system of claim 35, wherein said representor is further designed for adding to said linear string of characters additional characters defining a type of connection between adjacent saccharide units.
- 37. The system of claim 35, wherein said representor is further designed for adding to said linear string of characters additional characters defining at least one branch of the carbohydrate structure.
- 38. The system of claim 37, wherein said additional characters define a structure and/or a hierarchy of said at least one branch.
- 39. The system of claim 36, wherein said representor is designed for defining said saccharide units with letters and defining said type of said connection between said saccharide units with syntax.
- 40. The system of claim 37, wherein said at least one branch is defined by said representor using a start point character and an end point character.
- 41. The system of claim 35, wherein said first and/or second character code is a one letter code.
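By way of non-limiting illustration only, the steps recited in claims 1, 9 and 15 might be sketched in Python as shown below. Nothing in this sketch is taken from the claims themselves: the one-letter alphabet (`UNIT_CODES`, `MOD_CODES`), the use of a single digit for the type of connection, the parentheses used as branch start point and end point characters, the `?` used as an uncertainty character, and the names `Unit` and `encode` are all assumptions chosen for the example. Sorting the encoded branches is likewise an assumed way of making the string unique for a given structure.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical one-letter codes; the claims do not fix any particular alphabet.
UNIT_CODES = {"glucose": "G", "mannose": "M", "galactose": "A", "fucose": "F"}
MOD_CODES = {"sulfate": "s", "phosphate": "p"}
# Assumed branch start/end characters and an assumed uncertainty character.
BRANCH_START, BRANCH_END, UNKNOWN = "(", ")", "?"


@dataclass
class Unit:
    """One saccharide unit in a tree-shaped carbohydrate structure."""
    name: str
    modification: Optional[str] = None  # None means "no modification"
    linkage: Optional[str] = None       # connection to the parent; None means uncertain
    children: List["Unit"] = field(default_factory=list)


def encode(unit: Unit, is_root: bool = True) -> str:
    """Assemble character codes into one linear string for the whole structure."""
    # Identity letter, followed by a modification letter when one is present.
    code = UNIT_CODES[unit.name]
    if unit.modification is not None:
        code += MOD_CODES[unit.modification]
    # Every non-root unit is prefixed by its connection type, or by the
    # uncertainty character when the connection has not been characterized.
    if not is_root:
        code = (unit.linkage or UNKNOWN) + code
    # Encode children and sort them so the result does not depend on input order.
    parts = sorted(encode(child, False) for child in unit.children)
    if not parts:
        return code
    # All but the last child are wrapped as branches; the last continues the chain.
    *branches, main = parts
    return code + "".join(BRANCH_START + b + BRANCH_END for b in branches) + main


structure = Unit("mannose", children=[
    Unit("mannose", modification="sulfate", linkage="6"),
    Unit("glucose", linkage="3", children=[Unit("fucose")]),  # fucose linkage unknown
])
print(encode(structure))  # M(3G?F)6Ms
```

In this sketch, `M(3G?F)6Ms` reads as a mannose root carrying a bracketed branch (a glucose at position 3 bearing a fucose of uncertain connection) followed by a sulfated mannose at position 6. Listing the two children in the opposite order yields the same string, which is the sense in which the example string is unique for the structure.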
RELATED APPLICATION
[0001] This application is a continuation of U.S. patent application Ser. No. 09/573,548, filed May 19, 2000.
Continuations (1)
| | Number | Date | Country |
|---|---|---|---|
| Parent | 09573548 | May 2000 | US |
| Child | 10419729 | Apr 2003 | US |