Claims
- 1. A method for mapping a plurality of characters in a string, wherein the string comprises a combination of characters representing indexed expressions and a combination of characters representing non-indexed expressions, wherein the method comprises:
generating a set of special weights for a plurality of characters representing a plurality of indexed and non-indexed expressions, wherein the order of the special weights are based on an order of the plurality of characters representing the plurality of indexed and non-indexed expressions; if a specific group of characters within the string represent a specific non-indexed expression, associating a weight value of an indexed expression with the specific group of characters representing the specific non-indexed expression; retrieving a plurality of special weights associated with the specific group of characters representing the specific non-indexed expression; and generating a weight array, wherein the weight array comprises the associated weight value of the indexed expression and the plurality of special weights associated with the specific group of characters representing the specific non-indexed expression.
- 2. The method of claim 1, wherein the weight value associated with the specific group of characters representing the specific non-indexed expression matches an indexed weight value of a proceeding indexed expression.
- 3. The method of claim 1, wherein the weight array is configured to a format that is compatible with the Unicode standard.
- 4. The method of claim 1, wherein the weight value represents an alphanumeric weight value in accordance with the Unicode standard.
- 5. The method of claim 1, wherein each indexed expression represents a modern Hangul character, and wherein each non-indexed expression represents an old Hangul character.
- 6. The method of claim 5, wherein the weight value associated with the specific group of characters representing the specific non-indexed expression matches a indexed weight value of a proceeding indexed expression.
- 7. The method of claim 1, wherein the set of special weights for specific characters representing the non-indexed expressions is generated by independently indexing a plurality of categories of characters ordered by the grammatical rules of each category of characters.
- 8. The method of claim 7, wherein the categories of characters comprises a first category of leading consonants, a second category of vowels, and a third category of trailing consonants.
- 9. The method of claim 1, further comprising, determining if the specific group of characters in the string properly represent an old Hangul character.
- 10. The method of claim 1, further comprising, determining if the specific group of characters in the string is valid, wherein the specific group of characters is determined to be valid by determining if a valid number of leading consonants exists in the specific group of characters.
- 11. The method of claim 1, further comprising the step of compressing the weight array.
- 12. The method of claim 1, wherein the indexed expressions in the string are indexed in accordance with the Unicode standard.
- 13. The method of claim 1, wherein the generation of the weight array, comprises, appending the plurality of special weights associated with the specific group of characters representing the specific non-indexed expression to the associated weight value of the non-indexed expression.
- 14. A computer-readable medium having computer executable code, which when executed, perform the method of claim 1.
- 15. A computer system having computer executable code, which when executed, perform the method of claim 1.
- 16. A method for mapping a plurality of characters in a string, wherein the string comprises a combination of Jamo representing modem Hangul, wherein the modem Hangul are indexed by a plurality of standardized weight values, and wherein the string comprises a combination of Jamo representing old Hangul, wherein the method comprises:
generating a set of special weights for Jamo a plurality of characters that represent a plurality of modem and old Hangul, wherein the order of the special weights are based on the order of the plurality of characters that represent a plurality of modern and old Hangul; if a specific group of Jamo within the string represent a specific old Hangul, associating at least one standardized weight value of a modem Hangul with the specific group of Jamo representing the specific old Hangul, wherein the associated standardized weight value is selected from the modem Hangul that precedes the old Hangul; retrieving a plurality of special weights associated with the specific group of Jamo representing the specific old Hangul; and generating a weight array, wherein the weight array comprises the associated standardized weight value of the modern Hangul and the plurality of special weights associated with the specific Jamo representing the specific old Hangul.
- 17. The method of claim 16, wherein the standardized weight value represents an alphanumeric weight value in accordance with the Unicode standard.
- 18. A computer-readable medium having computer executable code, which when executed, perform the method of claim 16.
- 19. A computer system having computer executable code, which when executed, perform the method of claim 16.
- 20. A method of sorting a first set of characters and a second set of characters, wherein the method comprises:
generating a weight array for the first set of characters, thereby providing a plurality of weighting values for the first set of characters; comparing the plurality of weighting values for the first set of characters to an assigned set of weighting values for the second set of characters; and sorting the first and second set of characters based on the comparison of the plurality of weighting values and assigned weighting values.
- 21. A computer-readable medium having stored thereon a data structure, comprising a weight array having an associated weight value of an indexed expression and a plurality of special weights associated with a specific group of characters representing a non-indexed expression.
CROSS-REFERENCE TO RELATED APPLICATION
[0001] This non-provisional patent application claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application No. 60/244,288 filed on Oct. 30, 2000, and entitled “METHOD AND SYSTEM FOR MAPPING STRINGS FOR COMPARISON,” the subject matter of which is specifically incorporated herein by reference.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60244288 |
Oct 2000 |
US |