Claims
- 1. A method for organizing and aiding the interpretation of gene data, said method comprising the steps of:
receiving gene names; associating the gene names to gene-word pair relationships; and grouping the gene names with high strength of gene-word relationships, the strength of the gene-word relationships corresponding to the relatedness in function of corresponding grouped genes.
- 2. The method of claim 1, wherein the receiving gene names includes receiving alias names for the gene names.
- 3. The method of claim 1, further including querying the gene names in a literature database.
- 4. The method of claim 3, wherein the receiving includes, responsive to the query of the gene names in a literature database, receiving abstracts comprising the gene names.
- 5. The method of claim 4, further including generating a background set and a query set from the returned abstracts.
- 6. The method of claim 5, further including calculating word frequencies in the query set and the background set.
- 7. The method of claim 6, further including providing a numerical value calculated for each word in which a word frequency was calculated for the query set.
- 8. The method of claim 7, wherein the providing includes calculating z scores.
- 9. The method of claim 7, wherein the providing includes using term frequency-inverse document frequency methods.
- 10. The method of claim 4, further including stemming words of the returned abstracts.
- 11. The method of claim 10, further including filtering the stemmed words using a stop list.
- 12. A system for organizing and aiding the interpretation of data, said system comprising:
a memory with logic; and a processor configured with the logic to receive gene names, said processor further configured with the logic to associate the gene names to gene-word pair relationships, said processor further configured with the logic to group the gene names with high strength of gene-word relationships, the strength of the gene-word relationships corresponding to the relatedness in function of corresponding grouped genes.
- 13. The system of claim 12, wherein the processor is further configured with the logic to generate keywords that describe the common function of each group.
- 14. A system for organizing and aiding the interpretation of gene data, said system comprising:
means for receiving gene names; means for associating the gene names to gene-word pair relationships; and means for grouping the gene names with a similar strength of gene-word relationships, the strength of the gene-word relationships corresponding to the relatedness in function of corresponding grouped genes.
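By way of illustration only, the word-frequency and scoring steps recited in claims 5 through 8, together with the stop-list filtering of claim 11, might be sketched as follows. The claims do not fix a particular z-score formula, so the binomial-proportion form used here is an assumption, as are the add-one smoothing, the tiny `STOP_WORDS` list, and the function names `tokenize`, `word_counts`, and `z_scores`. Stemming (claim 10) is omitted for brevity.

```python
import math
import re
from collections import Counter

# Hypothetical stop list, for illustration only; a real one would be far larger.
STOP_WORDS = {"the", "a", "of", "in", "and", "is", "to"}


def tokenize(text):
    """Lowercase the text, split on non-letters, and drop stop-list words."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOP_WORDS]


def word_counts(abstracts):
    """Count word occurrences across a collection of abstracts."""
    counts = Counter()
    for abstract in abstracts:
        counts.update(tokenize(abstract))
    return counts


def z_scores(query_abstracts, background_abstracts):
    """Score each query-set word by how far its frequency departs from the
    background frequency (assumed binomial-proportion z-score)."""
    q = word_counts(query_abstracts)
    b = word_counts(background_abstracts)
    n_q = sum(q.values())
    n_b = sum(b.values())
    vocab = len(set(q) | set(b))
    scores = {}
    for word, count in q.items():
        p_q = count / n_q
        # Add-one smoothing (an assumption) keeps words that are absent from
        # the background set from producing a zero background frequency.
        p_b = (b[word] + 1) / (n_b + vocab)
        se = math.sqrt(p_b * (1 - p_b) / n_q)
        if se == 0:
            continue  # degenerate case: no variance to compare against
        scores[word] = (p_q - p_b) / se
    return scores


if __name__ == "__main__":
    query = ["kinase phosphorylation signaling kinase",
             "kinase activity in the cell"]
    background = ["the cell membrane protein",
                  "protein folding in the cell",
                  "cell cycle and protein"]
    for word, z in sorted(z_scores(query, background).items(),
                          key=lambda item: -item[1]):
        print(f"{word}\t{z:.2f}")
```

Words with large positive scores are those over-represented in the abstracts returned for a gene relative to the background set, which is the kind of gene-word relationship strength on which the grouping step of claim 1 relies.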
CROSS REFERENCE TO RELATED APPLICATION
[0001] This application claims the benefit of U.S. Provisional Application No. 60/441,850, filed Jan. 21, 2003, which is entirely incorporated herein by reference.
Provisional Applications (1)
| Number | Date | Country |
| --- | --- | --- |
| 60441850 | Jan 2003 | US |