Claims
- 1. A method for selecting a set of initial cluster centers in wavefront clustering a collection of objects, each object being represented by a set of multi-modal feature vectors, comprising the steps of: selecting a first number of first objects from the collection; computing a vector centroid of the first objects using the set of multi-modal feature vectors associated with each object; selecting a second number of second objects from the collection; identifying a second number of initial cluster centers between the centroid and the second objects; and wavefront clustering the collection of objects using the second number of initial cluster centers.
- 2. The method of claim 1, wherein the first objects are selected randomly from the collection.
- 3. The method of claim 2, wherein the first number is much smaller than the number of objects in the collection.
- 4. The method of claim 3, wherein the first number is greater than five (5).
- 5. The method of claim 4, wherein the second number is ten (10).
- 6. The method of claim 1, wherein the second objects are selected randomly from the collection.
- 7. The method of claim 6, wherein the second number is equal to a desired number of initial cluster centers.
- 8. The method of claim 1, wherein each of the second number of initial cluster centers is calculated according to $\vec{x}_i' = \alpha\vec{c} + (1-\alpha)\vec{x}_i$, where $\vec{x}_i'$ represents an initial cluster center, $\alpha$ represents a scalar factor, $\vec{x}_i$ represents one of the second objects, and $\vec{c}$ represents the vector centroid of the first objects.
- 9. The method of claim 8, wherein α is approximately equal to 0.9.
- 10. A computer readable medium storing instructions for wavefront clustering a collection of objects, each object being represented by a set of multi-modal feature vectors, comprising the instructions for: randomly selecting a first number of first objects from the collection; computing a vector centroid of the first objects using the set of multi-modal feature vectors associated with each object; randomly selecting a second number of second objects from the collection; identifying a second number of initial cluster centers between the centroid and the second objects; and performing iterated k-means wavefront clustering around the initial cluster centers to cluster the objects.
- 11. The computer readable medium of claim 10, wherein the first number is much smaller than the number of objects in the collection.
- 12. The computer readable medium of claim 11, wherein the first number is greater than five (5).
- 13. The computer readable medium of claim 10, wherein the second number is equal to a desired number of initial cluster centers.
- 14. The computer readable medium of claim 13, wherein each of the second number of initial cluster centers is calculated according to $\vec{x}_i' = \alpha\vec{c} + (1-\alpha)\vec{x}_i$, where $\vec{x}_i'$ represents an initial cluster center, $\alpha$ represents a scalar factor, $\vec{x}_i$ represents one of the second objects, and $\vec{c}$ represents the vector centroid of the first objects.
- 15. The computer readable medium of claim 14, wherein α is approximately equal to 0.9.
- 16. A signal for transmitting computer instructions for selecting a set of initial cluster centers in wavefront clustering a collection of objects, each object being represented by a set of multi-modal feature vectors, the instructions comprising: randomly selecting a first number of first objects from the collection, the first number being less than a number of objects in the collection; computing a vector centroid of the first objects using the set of multi-modal feature vectors associated with each object; selecting a second number of second objects from the collection, the second number equaling a desired number of initial cluster centers; identifying a second number of initial cluster centers between the centroid and the second objects; and wavefront clustering the collection of objects using the second number of initial cluster centers.
- 17. The signal of claim 16, wherein each of the second number of initial cluster centers is calculated according to $\vec{x}_i' = \alpha\vec{c} + (1-\alpha)\vec{x}_i$, where $\vec{x}_i'$ represents an initial cluster center, $\alpha$ represents a scalar factor, $\vec{x}_i$ represents one of the second objects, and $\vec{c}$ represents the vector centroid of the first objects.
- 18. The signal of claim 17, wherein α is approximately equal to 0.9.
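The center-selection steps recited in the claims above can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes each object's multi-modal feature vectors have already been concatenated into one flat list of floats, and the names `centroid`, `initial_centers`, `num_sample`, `num_centers`, and `rng` are hypothetical, chosen only for this example. The wavefront clustering step itself is not shown.

```python
import random
from typing import List, Optional, Sequence

Vector = List[float]


def centroid(vectors: Sequence[Sequence[float]]) -> Vector:
    """Return the component-wise mean of equal-length vectors."""
    n = len(vectors)
    return [sum(component) / n for component in zip(*vectors)]


def initial_centers(
    objects: Sequence[Sequence[float]],
    num_sample: int,
    num_centers: int,
    alpha: float = 0.9,
    rng: Optional[random.Random] = None,
) -> List[Vector]:
    """Pick initial cluster centers lying between a sample centroid and random objects.

    Assumption: each object is a single flat feature vector.
    """
    rng = rng or random.Random()
    # "First objects": a random sample, typically much smaller than the collection.
    first_objects = rng.sample(list(objects), num_sample)
    c = centroid(first_objects)
    # "Second objects": one random object per desired initial cluster center.
    second_objects = rng.sample(list(objects), num_centers)
    # x_i' = alpha * c + (1 - alpha) * x_i, applied component by component.
    return [
        [alpha * c_j + (1 - alpha) * x_j for c_j, x_j in zip(c, x)]
        for x in second_objects
    ]
```

With `alpha` near 0.9, each returned center lies on the segment joining the centroid to its second object, about nine tenths of the way toward the centroid, so the centers start close together near the middle of the sampled data.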
CROSS-REFERENCE TO RELATED APPLICATIONS
This application claims the benefit of U.S. Provisional Application No. 60/117,462, filed on Jan. 26, 1999.
This application is also related to U.S. patent application Ser. No. 09/421,770 (non-final action mailed Nov. 7, 2000), entitled “SYSTEM AND METHOD FOR INFORMATION BROWSING USING MULTI-MODAL FEATURES,” U.S. patent application Ser. No. 09/425,038 (allowed, pub.), entitled “SYSTEM AND METHOD FOR PROVIDING RECOMMENDATIONS BASED ON MULTI-MODAL USER CLUSTERS,” U.S. patent application Ser. No. 09/421,416 (non-final action mailed Jan. 3, 2003), entitled “SYSTEM AND METHOD FOR QUANTITATIVELY REPRESENTING DATA OBJECTS IN VECTOR SPACE,” U.S. patent application Ser. No. 09/421,767 (non-final action mailed Nov. 3, 2003), entitled “SYSTEM AND METHOD FOR IDENTIFYING SIMILARITIES AMONG DATA OBJECTS IN A COLLECTION,” U.S. patent application Ser. No. 09/425,039 (present application), entitled “SYSTEM AND METHOD FOR CLUSTERING DATA OBJECTS IN A COLLECTION,” and U.S. patent application Ser. No. 09/421,419 (allowed, pub. Feb. 12, 2003), entitled “SYSTEM AND METHOD FOR VISUALLY REPRESENTING THE CONTENTS OF A MULTIPLE DATA OBJECT CLUSTER,” all filed of even date herewith.
US Referenced Citations (4)
| Number | Name | Date | Kind |
| --- | --- | --- | --- |
| 5619709 | Caid et al. | Apr 1997 | A |
| 5794178 | Caid et al. | Aug 1998 | A |
| 6003027 | Prager | Dec 1999 | A |
| 6289353 | Hazlehurst et al. | Sep 2001 | B1 |
Provisional Applications (1)
| Number | Date | Country |
| --- | --- | --- |
| 60/117462 | Jan 1999 | US |