William DuMouchel, Data squashing: constructing summary data sets, dumouchel@research.att.com, Apr. 2000, pp 1-12.* |
Art Owen, Data squashing by empirical likelihood, Stanford University, Sep. 1999, pp 1-18.* |
Barbara, D. (1997). The New Jersey Data Reduction Report. Bulletin on the Technical Committee on Data Engineering 20(4), pp. 3-45, Dec. 1997, vol 20, No. 4. |
Bradley, P.S., U. Fayyad, and C. Reina (1998). Scaling Clustering Algorithms to Large Databases. In Proc. 4th Intl. Conf. on Knowledge Discovery and Data Mining (KDD), pp. 9-15. |
DuMouchel, W. (1999). Bayesian Data Mining in Large Frequency Tables, With an Application to the FDA Spontaneous Reporting System. The American Statistician, Aug., 1999, vol. 53, No. 3, pp. 177-190. |
Johnson, T. and T. Dasu (1998). Comparing Massive High Dimensional Data Sets. In Proc. 4th Intl. Conf. on Knowledge Discovery and Data Mining (KDD), pp. 229-233. |
Zhang, T., R. Ramakrishnan, and M. Livny (1997). Birch: A New Data Clustering Algorithm and Its Applications. Data Mining and Knowledge Discovery 1(2), pp. 141-181. |