About Enhanced Word Clustering for Hierarchical Text Classification
Enhanced Word Clustering for Hierarchical Text Classification, by Inderjit S. Dhillon, Subramanyam Mallela and Rahul Kumar, University of Texas, Austin, USA. Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2002. The authors propose a new information-theoretic divisive algorithm for word clustering and apply it to text classification. Experimental results are reported on the 20 Newsgroups data set and on a 3-level hierarchy of HTML documents collected from ODP's Science top-level category.