Subspace clustering of text documents with feature weighting k-means algorithm

Jing, Liping; Ng, Michael K.; Xu, Jun; Huang, Joshua Zhexue

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1007/11430919_94
Scopus: eid_2-s2.0-26944481948
Find via

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Mathematics: Conference papers

Conference Paper: Subspace clustering of text documents with feature weighting k-means algorithm

Title	Subspace clustering of text documents with feature weighting k-means algorithm
Authors	Jing, Liping Ng, Michael K.Xu, Jun Huang, Joshua Zhexue
Keywords	Feature Weighting Text Mining Subspace Clustering Cluster Interpretation High Dimensional Data
Issue Date	2005
Publisher	Springer.
Citation	9th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2005), Hanoi, Vietnam, 18-20 May 2005. In Advances in Knowledge Discovery and Data Mining: 9th Pacific-Asia Conference, PAKDD 2005, Hanoi, Vietnam, May 18-20, 2005: Proceedings, 2005, p. 802-812 How to Cite? DOI: http://dx.doi.org/10.1007/11430919_94
Abstract	This paper presents a new method to solve the problem of clustering large and complex text data. The method is based on a new subspace clustering algorithm that automatically calculates the feature weights in the k-means clustering process. In clustering sparse text data the feature weights are used to discover clusters from subspaces of the document vector space and identify key words that represent the semantics of the clusters. We present a modification of the published algorithm to solve the sparsity problem that occurs in text clustering. Experimental results on real-world text data have shown that the new method outper-formed the Standard K Means and Bisection-KMeans algorithms, while still maintaining efficiency of the k-means clustering process. © Springer-Verlag Berlin Heidelberg 2005.
Persistent Identifier	http://hdl.handle.net/10722/276781
ISBN	9783540260769
ISSN	0302-9743 2023 SCImago Journal Rankings: 0.606
Series/Report no.	Lecture Notes in Computer Science ; 3518

DC Field	Value	Language
dc.contributor.author	Jing, Liping	-
dc.contributor.author	Ng, Michael K.	-
dc.contributor.author	Xu, Jun	-
dc.contributor.author	Huang, Joshua Zhexue	-
dc.date.accessioned	2019-09-18T08:34:38Z	-
dc.date.available	2019-09-18T08:34:38Z	-
dc.date.issued	2005	-
dc.identifier.citation	9th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2005), Hanoi, Vietnam, 18-20 May 2005. In Advances in Knowledge Discovery and Data Mining: 9th Pacific-Asia Conference, PAKDD 2005, Hanoi, Vietnam, May 18-20, 2005: Proceedings, 2005, p. 802-812	-
dc.identifier.isbn	9783540260769	-
dc.identifier.issn	0302-9743	-
dc.identifier.uri	http://hdl.handle.net/10722/276781	-
dc.description.abstract	This paper presents a new method to solve the problem of clustering large and complex text data. The method is based on a new subspace clustering algorithm that automatically calculates the feature weights in the k-means clustering process. In clustering sparse text data the feature weights are used to discover clusters from subspaces of the document vector space and identify key words that represent the semantics of the clusters. We present a modification of the published algorithm to solve the sparsity problem that occurs in text clustering. Experimental results on real-world text data have shown that the new method outper-formed the Standard K Means and Bisection-KMeans algorithms, while still maintaining efficiency of the k-means clustering process. © Springer-Verlag Berlin Heidelberg 2005.	-
dc.language	eng	-
dc.publisher	Springer.	-
dc.relation.ispartof	Advances in Knowledge Discovery and Data Mining: 9th Pacific-Asia Conference, PAKDD 2005, Hanoi, Vietnam, May 18-20, 2005: Proceedings	-
dc.relation.ispartofseries	Lecture Notes in Computer Science ; 3518	-
dc.subject	Feature Weighting	-
dc.subject	Text Mining	-
dc.subject	Subspace Clustering	-
dc.subject	Cluster Interpretation	-
dc.subject	High Dimensional Data	-
dc.title	Subspace clustering of text documents with feature weighting k-means algorithm	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1007/11430919_94	-
dc.identifier.scopus	eid_2-s2.0-26944481948	-
dc.identifier.spage	802	-
dc.identifier.epage	812	-
dc.identifier.eissn	1611-3349	-
dc.publisher.place	Berlin	-
dc.identifier.issnl	0302-9743	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Subspace clustering of text documents with feature weighting k-means algorithm

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats