File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1007/11430919_94
- Scopus: eid_2-s2.0-26944481948
- Find via
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Subspace clustering of text documents with feature weighting k-means algorithm
Title | Subspace clustering of text documents with feature weighting k-means algorithm |
---|---|
Authors | |
Keywords | Feature Weighting Text Mining Subspace Clustering Cluster Interpretation High Dimensional Data |
Issue Date | 2005 |
Publisher | Springer. |
Citation | 9th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2005), Hanoi, Vietnam, 18-20 May 2005. In Advances in Knowledge Discovery and Data Mining: 9th Pacific-Asia Conference, PAKDD 2005, Hanoi, Vietnam, May 18-20, 2005: Proceedings, 2005, p. 802-812 How to Cite? |
Abstract | This paper presents a new method to solve the problem of clustering large and complex text data. The method is based on a new subspace clustering algorithm that automatically calculates the feature weights in the k-means clustering process. In clustering sparse text data the feature weights are used to discover clusters from subspaces of the document vector space and identify key words that represent the semantics of the clusters. We present a modification of the published algorithm to solve the sparsity problem that occurs in text clustering. Experimental results on real-world text data have shown that the new method outper-formed the Standard K Means and Bisection-KMeans algorithms, while still maintaining efficiency of the k-means clustering process. © Springer-Verlag Berlin Heidelberg 2005. |
Persistent Identifier | http://hdl.handle.net/10722/276781 |
ISBN | |
ISSN | 2023 SCImago Journal Rankings: 0.606 |
Series/Report no. | Lecture Notes in Computer Science ; 3518 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Jing, Liping | - |
dc.contributor.author | Ng, Michael K. | - |
dc.contributor.author | Xu, Jun | - |
dc.contributor.author | Huang, Joshua Zhexue | - |
dc.date.accessioned | 2019-09-18T08:34:38Z | - |
dc.date.available | 2019-09-18T08:34:38Z | - |
dc.date.issued | 2005 | - |
dc.identifier.citation | 9th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2005), Hanoi, Vietnam, 18-20 May 2005. In Advances in Knowledge Discovery and Data Mining: 9th Pacific-Asia Conference, PAKDD 2005, Hanoi, Vietnam, May 18-20, 2005: Proceedings, 2005, p. 802-812 | - |
dc.identifier.isbn | 9783540260769 | - |
dc.identifier.issn | 0302-9743 | - |
dc.identifier.uri | http://hdl.handle.net/10722/276781 | - |
dc.description.abstract | This paper presents a new method to solve the problem of clustering large and complex text data. The method is based on a new subspace clustering algorithm that automatically calculates the feature weights in the k-means clustering process. In clustering sparse text data the feature weights are used to discover clusters from subspaces of the document vector space and identify key words that represent the semantics of the clusters. We present a modification of the published algorithm to solve the sparsity problem that occurs in text clustering. Experimental results on real-world text data have shown that the new method outper-formed the Standard K Means and Bisection-KMeans algorithms, while still maintaining efficiency of the k-means clustering process. © Springer-Verlag Berlin Heidelberg 2005. | - |
dc.language | eng | - |
dc.publisher | Springer. | - |
dc.relation.ispartof | Advances in Knowledge Discovery and Data Mining: 9th Pacific-Asia Conference, PAKDD 2005, Hanoi, Vietnam, May 18-20, 2005: Proceedings | - |
dc.relation.ispartofseries | Lecture Notes in Computer Science ; 3518 | - |
dc.subject | Feature Weighting | - |
dc.subject | Text Mining | - |
dc.subject | Subspace Clustering | - |
dc.subject | Cluster Interpretation | - |
dc.subject | High Dimensional Data | - |
dc.title | Subspace clustering of text documents with feature weighting k-means algorithm | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1007/11430919_94 | - |
dc.identifier.scopus | eid_2-s2.0-26944481948 | - |
dc.identifier.spage | 802 | - |
dc.identifier.epage | 812 | - |
dc.identifier.eissn | 1611-3349 | - |
dc.publisher.place | Berlin | - |
dc.identifier.issnl | 0302-9743 | - |