Exploiting word cluster information for unsupervised feature selection

Wu, Qingyao; Ye, Yunming; Ng, Michael; Su, Hanjing; Huang, Joshua

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1007/978-3-642-15246-7_28
Scopus: eid_2-s2.0-78049241090
Find via

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Mathematics: Conference papers

Conference Paper: Exploiting word cluster information for unsupervised feature selection

Title	Exploiting word cluster information for unsupervised feature selection
Authors	Wu, Qingyao Ye, Yunming Ng, Michael Su, Hanjing Huang, Joshua
Issue Date	2010
Publisher	Springer.
Citation	11th Pacific Rim International Conference on Artificial Intelligence (PRICAI 2010), Daegu, Korea, 30 August - 2 September 2010. In PRICAI 2010: Trends in Artificial Intelligence: 11th Pacific Rim International Conference on Artificial Intelligence, Daegu, Korea, August 30–September 2, 2010: Proceedings, 2010, p. 292-303 How to Cite? DOI: http://dx.doi.org/10.1007/978-3-642-15246-7_28
Abstract	This paper presents an approach to integrate word clustering information into the process of unsupervised feature selection. In our scheme, the words in the whole feature space are clustered into groups based on the co-occurrence statistics of words. The resulted word clustering information and the bag-of-word information are combined together to measure the goodness of each word, which is our basic metric for selecting discriminative features. By exploiting word cluster information, we extend three well-known unsupervised feature selection methods and propose three new methods. A series of experiments are performed on three benchmark text data sets (the 20 Newsgroups, Reuters-21578 and CLASSIC3). The experimental results have shown that the new unsupervised feature selection methods can select more discriminative features, and in turn improve the clustering performance. © 2010 Springer-Verlag Berlin Heidelberg.
Persistent Identifier	http://hdl.handle.net/10722/276877
ISBN	9783642152450
ISSN	0302-9743 2023 SCImago Journal Rankings: 0.606
Series/Report no.	Lecture Notes in Computer Science ; 6230

DC Field	Value	Language
dc.contributor.author	Wu, Qingyao	-
dc.contributor.author	Ye, Yunming	-
dc.contributor.author	Ng, Michael	-
dc.contributor.author	Su, Hanjing	-
dc.contributor.author	Huang, Joshua	-
dc.date.accessioned	2019-09-18T08:34:55Z	-
dc.date.available	2019-09-18T08:34:55Z	-
dc.date.issued	2010	-
dc.identifier.citation	11th Pacific Rim International Conference on Artificial Intelligence (PRICAI 2010), Daegu, Korea, 30 August - 2 September 2010. In PRICAI 2010: Trends in Artificial Intelligence: 11th Pacific Rim International Conference on Artificial Intelligence, Daegu, Korea, August 30–September 2, 2010: Proceedings, 2010, p. 292-303	-
dc.identifier.isbn	9783642152450	-
dc.identifier.issn	0302-9743	-
dc.identifier.uri	http://hdl.handle.net/10722/276877	-
dc.description.abstract	This paper presents an approach to integrate word clustering information into the process of unsupervised feature selection. In our scheme, the words in the whole feature space are clustered into groups based on the co-occurrence statistics of words. The resulted word clustering information and the bag-of-word information are combined together to measure the goodness of each word, which is our basic metric for selecting discriminative features. By exploiting word cluster information, we extend three well-known unsupervised feature selection methods and propose three new methods. A series of experiments are performed on three benchmark text data sets (the 20 Newsgroups, Reuters-21578 and CLASSIC3). The experimental results have shown that the new unsupervised feature selection methods can select more discriminative features, and in turn improve the clustering performance. © 2010 Springer-Verlag Berlin Heidelberg.	-
dc.language	eng	-
dc.publisher	Springer.	-
dc.relation.ispartof	PRICAI 2010: Trends in Artificial Intelligence: 11th Pacific Rim International Conference on Artificial Intelligence, Daegu, Korea, August 30–September 2, 2010: Proceedings	-
dc.relation.ispartofseries	Lecture Notes in Computer Science ; 6230	-
dc.title	Exploiting word cluster information for unsupervised feature selection	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1007/978-3-642-15246-7_28	-
dc.identifier.scopus	eid_2-s2.0-78049241090	-
dc.identifier.spage	292	-
dc.identifier.epage	303	-
dc.identifier.eissn	1611-3349	-
dc.publisher.place	Berlin	-
dc.identifier.issnl	0302-9743	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Exploiting word cluster information for unsupervised feature selection

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats