A discriminative and semantic feature selection method for text categorization

Zong, W; Wu, F; Chu, LK; Sculli, D

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1016/j.ijpe.2014.12.035
Scopus: eid_2-s2.0-84929519537
WOS: WOS:000356110400021

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Industrial & Manufacturing Systems Engineering: Journal/Magazine Articles

Article: A discriminative and semantic feature selection method for text categorization

Title	A discriminative and semantic feature selection method for text categorization
Authors	Zong, W Wu, F Chu, LK Sculli, D
Keywords	Big data Discriminative power Feature selection Semantic similarity Support vector machine (SVM) Text categorization
Issue Date	2015
Citation	International Journal of Production Economics, 2015, v. 165, p. 215-222 How to Cite? DOI: http://dx.doi.org/10.1016/j.ijpe.2014.12.035
Abstract	Text categorization is an important and critical task in the current era of high volume data storage and handling. Feature selection is obviously one of the most important steps in text categorization. Traditional feature selection methods tend to only consider the correlation between features and categories, and have in the main ignored the semantic similarity between features and documents. To further explore this issue, this paper proposes a novel feature selection method that first selects features in documents with discriminative power and then computes the semantic similarity between features and documents. The proposed feature selection method is tested using a support vector machine (SVM) classifier upon two published datasets, viz. Reuters-21578 and 20-Newsgroups. The experimental results show that the proposed feature selection method generally outperforms the traditional feature selection methods for text categorization for both published datasets.
Persistent Identifier	http://hdl.handle.net/10722/208209
ISI Accession Number ID	WOS:000356110400021

DC Field	Value	Language
dc.contributor.author	Zong, W	en_US
dc.contributor.author	Wu, F	en_US
dc.contributor.author	Chu, LK	en_US
dc.contributor.author	Sculli, D	en_US
dc.date.accessioned	2015-02-23T08:07:57Z	-
dc.date.available	2015-02-23T08:07:57Z	-
dc.date.issued	2015	en_US
dc.identifier.citation	International Journal of Production Economics, 2015, v. 165, p. 215-222	en_US
dc.identifier.uri	http://hdl.handle.net/10722/208209	-
dc.description.abstract	Text categorization is an important and critical task in the current era of high volume data storage and handling. Feature selection is obviously one of the most important steps in text categorization. Traditional feature selection methods tend to only consider the correlation between features and categories, and have in the main ignored the semantic similarity between features and documents. To further explore this issue, this paper proposes a novel feature selection method that first selects features in documents with discriminative power and then computes the semantic similarity between features and documents. The proposed feature selection method is tested using a support vector machine (SVM) classifier upon two published datasets, viz. Reuters-21578 and 20-Newsgroups. The experimental results show that the proposed feature selection method generally outperforms the traditional feature selection methods for text categorization for both published datasets.	en_US
dc.language	eng	en_US
dc.relation.ispartof	International Journal of Production Economics	en_US
dc.subject	Big data	-
dc.subject	Discriminative power	-
dc.subject	Feature selection	-
dc.subject	Semantic similarity	-
dc.subject	Support vector machine (SVM)	-
dc.subject	Text categorization	-
dc.title	A discriminative and semantic feature selection method for text categorization	en_US
dc.type	Article	en_US
dc.identifier.email	Chu, LK: lkchu@hkucc.hku.hk	en_US
dc.identifier.email	Sculli, D: hreidsc@hkucc.hku.hk	en_US
dc.identifier.authority	Chu, LK=rp00113	en_US
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1016/j.ijpe.2014.12.035	en_US
dc.identifier.scopus	eid_2-s2.0-84929519537	-
dc.identifier.hkuros	242307	en_US
dc.identifier.volume	165	-
dc.identifier.spage	215	-
dc.identifier.epage	222	-
dc.identifier.isi	WOS:000356110400021	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: A discriminative and semantic feature selection method for text categorization

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats