Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.ijpe.2014.12.035
- Scopus: eid_2-s2.0-84929519537
- WOS: WOS:000356110400021
Article: A discriminative and semantic feature selection method for text categorization
Title | A discriminative and semantic feature selection method for text categorization |
---|---|
Authors | Zong, W; Wu, F; Chu, LK; Sculli, D |
Keywords | Big data; Discriminative power; Feature selection; Semantic similarity; Support vector machine (SVM); Text categorization |
Issue Date | 2015 |
Citation | International Journal of Production Economics, 2015, v. 165, p. 215-222 |
Abstract | Text categorization is an important and critical task in the current era of high volume data storage and handling. Feature selection is obviously one of the most important steps in text categorization. Traditional feature selection methods tend to only consider the correlation between features and categories, and have in the main ignored the semantic similarity between features and documents. To further explore this issue, this paper proposes a novel feature selection method that first selects features in documents with discriminative power and then computes the semantic similarity between features and documents. The proposed feature selection method is tested using a support vector machine (SVM) classifier upon two published datasets, viz. Reuters-21578 and 20-Newsgroups. The experimental results show that the proposed feature selection method generally outperforms the traditional feature selection methods for text categorization for both published datasets. |
Persistent Identifier | http://hdl.handle.net/10722/208209 |
ISI Accession Number ID | WOS:000356110400021 |
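
The abstract contrasts the proposed method with traditional feature selection, which scores a feature only by its correlation with a category. As a rough illustration of that baseline (not of the paper's own method, whose details this record does not give), the sketch below computes the chi-square statistic, one widely used correlation score, for each term against a target category. The toy corpus, the category names, and the function name `chi_square_scores` are hypothetical and chosen only for this example.

```python
from collections import Counter


def chi_square_scores(docs, labels, target):
    """Return {term: chi-square score} for each term against `target`.

    Each score comes from the 2x2 table of (term present / absent)
    versus (document in target category / not in target category).
    """
    n = len(docs)
    n_pos = sum(1 for y in labels if y == target)
    n_neg = n - n_pos

    # Document frequencies of each term inside and outside the category.
    df_pos, df_neg = Counter(), Counter()
    for tokens, y in zip(docs, labels):
        (df_pos if y == target else df_neg).update(set(tokens))

    scores = {}
    for term in set(df_pos) | set(df_neg):
        a = df_pos[term]   # in category, term present
        b = df_neg[term]   # out of category, term present
        c = n_pos - a      # in category, term absent
        d = n_neg - b      # out of category, term absent
        denom = (a + c) * (b + d) * (a + b) * (c + d)
        scores[term] = n * (a * d - b * c) ** 2 / denom if denom else 0.0
    return scores


# Hypothetical four-document corpus with two categories.
docs = [
    ["oil", "price", "rise"],
    ["oil", "export", "fall"],
    ["wheat", "harvest", "rise"],
    ["wheat", "export", "price"],
]
labels = ["crude", "crude", "grain", "grain"]

scores = chi_square_scores(docs, labels, target="crude")

# Keep the two highest-scoring terms; ties are broken alphabetically.
top_terms = sorted(scores, key=lambda t: (-scores[t], t))[:2]
print(top_terms)
```

A score of this kind looks only at how terms co-occur with category labels. The semantic similarity between features and documents that the abstract describes would be an additional step, which this sketch does not attempt.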
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zong, W | en_US |
dc.contributor.author | Wu, F | en_US |
dc.contributor.author | Chu, LK | en_US |
dc.contributor.author | Sculli, D | en_US |
dc.date.accessioned | 2015-02-23T08:07:57Z | - |
dc.date.available | 2015-02-23T08:07:57Z | - |
dc.date.issued | 2015 | en_US |
dc.identifier.citation | International Journal of Production Economics, 2015, v. 165, p. 215-222 | en_US |
dc.identifier.uri | http://hdl.handle.net/10722/208209 | - |
dc.description.abstract | Text categorization is an important and critical task in the current era of high volume data storage and handling. Feature selection is obviously one of the most important steps in text categorization. Traditional feature selection methods tend to only consider the correlation between features and categories, and have in the main ignored the semantic similarity between features and documents. To further explore this issue, this paper proposes a novel feature selection method that first selects features in documents with discriminative power and then computes the semantic similarity between features and documents. The proposed feature selection method is tested using a support vector machine (SVM) classifier upon two published datasets, viz. Reuters-21578 and 20-Newsgroups. The experimental results show that the proposed feature selection method generally outperforms the traditional feature selection methods for text categorization for both published datasets. | en_US |
dc.language | eng | en_US |
dc.relation.ispartof | International Journal of Production Economics | en_US |
dc.subject | Big data | - |
dc.subject | Discriminative power | - |
dc.subject | Feature selection | - |
dc.subject | Semantic similarity | - |
dc.subject | Support vector machine (SVM) | - |
dc.subject | Text categorization | - |
dc.title | A discriminative and semantic feature selection method for text categorization | en_US |
dc.type | Article | en_US |
dc.identifier.email | Chu, LK: lkchu@hkucc.hku.hk | en_US |
dc.identifier.email | Sculli, D: hreidsc@hkucc.hku.hk | en_US |
dc.identifier.authority | Chu, LK=rp00113 | en_US |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1016/j.ijpe.2014.12.035 | en_US |
dc.identifier.scopus | eid_2-s2.0-84929519537 | - |
dc.identifier.hkuros | 242307 | en_US |
dc.identifier.volume | 165 | - |
dc.identifier.spage | 215 | - |
dc.identifier.epage | 222 | - |
dc.identifier.isi | WOS:000356110400021 | - |