File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TCBB.2012.63
- Scopus: eid_2-s2.0-84864947491
- PMID: 22547432
- WOS: WOS:000307299200017
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: Gene selection using iterative feature elimination random forests for survival outcomes
Title | Gene selection using iterative feature elimination random forests for survival outcomes |
---|---|
Authors | |
Keywords | Cancer gene selection iterative feature elimination microarrays random forest survival |
Issue Date | 2012 |
Citation | IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2012, v. 9 n. 5, p. 1422-1431 How to Cite? |
Abstract | Although many feature selection methods for classification have been developed, there is a need to identify genes in high-dimensional data with censored survival outcomes. Traditional methods for gene selection in classification problems have several drawbacks. First, the majority of the gene selection approaches for classification are single-gene based. Second, many of the gene selection procedures are not embedded within the algorithm itself. The technique of random forests has been found to perform well in high-dimensional data settings with survival outcomes. It also has an embedded feature to identify variables of importance. Therefore, it is an ideal candidate for gene selection in high-dimensional data with survival outcomes. In this paper, we develop a novel method based on the random forests to identify a set of prognostic genes. We compare our method with several machine learning methods and various node split criteria using several real data sets. Our method performed well in both simulations and real data analysis. Additionally, we have shown the advantages of our approach over single-gene-based approaches. Our method incorporates multivariate correlations in microarray data for survival outcomes. The described method allows us to better utilize the information available from microarray data with survival outcomes. © 2004-2012 IEEE. |
Persistent Identifier | http://hdl.handle.net/10722/194433 |
ISSN | 2023 Impact Factor: 3.6 2023 SCImago Journal Rankings: 0.794 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Pang, H | - |
dc.contributor.author | George, SL | - |
dc.contributor.author | Hui, K | - |
dc.contributor.author | Tong, T | - |
dc.date.accessioned | 2014-01-30T03:32:35Z | - |
dc.date.available | 2014-01-30T03:32:35Z | - |
dc.date.issued | 2012 | - |
dc.identifier.citation | IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2012, v. 9 n. 5, p. 1422-1431 | - |
dc.identifier.issn | 1545-5963 | - |
dc.identifier.uri | http://hdl.handle.net/10722/194433 | - |
dc.description.abstract | Although many feature selection methods for classification have been developed, there is a need to identify genes in high-dimensional data with censored survival outcomes. Traditional methods for gene selection in classification problems have several drawbacks. First, the majority of the gene selection approaches for classification are single-gene based. Second, many of the gene selection procedures are not embedded within the algorithm itself. The technique of random forests has been found to perform well in high-dimensional data settings with survival outcomes. It also has an embedded feature to identify variables of importance. Therefore, it is an ideal candidate for gene selection in high-dimensional data with survival outcomes. In this paper, we develop a novel method based on the random forests to identify a set of prognostic genes. We compare our method with several machine learning methods and various node split criteria using several real data sets. Our method performed well in both simulations and real data analysis. Additionally, we have shown the advantages of our approach over single-gene-based approaches. Our method incorporates multivariate correlations in microarray data for survival outcomes. The described method allows us to better utilize the information available from microarray data with survival outcomes. © 2004-2012 IEEE. | - |
dc.language | eng | - |
dc.relation.ispartof | IEEE/ACM Transactions on Computational Biology and Bioinformatics | - |
dc.subject | Cancer | - |
dc.subject | gene selection | - |
dc.subject | iterative feature elimination | - |
dc.subject | microarrays | - |
dc.subject | random forest | - |
dc.subject | survival | - |
dc.title | Gene selection using iterative feature elimination random forests for survival outcomes | - |
dc.type | Article | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/TCBB.2012.63 | - |
dc.identifier.pmid | 22547432 | - |
dc.identifier.scopus | eid_2-s2.0-84864947491 | - |
dc.identifier.volume | 9 | - |
dc.identifier.issue | 5 | - |
dc.identifier.spage | 1422 | - |
dc.identifier.epage | 1431 | - |
dc.identifier.isi | WOS:000307299200017 | - |
dc.identifier.issnl | 1545-5963 | - |