Gene selection using iterative feature elimination random forests for survival outcomes

Pang, H; George, SL; Hui, K; Tong, T

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/TCBB.2012.63
Scopus: eid_2-s2.0-84864947491
PMID: 22547432
WOS: WOS:000307299200017
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Public Health: Journal/Magazine Articles

Article: Gene selection using iterative feature elimination random forests for survival outcomes

Title	Gene selection using iterative feature elimination random forests for survival outcomes
Authors	Pang, H George, SL Hui, K Tong, T
Keywords	Cancer gene selection iterative feature elimination microarrays random forest survival
Issue Date	2012
Citation	IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2012, v. 9 n. 5, p. 1422-1431 How to Cite? DOI: http://dx.doi.org/10.1109/TCBB.2012.63
Abstract	Although many feature selection methods for classification have been developed, there is a need to identify genes in high-dimensional data with censored survival outcomes. Traditional methods for gene selection in classification problems have several drawbacks. First, the majority of the gene selection approaches for classification are single-gene based. Second, many of the gene selection procedures are not embedded within the algorithm itself. The technique of random forests has been found to perform well in high-dimensional data settings with survival outcomes. It also has an embedded feature to identify variables of importance. Therefore, it is an ideal candidate for gene selection in high-dimensional data with survival outcomes. In this paper, we develop a novel method based on the random forests to identify a set of prognostic genes. We compare our method with several machine learning methods and various node split criteria using several real data sets. Our method performed well in both simulations and real data analysis. Additionally, we have shown the advantages of our approach over single-gene-based approaches. Our method incorporates multivariate correlations in microarray data for survival outcomes. The described method allows us to better utilize the information available from microarray data with survival outcomes. © 2004-2012 IEEE.
Persistent Identifier	http://hdl.handle.net/10722/194433
ISSN	1545-5963 2023 Impact Factor: 3.6 2023 SCImago Journal Rankings: 0.794
ISI Accession Number ID	WOS:000307299200017

DC Field	Value	Language
dc.contributor.author	Pang, H	-
dc.contributor.author	George, SL	-
dc.contributor.author	Hui, K	-
dc.contributor.author	Tong, T	-
dc.date.accessioned	2014-01-30T03:32:35Z	-
dc.date.available	2014-01-30T03:32:35Z	-
dc.date.issued	2012	-
dc.identifier.citation	IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2012, v. 9 n. 5, p. 1422-1431	-
dc.identifier.issn	1545-5963	-
dc.identifier.uri	http://hdl.handle.net/10722/194433	-
dc.description.abstract	Although many feature selection methods for classification have been developed, there is a need to identify genes in high-dimensional data with censored survival outcomes. Traditional methods for gene selection in classification problems have several drawbacks. First, the majority of the gene selection approaches for classification are single-gene based. Second, many of the gene selection procedures are not embedded within the algorithm itself. The technique of random forests has been found to perform well in high-dimensional data settings with survival outcomes. It also has an embedded feature to identify variables of importance. Therefore, it is an ideal candidate for gene selection in high-dimensional data with survival outcomes. In this paper, we develop a novel method based on the random forests to identify a set of prognostic genes. We compare our method with several machine learning methods and various node split criteria using several real data sets. Our method performed well in both simulations and real data analysis. Additionally, we have shown the advantages of our approach over single-gene-based approaches. Our method incorporates multivariate correlations in microarray data for survival outcomes. The described method allows us to better utilize the information available from microarray data with survival outcomes. © 2004-2012 IEEE.	-
dc.language	eng	-
dc.relation.ispartof	IEEE/ACM Transactions on Computational Biology and Bioinformatics	-
dc.subject	Cancer	-
dc.subject	gene selection	-
dc.subject	iterative feature elimination	-
dc.subject	microarrays	-
dc.subject	random forest	-
dc.subject	survival	-
dc.title	Gene selection using iterative feature elimination random forests for survival outcomes	-
dc.type	Article	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1109/TCBB.2012.63	-
dc.identifier.pmid	22547432	-
dc.identifier.scopus	eid_2-s2.0-84864947491	-
dc.identifier.volume	9	-
dc.identifier.issue	5	-
dc.identifier.spage	1422	-
dc.identifier.epage	1431	-
dc.identifier.isi	WOS:000307299200017	-
dc.identifier.issnl	1545-5963	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Gene selection using iterative feature elimination random forests for survival outcomes

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats