Conditional variable importance for random forests

C. Strobl, A. Boulesteix, T. Kneib, T. Augustin, and A. Zeileis. BMC Bioinformatics, 9 (1): 307 (2008)
DOI: 10.1186/1471-2105-9-307

Abstract

Background: Random forests are becoming increasingly popular in many scientific fields because they can cope with "small n large p" problems, complex interactions and even highly correlated predictor variables. Their variable importance measures have recently been suggested as screening tools for, e.g., gene expression studies. However, these variable importance measures show a bias towards correlated predictor variables.

Results: We identify two mechanisms responsible for this finding: (i) A preference for the selection of correlated predictors in the tree building process and (ii) an additional advantage for correlated predictor variables induced by the unconditional permutation scheme that is employed in the computation of the variable importance measure. Based on these considerations we develop a new, conditional permutation scheme for the computation of the variable importance measure.

Conclusion: The resulting conditional variable importance reflects the true impact of each predictor variable more reliably than the original marginal approach.
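The difference between the unconditional (marginal) and the conditional permutation scheme can be illustrated with a small sketch. It is not the authors' implementation: a fixed linear function stands in for a fitted forest, importance is taken as the mean increase in squared error after permutation, and the conditioning grid is built from quantile bins of the correlated predictor rather than from tree split points. The names `permute_within_groups`, `permutation_importance`, and `quantile_bins`, and all simulated data, are hypothetical.

```python
import random

def mse(model, rows, y):
    """Mean squared error of model predictions on rows."""
    return sum((model(r) - t) ** 2 for r, t in zip(rows, y)) / len(y)

def permute_within_groups(rows, j, groups, rng):
    """Copy rows, shuffling column j only among rows sharing a group label."""
    new_rows = [list(r) for r in rows]
    members = {}
    for i, g in enumerate(groups):
        members.setdefault(g, []).append(i)
    for idx in members.values():
        values = [rows[i][j] for i in idx]
        rng.shuffle(values)
        for i, v in zip(idx, values):
            new_rows[i][j] = v
    return new_rows

def permutation_importance(model, rows, y, j, groups, rng, repeats=20):
    """Average increase in MSE after permuting column j within groups."""
    base = mse(model, rows, y)
    deltas = [mse(model, permute_within_groups(rows, j, groups, rng), y) - base
              for _ in range(repeats)]
    return sum(deltas) / repeats

def quantile_bins(values, n_bins):
    """Assign each value to one of n_bins equally populated bins."""
    order = sorted(range(len(values)), key=values.__getitem__)
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = rank * n_bins // len(values)
    return bins

# Simulated data (assumption): x2 is strongly correlated with x1,
# but the response depends on x1 only.
rng = random.Random(0)
n = 2000
x1 = [rng.gauss(0, 1) for _ in range(n)]
x2 = [a + rng.gauss(0, 0.3) for a in x1]
y = [a + rng.gauss(0, 0.5) for a in x1]
rows = list(zip(x1, x2))

def model(r):
    # Stand-in for a fitted forest that spreads weight over both
    # correlated predictors.
    return 0.5 * r[0] + 0.5 * r[1]

# Marginal scheme: one group, so x2 is shuffled across all rows.
marginal = permutation_importance(model, rows, y, 1, [0] * n, rng)
# Conditional scheme: x2 is shuffled only within bins of x1.
conditional = permutation_importance(model, rows, y, 1, quantile_bins(x1, 20), rng)

print("marginal importance of x2:   ", marginal)
print("conditional importance of x2:", conditional)
```

In this setup the marginal permutation destroys the association between x2 and x1 as well as that between x2 and the response, so x2 appears important even though it has no effect of its own. Permuting within bins of x1 approximately preserves the correlation structure, and the importance attributed to x2 shrinks accordingly.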

Links and resources

BibTeX key: strobl_conditional_2008
entry type: article
year: 2008
journal: BMC Bioinformatics
number: 1
pages: 307
volume: 9
issn: 1471-2105
DOI: 10.1186/1471-2105-9-307
urldate: 2013-06-28
url: http://www.biomedcentral.com.proxy.lib.uiowa.edu/1471-2105/9/307/abstract

Meta data

Last update 7 years ago
Created 7 years ago
