Artikel,

A practical guide to methods controlling false discoveries in computational biology

K. Korthauer, P. Kimes, C. Duvallet, A. Reyes, A. Subramanian, M. Teng, C. Shukla, E. Alm, und S. Hicks.
bioRxiv, (2018)
DOI: 10.1101/458786

Zusammenfassung

In high-throughput studies, hundreds to millions of hypotheses are typically tested. Statistical methods that control the false discovery rate (FDR) have emerged as popular and powerful tools for error rate control. While classic FDR methods use only p-values as input, more modern FDR methods have been shown to increase power by incorporating complementary information as "informative covariates" to prioritize, weight, and group hypotheses. However, there is currently no consensus on how the modern methods compare to one another. We investigated the accuracy, applicability, and ease of use of two classic and six modern FDR-controlling methods by performing a systematic benchmark comparison using simulation studies as well as six case studies in computational biology. Methods that incorporate informative covariates were modestly more powerful than classic approaches, and did not underperform classic approaches, even when the covariate was completely uninformative. The majority of methods were successful at controlling the FDR, with the exception of two modern methods under certain settings. Furthermore, we found the improvement of the modern FDR methods over the classic methods increased with the informativeness of the covariate, total number of hypothesis tests, and proportion of truly non-null hypotheses. Modern FDR methods that use an informative covariate provide advantages over classic FDR-controlling procedures, with the relative gain dependent on the application and informativeness of available covariates. We present our findings as a practical guide and provide recommendations to aid researchers in their choice of methods to correct for false discoveries.

BibTeX-Schlüssel: Korthauer458786
Eintragstyp: article
Jahr: 2018
Zeitschrift: bioRxiv
Verlag: Cold Spring Harbor Laboratory
eprint: https://www.biorxiv.org/content/early/2018/10/31/458786.full.pdf
DOI: 10.1101/458786
URL: https://www.biorxiv.org/content/early/2018/10/31/458786

Nutzer

Kommentare und Rezensionenanzeigen / verbergen

Bitte melden Sie sich an um selbst Rezensionen oder Kommentare zu erstellen.

Zitieren Sie diese Publikation

%0 Journal Article %1 Korthauer458786 %A Korthauer, Keegan %A Kimes, Patrick K %A Duvallet, Claire %A Reyes, Alejandro %A Subramanian, Ayshwarya %A Teng, Mingxiang %A Shukla, Chinmay %A Alm, Eric J %A Hicks, Stephanie C %D 2018 %I Cold Spring Harbor Laboratory %J bioRxiv %K MUSTREAD false-discovery-rate-correction methods statistics %R 10.1101/458786 %T A practical guide to methods controlling false discoveries in computational biology %U https://www.biorxiv.org/content/early/2018/10/31/458786 %X In high-throughput studies, hundreds to millions of hypotheses are typically tested. Statistical methods that control the false discovery rate (FDR) have emerged as popular and powerful tools for error rate control. While classic FDR methods use only p-values as input, more modern FDR methods have been shown to increase power by incorporating complementary information as "informative covariates" to prioritize, weight, and group hypotheses. However, there is currently no consensus on how the modern methods compare to one another. We investigated the accuracy, applicability, and ease of use of two classic and six modern FDR-controlling methods by performing a systematic benchmark comparison using simulation studies as well as six case studies in computational biology. Methods that incorporate informative covariates were modestly more powerful than classic approaches, and did not underperform classic approaches, even when the covariate was completely uninformative. The majority of methods were successful at controlling the FDR, with the exception of two modern methods under certain settings. Furthermore, we found the improvement of the modern FDR methods over the classic methods increased with the informativeness of the covariate, total number of hypothesis tests, and proportion of truly non-null hypotheses. Modern FDR methods that use an informative covariate provide advantages over classic FDR-controlling procedures, with the relative gain dependent on the application and informativeness of available covariates. We present our findings as a practical guide and provide recommendations to aid researchers in their choice of methods to correct for false discoveries.

@article{Korthauer458786, abstract = {In high-throughput studies, hundreds to millions of hypotheses are typically tested. Statistical methods that control the false discovery rate (FDR) have emerged as popular and powerful tools for error rate control. While classic FDR methods use only p-values as input, more modern FDR methods have been shown to increase power by incorporating complementary information as "informative covariates" to prioritize, weight, and group hypotheses. However, there is currently no consensus on how the modern methods compare to one another. We investigated the accuracy, applicability, and ease of use of two classic and six modern FDR-controlling methods by performing a systematic benchmark comparison using simulation studies as well as six case studies in computational biology. Methods that incorporate informative covariates were modestly more powerful than classic approaches, and did not underperform classic approaches, even when the covariate was completely uninformative. The majority of methods were successful at controlling the FDR, with the exception of two modern methods under certain settings. Furthermore, we found the improvement of the modern FDR methods over the classic methods increased with the informativeness of the covariate, total number of hypothesis tests, and proportion of truly non-null hypotheses. Modern FDR methods that use an informative covariate provide advantages over classic FDR-controlling procedures, with the relative gain dependent on the application and informativeness of available covariates. We present our findings as a practical guide and provide recommendations to aid researchers in their choice of methods to correct for false discoveries.}, added-at = {2018-11-01T18:21:55.000+0100}, author = {Korthauer, Keegan and Kimes, Patrick K and Duvallet, Claire and Reyes, Alejandro and Subramanian, Ayshwarya and Teng, Mingxiang and Shukla, Chinmay and Alm, Eric J and Hicks, Stephanie C}, biburl = {https://www.bibsonomy.org/bibtex/2655ca474edd9e3af1b41e6c59964e938/marcsaric}, description = {A practical guide to methods controlling false discoveries in computational biology | bioRxiv}, doi = {10.1101/458786}, eprint = {https://www.biorxiv.org/content/early/2018/10/31/458786.full.pdf}, interhash = {164ee5bbdb9ea865d2cce743cc79a985}, intrahash = {655ca474edd9e3af1b41e6c59964e938}, journal = {bioRxiv}, keywords = {MUSTREAD false-discovery-rate-correction methods statistics}, publisher = {Cold Spring Harbor Laboratory}, timestamp = {2019-05-01T16:47:55.000+0200}, title = {A practical guide to methods controlling false discoveries in computational biology}, url = {https://www.biorxiv.org/content/early/2018/10/31/458786}, year = 2018 }

BibSonomy

A practical guide to methods controlling false discoveries in computational biology

Zusammenfassung

Tags

Nutzer

Kommentare und Rezensionenanzeigen / verbergen

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf