Artikel,

Conflation of Short Identity-by-Descent Segments Bias Their Inferred Length Distribution

C. Chiang, P. Ralph, und J. Novembre.
G3 (Bethesda), 6 (5): 1287-1296 (Mai 2016)
DOI: 10.1534/g3.116.027581

Zusammenfassung

Identity-by-descent (IBD) is a fundamental concept in genetics with many applications. In a common definition, two haplotypes are said to share an IBD segment if that segment is inherited from a recent shared common ancestor without intervening recombination. Segments several cM long can be efficiently detected by a number of algorithms using high-density SNP array data from a population sample, and there are currently efforts to detect shorter segments from sequencing. Here, we study a problem of identifiability: because existing approaches detect IBD based on contiguous segments of identity-by-state, inferred long segments of IBD may arise from the conflation of smaller, nearby IBD segments. We quantified this effect using coalescent simulations, finding that significant proportions of inferred segments 1-2 cM long are results of conflations of two or more shorter segments, each at least 0.2 cM or longer, under demographic scenarios typical for modern humans for all programs tested. The impact of such conflation is much smaller for longer (> 2 cM) segments. This biases the inferred IBD segment length distribution, and so can affect downstream inferences that depend on the assumption that each segment of IBD derives from a single common ancestor. As an example, we present and analyze an estimator of the de novo mutation rate using IBD segments, and demonstrate that unmodeled conflation leads to underestimates of the ages of the common ancestors on these segments, and hence a significant overestimate of the mutation rate. Understanding the conflation effect in detail will make its correction in future methods more tractable.

BibTeX-Schlüssel: chiang2016conflation
Eintragstyp: article
Jahr: 2016
Monat: 05
Zeitschrift: G3 (Bethesda)
Nummer: 5
Seiten: 1287-1296
Band: 6
pmid: 26935417
DOI: 10.1534/g3.116.027581
URL: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4856080/

Nutzer

Kommentare und Rezensionenanzeigen / verbergen

Bitte melden Sie sich an um selbst Rezensionen oder Kommentare zu erstellen.

Zitieren Sie diese Publikation

%0 Journal Article %1 chiang2016conflation %A Chiang, C W %A Ralph, P %A Novembre, J %D 2016 %J G3 (Bethesda) %K IBD haplotype_inference haplotype_length myown %N 5 %P 1287-1296 %R 10.1534/g3.116.027581 %T Conflation of Short Identity-by-Descent Segments Bias Their Inferred Length Distribution %U https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4856080/ %V 6 %X Identity-by-descent (IBD) is a fundamental concept in genetics with many applications. In a common definition, two haplotypes are said to share an IBD segment if that segment is inherited from a recent shared common ancestor without intervening recombination. Segments several cM long can be efficiently detected by a number of algorithms using high-density SNP array data from a population sample, and there are currently efforts to detect shorter segments from sequencing. Here, we study a problem of identifiability: because existing approaches detect IBD based on contiguous segments of identity-by-state, inferred long segments of IBD may arise from the conflation of smaller, nearby IBD segments. We quantified this effect using coalescent simulations, finding that significant proportions of inferred segments 1-2 cM long are results of conflations of two or more shorter segments, each at least 0.2 cM or longer, under demographic scenarios typical for modern humans for all programs tested. The impact of such conflation is much smaller for longer (> 2 cM) segments. This biases the inferred IBD segment length distribution, and so can affect downstream inferences that depend on the assumption that each segment of IBD derives from a single common ancestor. As an example, we present and analyze an estimator of the de novo mutation rate using IBD segments, and demonstrate that unmodeled conflation leads to underestimates of the ages of the common ancestors on these segments, and hence a significant overestimate of the mutation rate. Understanding the conflation effect in detail will make its correction in future methods more tractable.

@article{chiang2016conflation, abstract = {Identity-by-descent (IBD) is a fundamental concept in genetics with many applications. In a common definition, two haplotypes are said to share an IBD segment if that segment is inherited from a recent shared common ancestor without intervening recombination. Segments several cM long can be efficiently detected by a number of algorithms using high-density SNP array data from a population sample, and there are currently efforts to detect shorter segments from sequencing. Here, we study a problem of identifiability: because existing approaches detect IBD based on contiguous segments of identity-by-state, inferred long segments of IBD may arise from the conflation of smaller, nearby IBD segments. We quantified this effect using coalescent simulations, finding that significant proportions of inferred segments 1-2 cM long are results of conflations of two or more shorter segments, each at least 0.2 cM or longer, under demographic scenarios typical for modern humans for all programs tested. The impact of such conflation is much smaller for longer (> 2 cM) segments. This biases the inferred IBD segment length distribution, and so can affect downstream inferences that depend on the assumption that each segment of IBD derives from a single common ancestor. As an example, we present and analyze an estimator of the de novo mutation rate using IBD segments, and demonstrate that unmodeled conflation leads to underestimates of the ages of the common ancestors on these segments, and hence a significant overestimate of the mutation rate. Understanding the conflation effect in detail will make its correction in future methods more tractable.}, added-at = {2018-09-27T19:41:30.000+0200}, author = {Chiang, C W and Ralph, P and Novembre, J}, biburl = {https://www.bibsonomy.org/bibtex/2e93cf4550804086aee502079156a4599/peter.ralph}, doi = {10.1534/g3.116.027581}, interhash = {ba4af628e0c364c0c97c508bfb546a5d}, intrahash = {e93cf4550804086aee502079156a4599}, journal = {G3 (Bethesda)}, keywords = {IBD haplotype_inference haplotype_length myown}, month = {05}, number = 5, pages = {1287-1296}, pmid = {26935417}, timestamp = {2018-09-27T19:41:30.000+0200}, title = {Conflation of Short Identity-by-Descent Segments Bias Their Inferred Length Distribution}, url = {https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4856080/}, volume = 6, year = 2016 }

BibSonomy

Conflation of Short Identity-by-Descent Segments Bias Their Inferred Length Distribution

Zusammenfassung

Tags

Nutzer

Kommentare und Rezensionenanzeigen / verbergen

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf