File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Ancestry informative markers for fine-scale individual assignment to worldwide populations

TitleAncestry informative markers for fine-scale individual assignment to worldwide populations
Authors
Issue Date2010
Citation
Journal of Medical Genetics, 2010, v. 47, n. 12, p. 835-847 How to Cite?
AbstractBackground and aims: The analysis of large-scale genetic data from thousands of individuals has revealed the fact that subtle population genetic structure can be detected at levels that were previously unimaginable. Using the Human Genome Diversity Panel as reference (51 populations - 650,000 SNPs), this works describes a systematic evaluation of the resolution that can be achieved for the inference of genetic ancestry, even when small panels of genetic markers are used. Methods and results: A comprehensive investigation of human population structure around the world is undertaken by leveraging the power of Principal Components Analysis (PCA). The problem is dissected into hierarchical steps and a decision tree for the prediction of individual ancestry is proposed. A complete leave-one-out validation experiment demonstrates that, using all available SNPs, assignment of individuals to their self-reported populations of origin is essentially perfect. Ancestry informative genetic markers are selected using two different metrics (In and correlation with PCA scores). A thorough cross-validation experiment indicates that, in most cases here, the number of SNPs needed for ancestry inference can be successfully reduced to less than 0.1% of the original 650,000 while retaining close to 100% accuracy. This reduction can be achieved using a novel clustering-based redundancy removal algorithm that is also introduced here. Finally, the applicability of our suggested SNP panels is tested on HapMap Phase 3 populations. Conclusion: The proposed methods and ancestry informative marker panels, in combination with the increasingly more comprehensive databases of human genetic variation, open new horizons in a variety of fields, ranging from the study of human evolution and population history, to medical genetics and forensics.
Persistent Identifierhttp://hdl.handle.net/10722/254523
ISSN
2023 Impact Factor: 3.5
2023 SCImago Journal Rankings: 1.690
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorPaschou, Peristera-
dc.contributor.authorLewis, Jamey-
dc.contributor.authorJaved, Asif-
dc.contributor.authorDrineas, Petros-
dc.date.accessioned2018-06-19T15:40:47Z-
dc.date.available2018-06-19T15:40:47Z-
dc.date.issued2010-
dc.identifier.citationJournal of Medical Genetics, 2010, v. 47, n. 12, p. 835-847-
dc.identifier.issn0022-2593-
dc.identifier.urihttp://hdl.handle.net/10722/254523-
dc.description.abstractBackground and aims: The analysis of large-scale genetic data from thousands of individuals has revealed the fact that subtle population genetic structure can be detected at levels that were previously unimaginable. Using the Human Genome Diversity Panel as reference (51 populations - 650,000 SNPs), this works describes a systematic evaluation of the resolution that can be achieved for the inference of genetic ancestry, even when small panels of genetic markers are used. Methods and results: A comprehensive investigation of human population structure around the world is undertaken by leveraging the power of Principal Components Analysis (PCA). The problem is dissected into hierarchical steps and a decision tree for the prediction of individual ancestry is proposed. A complete leave-one-out validation experiment demonstrates that, using all available SNPs, assignment of individuals to their self-reported populations of origin is essentially perfect. Ancestry informative genetic markers are selected using two different metrics (In and correlation with PCA scores). A thorough cross-validation experiment indicates that, in most cases here, the number of SNPs needed for ancestry inference can be successfully reduced to less than 0.1% of the original 650,000 while retaining close to 100% accuracy. This reduction can be achieved using a novel clustering-based redundancy removal algorithm that is also introduced here. Finally, the applicability of our suggested SNP panels is tested on HapMap Phase 3 populations. Conclusion: The proposed methods and ancestry informative marker panels, in combination with the increasingly more comprehensive databases of human genetic variation, open new horizons in a variety of fields, ranging from the study of human evolution and population history, to medical genetics and forensics.-
dc.languageeng-
dc.relation.ispartofJournal of Medical Genetics-
dc.titleAncestry informative markers for fine-scale individual assignment to worldwide populations-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1136/jmg.2010.078212-
dc.identifier.pmid20921023-
dc.identifier.scopuseid_2-s2.0-78649635740-
dc.identifier.volume47-
dc.identifier.issue12-
dc.identifier.spage835-
dc.identifier.epage847-
dc.identifier.eissn1468-6244-
dc.identifier.isiWOS:000285685900008-
dc.identifier.issnl0022-2593-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats