File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1186/1471-2164-15-455
- Scopus: eid_2-s2.0-84902383419
- PMID: 24916671
- WOS: WOS:000337703600002
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: EFIN: predicting the functional impact of nonsynonymous single nucleotide polymorphisms in human genome
Title | EFIN: predicting the functional impact of nonsynonymous single nucleotide polymorphisms in human genome |
---|---|
Authors | |
Keywords | Amino acid conservation Coding mutation Evolutionary distance Functional impact nsSNP |
Issue Date | 25-Jun-2014 |
Publisher | BioMed Central Ltd. The Journal's web site is located at http://www.biomedcentral.com/bmcgenomics/ |
Citation | BMC genomics, 2014, v. 15, article no. 455 How to Cite? |
Abstract | Background Predicting the functional impact of amino acid substitutions (AAS) caused by nonsynonymous single nucleotide polymorphisms (nsSNPs) is becoming increasingly important as more and more novel variants are being discovered. Bioinformatics analysis is essential to predict potentially causal or contributing AAS to human diseases for further analysis, as for each genome, thousands of rare or private AAS exist and only a very small number of which are related to an underlying disease. Existing algorithms in this field still have high false prediction rate and novel development is needed to take full advantage of vast amount of genomic data. Results Here we report a novel algorithm that features two innovative changes: 1. making better use of sequence conservation information by grouping the homologous protein sequences into six blocks according to evolutionary distances to human and evaluating sequence conservation in each block independently, and 2. including as many such homologous sequences as possible in analyses. Random forests are used to evaluate sequence conservation in each block and to predict potential impact of an AAS on protein function. Testing of this algorithm on a comprehensive dataset showed significant improvement on prediction accuracy upon currently widely-used programs. The algorithm and a web-based application tool implementing it, EFIN (Evaluation of Functional Impact of Nonsynonymous SNPs) were made freely available (http://paed.hku.hk/efin/) to the public. Conclusions Grouping homologous sequences into different blocks according to evolutionary distance of the species to human and evaluating sequence conservation in each group independently significantly improved prediction accuracy. This approach may help us better understand the roles of genetic variants in human disease and health. |
Persistent Identifier | http://hdl.handle.net/10722/198060 |
ISSN | 2023 Impact Factor: 3.5 2023 SCImago Journal Rankings: 1.047 |
PubMed Central ID | |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zeng, S | en_US |
dc.contributor.author | Yang, J | en_US |
dc.contributor.author | Chung, BHY | en_US |
dc.contributor.author | Lau, YL | en_US |
dc.contributor.author | Yang, W | - |
dc.date.accessioned | 2014-06-25T02:43:48Z | - |
dc.date.available | 2014-06-25T02:43:48Z | - |
dc.date.issued | 2014-06-25 | - |
dc.identifier.citation | BMC genomics, 2014, v. 15, article no. 455 | en_US |
dc.identifier.issn | 1471-2164 | - |
dc.identifier.uri | http://hdl.handle.net/10722/198060 | - |
dc.description.abstract | Background Predicting the functional impact of amino acid substitutions (AAS) caused by nonsynonymous single nucleotide polymorphisms (nsSNPs) is becoming increasingly important as more and more novel variants are being discovered. Bioinformatics analysis is essential to predict potentially causal or contributing AAS to human diseases for further analysis, as for each genome, thousands of rare or private AAS exist and only a very small number of which are related to an underlying disease. Existing algorithms in this field still have high false prediction rate and novel development is needed to take full advantage of vast amount of genomic data. Results Here we report a novel algorithm that features two innovative changes: 1. making better use of sequence conservation information by grouping the homologous protein sequences into six blocks according to evolutionary distances to human and evaluating sequence conservation in each block independently, and 2. including as many such homologous sequences as possible in analyses. Random forests are used to evaluate sequence conservation in each block and to predict potential impact of an AAS on protein function. Testing of this algorithm on a comprehensive dataset showed significant improvement on prediction accuracy upon currently widely-used programs. The algorithm and a web-based application tool implementing it, EFIN (Evaluation of Functional Impact of Nonsynonymous SNPs) were made freely available (http://paed.hku.hk/efin/) to the public. Conclusions Grouping homologous sequences into different blocks according to evolutionary distance of the species to human and evaluating sequence conservation in each group independently significantly improved prediction accuracy. This approach may help us better understand the roles of genetic variants in human disease and health. | en_US |
dc.language | eng | en_US |
dc.publisher | BioMed Central Ltd. The Journal's web site is located at http://www.biomedcentral.com/bmcgenomics/ | - |
dc.relation.ispartof | BMC genomics | en_US |
dc.rights | BMC genomics. Copyright © BioMed Central Ltd | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject | Amino acid conservation | - |
dc.subject | Coding mutation | - |
dc.subject | Evolutionary distance | - |
dc.subject | Functional impact | - |
dc.subject | nsSNP | - |
dc.title | EFIN: predicting the functional impact of nonsynonymous single nucleotide polymorphisms in human genome | en_US |
dc.type | Article | en_US |
dc.identifier.email | Yang, J: jingy09@hku.hk | en_US |
dc.identifier.email | Chung, BHY: bhychung@hku.hk | en_US |
dc.identifier.email | Yang, W: yangwl@hkucc.hku.hk | en_US |
dc.identifier.authority | Chung, BHY=rp00473 | en_US |
dc.identifier.authority | Yang, W=rp00524 | en_US |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.1186/1471-2164-15-455 | - |
dc.identifier.pmid | 24916671 | - |
dc.identifier.pmcid | PMC4061446 | - |
dc.identifier.scopus | eid_2-s2.0-84902383419 | - |
dc.identifier.hkuros | 229168 | en_US |
dc.identifier.hkuros | 230885 | - |
dc.identifier.isi | WOS:000337703600002 | - |
dc.identifier.issnl | 1471-2164 | - |