Shrunken methodology to genome-wide SNPs selection and construction of SNPs networks

Liu, Yang; Ng, Michael

File Download

Content.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1186/1752-0509-4-S2-S5
Scopus: eid_2-s2.0-77956864585
PMID: 20840732
WOS: WOS:000208294800004
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Mathematics: Journal/Magazine Articles

Article: Shrunken methodology to genome-wide SNPs selection and construction of SNPs networks

Title	Shrunken methodology to genome-wide SNPs selection and construction of SNPs networks
Authors	Liu, Yang Ng, Michael
Issue Date	2010
Citation	BMC Systems Biology, 2010, v. 4, suppl. 2, article no. S5 How to Cite? DOI: http://dx.doi.org/10.1186/1752-0509-4-S2-S5
Abstract	Background: Recent development of high-resolution single nucleotide polymorphism (SNP) arrays allows detailed assessment of genome-wide human genome variations. There is increasing recognition of the importance of SNPs for medicine and developmental biology. However, SNP data set typically has a large number of SNPs (e.g., 400 thousand SNPs in genome-wide Parkinson disease data set) and a few hundred of samples. Conventional classification methods may not be effective when applied to such genome-wide SNP data. . Results: In this paper, we use shrunken dissimilarity measure to analyze and select relevant SNPs for classification problems. Examples of HapMap data and Parkinson disease (PD) data are given to demonstrate the effectiveness of the proposed method, and illustrate it has a potential to become a useful analysis tool for SNP data sets. We use Parkinson disease data as an example, and perform a whole genome analysis. For the 367440 SNPs with less than 1% missing percentage from all 22 chromosomes, we can select 357 SNPs from this data set. For the unique genes that those SNPs are located in, a gene-gene similarity value is computed using GOSemSim and gene pairs that has a similarity value being greater than a threshold are selected to construct several groups of genes. For the SNPs that involved in these groups of genes, a statistical software PLINK is employed to compute the pair-wise SNP-SNP interactions, and SNPs with significance of P < 0.01 are chosen to identify SNPs networks based on their P values. Here SNPs networks are constructed based on Gene Ontology knowledge, and therefore each SNP network plays a role in the biological process. An analysis shows that such networks have relationships directly or indirectly to Parkinson disease.Conclusions: Experimental results show that our approach is suitable to handle genetic variations, and provide useful knowledge in a genome-wide SNP study. © 2010 Ng and Liu; licensee BioMed Central Ltd.
Persistent Identifier	http://hdl.handle.net/10722/276869
ISSN	1752-0509 2018 Impact Factor: 2.048 2020 SCImago Journal Rankings: 0.976
PubMed Central ID	PMC2982692
ISI Accession Number ID	WOS:000208294800004

DC Field	Value	Language
dc.contributor.author	Liu, Yang	-
dc.contributor.author	Ng, Michael	-
dc.date.accessioned	2019-09-18T08:34:54Z	-
dc.date.available	2019-09-18T08:34:54Z	-
dc.date.issued	2010	-
dc.identifier.citation	BMC Systems Biology, 2010, v. 4, suppl. 2, article no. S5	-
dc.identifier.issn	1752-0509	-
dc.identifier.uri	http://hdl.handle.net/10722/276869	-
dc.description.abstract	Background: Recent development of high-resolution single nucleotide polymorphism (SNP) arrays allows detailed assessment of genome-wide human genome variations. There is increasing recognition of the importance of SNPs for medicine and developmental biology. However, SNP data set typically has a large number of SNPs (e.g., 400 thousand SNPs in genome-wide Parkinson disease data set) and a few hundred of samples. Conventional classification methods may not be effective when applied to such genome-wide SNP data. . Results: In this paper, we use shrunken dissimilarity measure to analyze and select relevant SNPs for classification problems. Examples of HapMap data and Parkinson disease (PD) data are given to demonstrate the effectiveness of the proposed method, and illustrate it has a potential to become a useful analysis tool for SNP data sets. We use Parkinson disease data as an example, and perform a whole genome analysis. For the 367440 SNPs with less than 1% missing percentage from all 22 chromosomes, we can select 357 SNPs from this data set. For the unique genes that those SNPs are located in, a gene-gene similarity value is computed using GOSemSim and gene pairs that has a similarity value being greater than a threshold are selected to construct several groups of genes. For the SNPs that involved in these groups of genes, a statistical software PLINK is employed to compute the pair-wise SNP-SNP interactions, and SNPs with significance of P < 0.01 are chosen to identify SNPs networks based on their P values. Here SNPs networks are constructed based on Gene Ontology knowledge, and therefore each SNP network plays a role in the biological process. An analysis shows that such networks have relationships directly or indirectly to Parkinson disease.Conclusions: Experimental results show that our approach is suitable to handle genetic variations, and provide useful knowledge in a genome-wide SNP study. © 2010 Ng and Liu; licensee BioMed Central Ltd.	-
dc.language	eng	-
dc.relation.ispartof	BMC Systems Biology	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.title	Shrunken methodology to genome-wide SNPs selection and construction of SNPs networks	-
dc.type	Article	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.1186/1752-0509-4-S2-S5	-
dc.identifier.pmid	20840732	-
dc.identifier.pmcid	PMC2982692	-
dc.identifier.scopus	eid_2-s2.0-77956864585	-
dc.identifier.volume	4	-
dc.identifier.issue	suppl. 2	-
dc.identifier.spage	article no. S5	-
dc.identifier.epage	article no. S5	-
dc.identifier.isi	WOS:000208294800004	-
dc.identifier.issnl	1752-0509	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Shrunken methodology to genome-wide SNPs selection and construction of SNPs networks

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats