File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Re-annotation of protein-coding genes in 10 complete genomes of Neisseriaceae family by combining similarity-based and composition-based methods

TitleRe-annotation of protein-coding genes in 10 complete genomes of Neisseriaceae family by combining similarity-based and composition-based methods
Authors
KeywordsThe Neisseriaceae family
Re-annotation
Newly found genes
Eliminated non-coding ORFs
Newly assigned functions
Issue Date2013
PublisherOxford University Press for Kazusa DNA Research Institute. The Journal's web site is located at http://dnaresearch.oxfordjournals.org
Citation
DNA Research, 2013, v. 20 n. 3, p. 273-286 How to Cite?
AbstractIn this paper, we performed a comprehensive re-annotation of protein-coding genes by a systematic method combining composition- and similarity-based approaches in 10 complete bacterial genomes of the family Neisseriaceae. First, 418 hypothetical genes were predicted as non-coding using the composition-based method and 413 were eliminated from the gene list. Both the scatter plot and cluster of orthologous groups (COG) fraction analyses supported the result. Second, from 20 to 400 hypothetical proteins were assigned with functions in each of the 10 strains based on the homology search. Among newly assigned functions, 397 are so detailed to have definite gene names. Third, 106 genes missed by the original annotations were picked up by an ab initio gene finder combined with similarity alignment. Transcriptional experiments validated the effectiveness of this method in Laribacter hongkongensis and Chromobacterium violaceum. Among the 106 newly found genes, some deserve particular interests. For example, 27 transposases were newly found in Neiserria meningitidis alpha14. In Neiserria gonorrhoeae NCCP11945, four new genes with putative functions and definite names (nusG, rpsN, rpmD and infA) were found and homologues of them usually are essential for survival in bacteria. The updated annotations for the 10 Neisseriaceae genomes provide a more accurate prediction of protein-coding genes and a more detailed functional information of hypothetical proteins. It will benefit research into the lifestyle, metabolism, environmental adaption and pathogenicity of the Neisseriaceae species. The re-annotation procedure could be used directly, or after the adaption of detailed methods, for checking annotations of any other bacterial or archaeal genomes.
Persistent Identifierhttp://hdl.handle.net/10722/184602
ISSN
2021 Impact Factor: 4.477
2020 SCImago Journal Rankings: 1.647
PubMed Central ID
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorGuo, F-
dc.contributor.authorXiong, L-
dc.contributor.authorTeng, JLL-
dc.contributor.authorYuen, KY-
dc.contributor.authorLau, SKP-
dc.contributor.authorWoo, PCY-
dc.date.accessioned2013-07-15T09:57:36Z-
dc.date.available2013-07-15T09:57:36Z-
dc.date.issued2013-
dc.identifier.citationDNA Research, 2013, v. 20 n. 3, p. 273-286-
dc.identifier.issn1340-2838-
dc.identifier.urihttp://hdl.handle.net/10722/184602-
dc.description.abstractIn this paper, we performed a comprehensive re-annotation of protein-coding genes by a systematic method combining composition- and similarity-based approaches in 10 complete bacterial genomes of the family Neisseriaceae. First, 418 hypothetical genes were predicted as non-coding using the composition-based method and 413 were eliminated from the gene list. Both the scatter plot and cluster of orthologous groups (COG) fraction analyses supported the result. Second, from 20 to 400 hypothetical proteins were assigned with functions in each of the 10 strains based on the homology search. Among newly assigned functions, 397 are so detailed to have definite gene names. Third, 106 genes missed by the original annotations were picked up by an ab initio gene finder combined with similarity alignment. Transcriptional experiments validated the effectiveness of this method in Laribacter hongkongensis and Chromobacterium violaceum. Among the 106 newly found genes, some deserve particular interests. For example, 27 transposases were newly found in Neiserria meningitidis alpha14. In Neiserria gonorrhoeae NCCP11945, four new genes with putative functions and definite names (nusG, rpsN, rpmD and infA) were found and homologues of them usually are essential for survival in bacteria. The updated annotations for the 10 Neisseriaceae genomes provide a more accurate prediction of protein-coding genes and a more detailed functional information of hypothetical proteins. It will benefit research into the lifestyle, metabolism, environmental adaption and pathogenicity of the Neisseriaceae species. The re-annotation procedure could be used directly, or after the adaption of detailed methods, for checking annotations of any other bacterial or archaeal genomes.-
dc.languageeng-
dc.publisherOxford University Press for Kazusa DNA Research Institute. The Journal's web site is located at http://dnaresearch.oxfordjournals.org-
dc.relation.ispartofDNA Research-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectThe Neisseriaceae family-
dc.subjectRe-annotation-
dc.subjectNewly found genes-
dc.subjectEliminated non-coding ORFs-
dc.subjectNewly assigned functions-
dc.titleRe-annotation of protein-coding genes in 10 complete genomes of Neisseriaceae family by combining similarity-based and composition-based methods-
dc.typeArticle-
dc.identifier.emailGuo, F: fbguohku@hku.hk-
dc.identifier.emailTeng, JLL: llteng@hku.hk-
dc.identifier.emailYuen, KY: kyyuen@hkucc.hku.hk-
dc.identifier.emailLau, SKP: skplau@hkucc.hku.hk-
dc.identifier.emailWoo, PCY: pcywoo@hkucc.hku.hk-
dc.identifier.authorityTeng, JLL=rp00277-
dc.identifier.authorityYuen, KY=rp00366-
dc.identifier.authorityLau, SKP=rp00486-
dc.identifier.authorityWoo, PCY=rp00430-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.1093/dnares/dst009-
dc.identifier.pmid23571676-
dc.identifier.pmcidPMC3686433-
dc.identifier.scopuseid_2-s2.0-84884186033-
dc.identifier.hkuros216406-
dc.identifier.volume20-
dc.identifier.issue3-
dc.identifier.spage273-
dc.identifier.epage286-
dc.identifier.isiWOS:000321404200007-
dc.publisher.placeUnited Kingdom-
dc.identifier.issnl1340-2838-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats