File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Experiences in integrated data and research object publishing using GigaDB

TitleExperiences in integrated data and research object publishing using GigaDB
Authors
KeywordsReproducibility
Open-data
Data publishing
Computational biology
Data citation
Issue Date2017
PublisherSpringer. The Journal's web site is located at https://link.springer.com/journal/799
Citation
International Journal on Digital Libraries, v. 18 n. 2, p. 99–111 How to Cite?
AbstractIn the era of computation and data-driven research, traditional methods of disseminating research are no longer fit-for-purpose. New approaches for disseminating data, methods and results are required to maximize knowledge discovery. The “long tail” of small, unstructured datasets is well catered for by a number of general-purpose repositories, but there has been less support for “big data”. Outlined here are our experiences in attempting to tackle the gaps in publishing large-scale, computationally intensive research. GigaScience is an open-access, open-data journal aiming to revolutionize large-scale biological data dissemination, organization and re-use. Through use of the data handling infrastructure of the genomics centre BGI, GigaScience links standard manuscript publication with an integrated database (GigaDB) that hosts all associated data, and provides additional data analysis tools and computing resources. Furthermore, the supporting workflows and methods are also integrated to make published articles more transparent and open. GigaDB has released many new and previously unpublished datasets and data types, including as urgently needed data to tackle infectious disease outbreaks, cancer and the growing food crisis. Other “executable” research objects, such as workflows, virtual machines and software from several GigaScience articles have been archived and shared in reproducible, transparent and usable formats. With data citation producing evidence of, and credit for, its use in the wider research community, GigaScience demonstrates a move towards more executable publications. Here data analyses can be reproduced and built upon by users without coding backgrounds or heavy computational infrastructure in a more democratized manner.
Persistent Identifierhttp://hdl.handle.net/10722/279889
ISSN
2020 SCImago Journal Rankings: 0.367
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorEdmunds, SC-
dc.contributor.authorLi, P-
dc.contributor.authorHunter, CI-
dc.contributor.authorXiao, S-
dc.contributor.authorDavidson, RL-
dc.contributor.authorNogoy, N-
dc.contributor.authorGoodman, L-
dc.date.accessioned2019-12-18T07:24:39Z-
dc.date.available2019-12-18T07:24:39Z-
dc.date.issued2017-
dc.identifier.citationInternational Journal on Digital Libraries, v. 18 n. 2, p. 99–111-
dc.identifier.issn1432-5012-
dc.identifier.urihttp://hdl.handle.net/10722/279889-
dc.description.abstractIn the era of computation and data-driven research, traditional methods of disseminating research are no longer fit-for-purpose. New approaches for disseminating data, methods and results are required to maximize knowledge discovery. The “long tail” of small, unstructured datasets is well catered for by a number of general-purpose repositories, but there has been less support for “big data”. Outlined here are our experiences in attempting to tackle the gaps in publishing large-scale, computationally intensive research. GigaScience is an open-access, open-data journal aiming to revolutionize large-scale biological data dissemination, organization and re-use. Through use of the data handling infrastructure of the genomics centre BGI, GigaScience links standard manuscript publication with an integrated database (GigaDB) that hosts all associated data, and provides additional data analysis tools and computing resources. Furthermore, the supporting workflows and methods are also integrated to make published articles more transparent and open. GigaDB has released many new and previously unpublished datasets and data types, including as urgently needed data to tackle infectious disease outbreaks, cancer and the growing food crisis. Other “executable” research objects, such as workflows, virtual machines and software from several GigaScience articles have been archived and shared in reproducible, transparent and usable formats. With data citation producing evidence of, and credit for, its use in the wider research community, GigaScience demonstrates a move towards more executable publications. Here data analyses can be reproduced and built upon by users without coding backgrounds or heavy computational infrastructure in a more democratized manner.-
dc.languageeng-
dc.publisherSpringer. The Journal's web site is located at https://link.springer.com/journal/799-
dc.relation.ispartofInternational Journal on Digital Libraries-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectReproducibility-
dc.subjectOpen-data-
dc.subjectData publishing-
dc.subjectComputational biology-
dc.subjectData citation-
dc.titleExperiences in integrated data and research object publishing using GigaDB-
dc.typeArticle-
dc.identifier.emailXiao, S: szxiao@hku.hk-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.1007/s00799-016-0174-6-
dc.identifier.scopuseid_2-s2.0-84970969180-
dc.identifier.volume18-
dc.identifier.issue2-
dc.identifier.spage99-
dc.identifier.epage111-
dc.identifier.isiWOS:000406746000004-
dc.publisher.placeGermany-
dc.identifier.issnl1432-1300-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats