Experiences in integrated data and research object publishing using GigaDB

Edmunds, SC; Li, P; Hunter, CI; Xiao, S; Davidson, RL; Nogoy, N; Goodman, L

File Download

Content.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1007/s00799-016-0174-6
Scopus: eid_2-s2.0-84970969180
WOS: WOS:000406746000004
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Libraries: Journal/Magazine Articles

Article: Experiences in integrated data and research object publishing using GigaDB

Title	Experiences in integrated data and research object publishing using GigaDB
Authors	Edmunds, SC Li, P Hunter, CI Xiao, S Davidson, RL Nogoy, N Goodman, L
Keywords	Reproducibility Open-data Data publishing Computational biology Data citation
Issue Date	2017
Publisher	Springer. The Journal's web site is located at https://link.springer.com/journal/799
Citation	International Journal on Digital Libraries, v. 18 n. 2, p. 99–111 How to Cite? DOI: http://dx.doi.org/10.1007/s00799-016-0174-6
Abstract	In the era of computation and data-driven research, traditional methods of disseminating research are no longer fit-for-purpose. New approaches for disseminating data, methods and results are required to maximize knowledge discovery. The “long tail” of small, unstructured datasets is well catered for by a number of general-purpose repositories, but there has been less support for “big data”. Outlined here are our experiences in attempting to tackle the gaps in publishing large-scale, computationally intensive research. GigaScience is an open-access, open-data journal aiming to revolutionize large-scale biological data dissemination, organization and re-use. Through use of the data handling infrastructure of the genomics centre BGI, GigaScience links standard manuscript publication with an integrated database (GigaDB) that hosts all associated data, and provides additional data analysis tools and computing resources. Furthermore, the supporting workflows and methods are also integrated to make published articles more transparent and open. GigaDB has released many new and previously unpublished datasets and data types, including as urgently needed data to tackle infectious disease outbreaks, cancer and the growing food crisis. Other “executable” research objects, such as workflows, virtual machines and software from several GigaScience articles have been archived and shared in reproducible, transparent and usable formats. With data citation producing evidence of, and credit for, its use in the wider research community, GigaScience demonstrates a move towards more executable publications. Here data analyses can be reproduced and built upon by users without coding backgrounds or heavy computational infrastructure in a more democratized manner.
Persistent Identifier	http://hdl.handle.net/10722/279889
ISSN	1432-5012 2020 SCImago Journal Rankings: 0.367
ISI Accession Number ID	WOS:000406746000004

DC Field	Value	Language
dc.contributor.author	Edmunds, SC	-
dc.contributor.author	Li, P	-
dc.contributor.author	Hunter, CI	-
dc.contributor.author	Xiao, S	-
dc.contributor.author	Davidson, RL	-
dc.contributor.author	Nogoy, N	-
dc.contributor.author	Goodman, L	-
dc.date.accessioned	2019-12-18T07:24:39Z	-
dc.date.available	2019-12-18T07:24:39Z	-
dc.date.issued	2017	-
dc.identifier.citation	International Journal on Digital Libraries, v. 18 n. 2, p. 99–111	-
dc.identifier.issn	1432-5012	-
dc.identifier.uri	http://hdl.handle.net/10722/279889	-
dc.description.abstract	In the era of computation and data-driven research, traditional methods of disseminating research are no longer fit-for-purpose. New approaches for disseminating data, methods and results are required to maximize knowledge discovery. The “long tail” of small, unstructured datasets is well catered for by a number of general-purpose repositories, but there has been less support for “big data”. Outlined here are our experiences in attempting to tackle the gaps in publishing large-scale, computationally intensive research. GigaScience is an open-access, open-data journal aiming to revolutionize large-scale biological data dissemination, organization and re-use. Through use of the data handling infrastructure of the genomics centre BGI, GigaScience links standard manuscript publication with an integrated database (GigaDB) that hosts all associated data, and provides additional data analysis tools and computing resources. Furthermore, the supporting workflows and methods are also integrated to make published articles more transparent and open. GigaDB has released many new and previously unpublished datasets and data types, including as urgently needed data to tackle infectious disease outbreaks, cancer and the growing food crisis. Other “executable” research objects, such as workflows, virtual machines and software from several GigaScience articles have been archived and shared in reproducible, transparent and usable formats. With data citation producing evidence of, and credit for, its use in the wider research community, GigaScience demonstrates a move towards more executable publications. Here data analyses can be reproduced and built upon by users without coding backgrounds or heavy computational infrastructure in a more democratized manner.	-
dc.language	eng	-
dc.publisher	Springer. The Journal's web site is located at https://link.springer.com/journal/799	-
dc.relation.ispartof	International Journal on Digital Libraries	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject	Reproducibility	-
dc.subject	Open-data	-
dc.subject	Data publishing	-
dc.subject	Computational biology	-
dc.subject	Data citation	-
dc.title	Experiences in integrated data and research object publishing using GigaDB	-
dc.type	Article	-
dc.identifier.email	Xiao, S: szxiao@hku.hk	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.1007/s00799-016-0174-6	-
dc.identifier.scopus	eid_2-s2.0-84970969180	-
dc.identifier.volume	18	-
dc.identifier.issue	2	-
dc.identifier.spage	99	-
dc.identifier.epage	111	-
dc.identifier.isi	WOS:000406746000004	-
dc.publisher.place	Germany	-
dc.identifier.issnl	1432-1300	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Experiences in integrated data and research object publishing using GigaDB

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats