File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1007/s00799-016-0174-6
- Scopus: eid_2-s2.0-84970969180
- WOS: WOS:000406746000004
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: Experiences in integrated data and research object publishing using GigaDB
Title | Experiences in integrated data and research object publishing using GigaDB |
---|---|
Authors | |
Keywords | Reproducibility Open-data Data publishing Computational biology Data citation |
Issue Date | 2017 |
Publisher | Springer. The Journal's web site is located at https://link.springer.com/journal/799 |
Citation | International Journal on Digital Libraries, v. 18 n. 2, p. 99–111 How to Cite? |
Abstract | In the era of computation and data-driven research, traditional methods of disseminating research are no longer fit-for-purpose. New approaches for disseminating data, methods and results are required to maximize knowledge discovery. The “long tail” of small, unstructured datasets is well catered for by a number of general-purpose repositories, but there has been less support for “big data”. Outlined here are our experiences in attempting to tackle the gaps in publishing large-scale, computationally intensive research. GigaScience is an open-access, open-data journal aiming to revolutionize large-scale biological data dissemination, organization and re-use. Through use of the data handling infrastructure of the genomics centre BGI, GigaScience links standard manuscript publication with an integrated database (GigaDB) that hosts all associated data, and provides additional data analysis tools and computing resources. Furthermore, the supporting workflows and methods are also integrated to make published articles more transparent and open. GigaDB has released many new and previously unpublished datasets and data types, including as urgently needed data to tackle infectious disease outbreaks, cancer and the growing food crisis. Other “executable” research objects, such as workflows, virtual machines and software from several GigaScience articles have been archived and shared in reproducible, transparent and usable formats. With data citation producing evidence of, and credit for, its use in the wider research community, GigaScience demonstrates a move towards more executable publications. Here data analyses can be reproduced and built upon by users without coding backgrounds or heavy computational infrastructure in a more democratized manner. |
Persistent Identifier | http://hdl.handle.net/10722/279889 |
ISSN | 2023 Impact Factor: 1.6 2023 SCImago Journal Rankings: 0.406 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Edmunds, SC | - |
dc.contributor.author | Li, P | - |
dc.contributor.author | Hunter, CI | - |
dc.contributor.author | Xiao, S | - |
dc.contributor.author | Davidson, RL | - |
dc.contributor.author | Nogoy, N | - |
dc.contributor.author | Goodman, L | - |
dc.date.accessioned | 2019-12-18T07:24:39Z | - |
dc.date.available | 2019-12-18T07:24:39Z | - |
dc.date.issued | 2017 | - |
dc.identifier.citation | International Journal on Digital Libraries, v. 18 n. 2, p. 99–111 | - |
dc.identifier.issn | 1432-5012 | - |
dc.identifier.uri | http://hdl.handle.net/10722/279889 | - |
dc.description.abstract | In the era of computation and data-driven research, traditional methods of disseminating research are no longer fit-for-purpose. New approaches for disseminating data, methods and results are required to maximize knowledge discovery. The “long tail” of small, unstructured datasets is well catered for by a number of general-purpose repositories, but there has been less support for “big data”. Outlined here are our experiences in attempting to tackle the gaps in publishing large-scale, computationally intensive research. GigaScience is an open-access, open-data journal aiming to revolutionize large-scale biological data dissemination, organization and re-use. Through use of the data handling infrastructure of the genomics centre BGI, GigaScience links standard manuscript publication with an integrated database (GigaDB) that hosts all associated data, and provides additional data analysis tools and computing resources. Furthermore, the supporting workflows and methods are also integrated to make published articles more transparent and open. GigaDB has released many new and previously unpublished datasets and data types, including as urgently needed data to tackle infectious disease outbreaks, cancer and the growing food crisis. Other “executable” research objects, such as workflows, virtual machines and software from several GigaScience articles have been archived and shared in reproducible, transparent and usable formats. With data citation producing evidence of, and credit for, its use in the wider research community, GigaScience demonstrates a move towards more executable publications. Here data analyses can be reproduced and built upon by users without coding backgrounds or heavy computational infrastructure in a more democratized manner. | - |
dc.language | eng | - |
dc.publisher | Springer. The Journal's web site is located at https://link.springer.com/journal/799 | - |
dc.relation.ispartof | International Journal on Digital Libraries | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject | Reproducibility | - |
dc.subject | Open-data | - |
dc.subject | Data publishing | - |
dc.subject | Computational biology | - |
dc.subject | Data citation | - |
dc.title | Experiences in integrated data and research object publishing using GigaDB | - |
dc.type | Article | - |
dc.identifier.email | Xiao, S: szxiao@hku.hk | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.1007/s00799-016-0174-6 | - |
dc.identifier.scopus | eid_2-s2.0-84970969180 | - |
dc.identifier.volume | 18 | - |
dc.identifier.issue | 2 | - |
dc.identifier.spage | 99 | - |
dc.identifier.epage | 111 | - |
dc.identifier.isi | WOS:000406746000004 | - |
dc.publisher.place | Germany | - |
dc.identifier.issnl | 1432-1300 | - |