Julie Baldwin
Instant Data: Finding and cataloguing externally hosted research datasets through automation
Baldwin, Julie; Green, Jonathan
Authors
Jonathan Green
Abstract
Increasingly, publishers, funders and institutions are focussing not only on open access to publications, but also open access to the underlying research data. There is a wealth of data archiving options available, most of which are external to an institution’s own research data repository. It has therefore become a challenge for institutions to keep track of what data is being published by its researchers, and where. We present a novel approach to address this though the development of a separate metadata-only research data catalogue, the contents of which are automatically harvested. We examine the structure and metadata profile of the catalogue, the means of automatic harvesting including evolution of the process from semi to fully automated, our review method and experiences of running the catalogue over the last year. Finally, we present future plans for consolidating the service, and development of workflows to enable future growth.
Citation
Baldwin, J., & Green, J. (2024, February). Instant Data: Finding and cataloguing externally hosted research datasets through automation. Presented at Open Research London, The Francis Crick Institute
Presentation Conference Type | Presentation / Talk |
---|---|
Conference Name | Open Research London |
Start Date | Feb 12, 2024 |
End Date | Feb 12, 2024 |
Acceptance Date | Nov 2, 2023 |
Online Publication Date | Feb 16, 2024 |
Publication Date | Feb 12, 2024 |
Deposit Date | Oct 16, 2024 |
Publicly Available Date | Oct 23, 2024 |
Peer Reviewed | Not Peer Reviewed |
DOI | https://doi.org/10.6084/M9.FIGSHARE.25233181 |
Keywords | research data, cataloging, automation, dspace, worktribe, repository |
Public URL | https://nottingham-repository.worktribe.com/output/33027479 |
Publisher URL | https://figshare.com/articles/presentation/Instant_Data_Finding_and_cataloguing_externally_hosted_research_datasets_through_automation/25233181?file=44572141 |
Other Repo URL | https://figshare.com/articles/presentation/Instant_Data_Finding_and_cataloguing_externally_hosted_research_datasets_through_automation/25233181?file=44572141 |
Files
Instant Data Finding and cataloguing externally hosted research datasets through automation_accessible
(4.9 Mb)
Presentation
Publisher Licence URL
https://creativecommons.org/licenses/by/4.0/
You might also like
We need timely access to mental health data: implications of the Goldacre review
(2023)
Journal Article
Neuroanatomical correlates of working memory performance in Neurofibromatosis 1
(2022)
Journal Article
Test Record
(2020)
Presentation / Conference Contribution
The spatial character of sensor technology
(2006)
Presentation / Conference Contribution
Expected, sensed, and desired: A framework for designing sensing-based interaction
(2005)
Journal Article
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search