Emily Griffiths
Findability of UK health datasets available for research: a mixed methods study
Griffiths, Emily; Joseph, Rebecca M.; Tilston, George; Thew, Sarah; Kapacee, Zoher; Dixon, William; Peek, Niels
Authors
Rebecca M. Joseph
George Tilston
Sarah Thew
Zoher Kapacee
William Dixon
Niels Peek
Abstract
Objective How health researchers find secondary data to analyse is unclear. We sought to describe the approaches that UK organisations take to help researchers find data and to assess the findability of health data that are available for research.
Methods We surveyed established organisations about how they make data findable. We derived measures of findability based on the first element of the FAIR principles (Findable, Accessible, Interoperable, Reproducible). We applied these to 13 UK health datasets and measured their findability via two major internet search engines in 2018 and repeated in 2021.
Results Among 12 survey respondents, 11 indicated that they made metadata publicly available. Respondents said internet presence was important for findability, but that this needed improvement. In 2018, 8 out of 13 datasets were listed in the top 100 search results of 10 searches repeated on both search engines, while the remaining 5 were found one click away from those search results. In 2021, this had reduced to seven datasets directly listed and one dataset one click away. In 2021, Google Dataset Search had become available, which listed 3 of the 13 datasets within the top 100 search results.
Discussion Measuring findability via online search engines is one method for evaluating efforts to improve findability. Findability could perhaps be improved with catalogues that have greater inclusion of datasets, field-level metadata and persistent identifiers.
Conclusion UK organisations recognised the importance of the internet for finding data for research. However, health datasets available for research were no more findable in 2021 than in 2018.
Citation
Griffiths, E., Joseph, R. M., Tilston, G., Thew, S., Kapacee, Z., Dixon, W., & Peek, N. (2022). Findability of UK health datasets available for research: a mixed methods study. BMJ Health & Care Informatics, 29(1), Article e100325. https://doi.org/10.1136/bmjhci-2021-100325
Journal Article Type | Article |
---|---|
Acceptance Date | Oct 25, 2021 |
Online Publication Date | Feb 22, 2022 |
Publication Date | Feb 22, 2022 |
Deposit Date | Mar 11, 2022 |
Publicly Available Date | Mar 14, 2022 |
Journal | BMJ Health & Care Informatics |
Print ISSN | 2632-1009 |
Electronic ISSN | 2632-1009 |
Publisher | BMJ Publishing Group |
Peer Reviewed | Peer Reviewed |
Volume | 29 |
Issue | 1 |
Article Number | e100325 |
DOI | https://doi.org/10.1136/bmjhci-2021-100325 |
Keywords | Health Information Management; Health Informatics; Computer Science Applications |
Public URL | https://nottingham-repository.worktribe.com/output/7511608 |
Publisher URL | https://informatics.bmj.com/content/29/1/e100325 |
Files
Griffiths BMJ Health Care Inform 2022
(542 Kb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by-nc/4.0/
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: digital-library-support@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search