Elizabeth Dolan
Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models
Dolan, Elizabeth; Goulding, James; Marshall, Harry; Smith, Gavin; Long, Gavin; Tata, Laila J.
Authors
Dr JAMES GOULDING JAMES.GOULDING@NOTTINGHAM.AC.UK
PROFESSOR OF DATA SCIENCE
Harry Marshall
Dr Gavin Smith GAVIN.SMITH@NOTTINGHAM.AC.UK
ASSOCIATE PROFESSOR
GAVIN LONG Gavin.Long1@nottingham.ac.uk
Research Fellow
Professor LAILA TATA laila.tata@nottingham.ac.uk
PROFESSOR OF EPIDEMIOLOGY
Abstract
The COVID-19 pandemic led to unparalleled pressure on healthcare services. Improved healthcare planning in relation to diseases affecting the respiratory system has consequently become a key concern. We investigated the value of integrating sales of non-prescription medications commonly bought for managing respiratory symptoms, to improve forecasting of weekly registered deaths from respiratory disease at local levels across England, by using over 2 billion transactions logged by a UK high street retailer from March 2016 to March 2020. We report the results from the novel AI (Artificial Intelligence) explainability variable importance tool Model Class Reliance implemented on the PADRUS model (Prediction of Amount of Deaths by Respiratory disease Using Sales). PADRUS is a machine learning model optimised to predict registered deaths from respiratory disease in 314 local authority areas across England through the integration of shopping sales data and focused on purchases of non-prescription medications. We found strong evidence that models incorporating sales data significantly out-perform other models that solely use variables traditionally associated with respiratory disease (e.g. sociodemographics and weather data). Accuracy gains are highest (increases in R2 (coefficient of determination) between 0.09 to 0.11) in periods of maximum risk to the general public. Results demonstrate the potential to utilise sales data to monitor population health with information at a high level of geographic granularity.
Citation
Dolan, E., Goulding, J., Marshall, H., Smith, G., Long, G., & Tata, L. J. (2023). Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models. Nature Communications, 14, Article 7258. https://doi.org/10.1038/s41467-023-42776-4
Journal Article Type | Article |
---|---|
Acceptance Date | Oct 20, 2023 |
Online Publication Date | Nov 21, 2023 |
Publication Date | Dec 1, 2023 |
Deposit Date | Nov 2, 2023 |
Publicly Available Date | Nov 22, 2023 |
Journal | Nature Communications |
Electronic ISSN | 2041-1723 |
Publisher | Nature Publishing Group |
Peer Reviewed | Peer Reviewed |
Volume | 14 |
Article Number | 7258 |
DOI | https://doi.org/10.1038/s41467-023-42776-4 |
Keywords | General Physics and Astronomy; General Biochemistry, Genetics and Molecular Biology; General Chemistry; Multidisciplinary |
Public URL | https://nottingham-repository.worktribe.com/output/26805889 |
Publisher URL | https://www.nature.com/articles/s41467-023-42776-4 |
Additional Information | Received: 12 June 2023; Accepted: 20 October 2023; First Online: 21 November 2023; : The authors declare no competing interests. |
Files
s41467-023-42776-4
(4.1 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by/4.0/
You might also like
Who consumes anthocyanins and anthocyanidins? Mining national retail data to reveal the influence of socioeconomic deprivation and seasonality on polyphenol dietary intake
(2023)
Presentation / Conference Contribution
Data donation of individual shopping data to help predict the occurrence of disease: A pilot study linking individual loyalty card and health survey data to investigate COVID-19
(2023)
Presentation / Conference Contribution
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search