Combining two national-scale data sets to map soil properties, the case of available magnesium in England and Wales: combining two soil surveys

Lark, R. M.; Ander, E. L.; Broadley, M. R.

doi:10.1111/ejss.12743

Combining two national-scale data sets to map soil properties, the case of available magnesium in England and Wales: combining two soil surveys

Lark, R. M.; Ander, E. L.; Broadley, M. R.

Authors

Professor MURRAY LARK MURRAY.LARK@NOTTINGHAM.AC.UK
PROFESSOR OF GEOINFORMATICS

E. L. Ander

Professor MARTIN BROADLEY MARTIN.BROADLEY@NOTTINGHAM.AC.UK
PROFESSOR OF PLANT NUTRITION

Abstract

© 2018 The Authors. European Journal of Soil Science published by John Wiley & Sons Ltd on behalf of British Society of Soil Science. Given the costs of soil survey it is necessary to make the best use of available datasets, but data that differ with respect to some aspect of the sampling or analytical protocol cannot be combined simply. In this paper we consider a case where two datasets were available on the concentration of plant-available magnesium in the topsoil. The datasets were the Representative Soil Sampling Scheme (RSSS) and the National Soil Inventory (NSI) of England and Wales. The variable was measured over the same depth interval and with the same laboratory method, but the sample supports were different and so the datasets differ in their variance. We used a multivariate geostatistical model, the linear model of coregionalization (LMCR), to model the joint spatial distribution of the two datasets. The model allowed us to elucidate the effects of the sample support on the two datasets, and to show that there was a strong correlation between the underlying variables. The LMCR allowed us to make spatial predictions of the variable on the RSSS support by cokriging the RSSS data with the NSI data. We used cross-validation to test the validity of the LMCR and showed how incorporating the NSI data restricted the range of prediction error variances relative to univariate ordinary kriging predictions from the RSSS data alone. The standardized squared prediction errors were computed and the coverage of prediction intervals (i.e. the proportion of sites at which the prediction interval included the observed value of the variable). Both these statistics suggested that the prediction error variances were consistent for the cokriging predictions but not for the ordinary kriging predictions from the simple combination of the RSSS and NSI data, which might be proposed on the basis of their very similar mean values. The LMCR is therefore proposed as a general tool for the combined analysis of different datasets on soil properties. Highlights: Differences in sample support mean that two datasets on a soil property cannot be combined simply. We showed how a multivariate geostatistical model can be used to elucidate the relationships between two such datasets. The same model allows soil properties to be mapped jointly from such data. This offers a general basis for combining soil datasets from diverse sources.

Citation

Lark, R. M., Ander, E. L., & Broadley, M. R. (2019). Combining two national-scale data sets to map soil properties, the case of available magnesium in England and Wales: combining two soil surveys. European Journal of Soil Science, 70(2), 361-377. https://doi.org/10.1111/ejss.12743

Journal Article Type	Article
Acceptance Date	Sep 10, 2018
Online Publication Date	Oct 5, 2018
Publication Date	Mar 1, 2019
Deposit Date	Oct 15, 2018
Publicly Available Date	Jan 21, 2019
Journal	European Journal of Soil Science
Print ISSN	1351-0754
Electronic ISSN	1365-2389
Publisher	Wiley
Peer Reviewed	Peer Reviewed
Volume	70
Issue	2
Pages	361-377
DOI	https://doi.org/10.1111/ejss.12743
Keywords	Soil Science
Public URL	https://nottingham-repository.worktribe.com/output/1166013
Publisher URL	https://onlinelibrary.wiley.com/doi/abs/10.1111/ejss.12743
Contract Date	Oct 15, 2018