Skip to main content

Research Repository

Advanced Search

Choosing Sample Sizes for Statistical Measures on Interval-Valued Data

McCulloch, Josie; Ellerby, Zack; Wagner, Christian

Choosing Sample Sizes for Statistical Measures on Interval-Valued Data Thumbnail


Authors

Josie McCulloch

Zack Ellerby



Abstract

Intervals have frequently been used in the literature to represent uncertainty in data, from eliciting uncertain judgements from experts to representing uncertainty in sensor measurements. This widespread use of intervals has led to research on interval statistics to help understand the data. However, even seemingly trivial statistics (such as variance) cannot be calculated on interval-valued data using the same approach as for point data without incurring substantial loss of precision to a level which can make results close to useless. This loss of precision makes it challenging for decision makers to appropriately interpret interval-valued data using familiar statistics. Although there exist several approaches to computing statistics such as variance, these are all developed for specific properties of the data, and there is no general-case method. In addition, there are many statistical measures for which no efficient and accurate method exist. For such cases, we can use a Monte Carlo sampling approach to generate approximate statistics. While sampling does not generally produce exact solutions, it can provide a useful and efficient approximation to a desired degree of accuracy given sufficient computational resources. In this paper, we focus on the application of Monte Carlo sampling to generate statistics for interval-valued data. Specifically, we explore the optimum sample size required to calculate statistics on interval-valued data for a given degree of accuracy desired. We compare different sizes of data and different sampling methods to demonstrate how these affect the choice of an optimum sample size.

Citation

McCulloch, J., Ellerby, Z., & Wagner, C. (2020). Choosing Sample Sizes for Statistical Measures on Interval-Valued Data. In 2020 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) (1-8). https://doi.org/10.1109/FUZZ48607.2020.9177745

Conference Name 2020 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)
Conference Location Glasgow, United Kingdom
Start Date Jul 19, 2020
End Date Jul 24, 2020
Acceptance Date Mar 20, 2020
Online Publication Date Aug 26, 2020
Publication Date 2020-07
Deposit Date May 1, 2020
Publicly Available Date Jul 31, 2020
Pages 1-8
Series ISSN 1558-4739
Book Title 2020 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)
ISBN 978-1-7281-6933-0
DOI https://doi.org/10.1109/FUZZ48607.2020.9177745
Public URL https://nottingham-repository.worktribe.com/output/4370778
Publisher URL https://ieeexplore.ieee.org/document/9177745
Related Public URLs https://wcci2020.org/
https://2020.wcci-virtual.org/presentation/oral/choosing-sample-sizes-statistical-measures-interval-valued-data
Additional Information © 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Files




You might also like



Downloadable Citations