Skip to main content

Research Repository

Advanced Search

Interval-Valued Regression - Sensitivity to Data Set Features

Kabir, Shaily; Wagner, Christian

Authors



Abstract

Regression represents one of the most basic building blocks of data analysis and AI. Despite growing interest in interval-valued data across various fields, approaches to establish regression models for interval-valued data which address and handle the specific properties of given data sets are very limited. For broader use and adoption of regression for intervals, this paper conducts a sensitivity analysis of key extant linear regression approaches in respect to important features of interval-valued data sets, such as the mean and associated standard deviation of the range (size) of the intervals within the data set-a measure of overall size and size-diversity, and the dispersion of interval-centers-a measure of diversity in terms of interval position. Experiments with carefully designed synthetic exemplar data sets with these properties suggest that distant placement of intervals as well as higher standard deviation (uncertainty) of ranges increase estimation errors; that is, they result in lower linear regression model fitness for all regression methods as may intuitively be expected. However, these errors are lower for the best suited Parameterized model in comparison to the MinMax and Constrained Center and Range methods. This paper sheds light on the behaviour and 'expectable' performance of key linear regression models designed for interval-valued data and adds a further building block to supporting the broader adoption of intervals (and subsequently fuzzy sets) as a fundamental data type in AI.

Citation

Kabir, S., & Wagner, C. (2021, July). Interval-Valued Regression - Sensitivity to Data Set Features. Presented at 2021 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), Luxembourg, Luxembourg

Presentation Conference Type Conference Paper (published)
Conference Name 2021 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)
Start Date Jul 11, 2021
End Date Jul 14, 2021
Acceptance Date Feb 7, 2025
Online Publication Date Aug 5, 2021
Publication Date Jul 11, 2021
Deposit Date Feb 28, 2025
Peer Reviewed Peer Reviewed
Pages 1-7
DOI https://doi.org/10.1109/fuzz45933.2021.9494554
Keywords Index Terms-Intervals; regression; uncertainty; linear
Public URL https://nottingham-repository.worktribe.com/output/45861647
Publisher URL https://ieeexplore.ieee.org/document/9494554