Dr Gavin Smith GAVIN.SMITH@NOTTINGHAM.AC.UK
ASSOCIATE PROFESSOR
A novel symbolization technique for time-series outlier detection
Smith, Gavin; Goulding, James
Authors
Dr JAMES GOULDING JAMES.GOULDING@NOTTINGHAM.AC.UK
PROFESSOR OF DATA SCIENCE
Abstract
The detection of outliers in time series data is a core component of many data-mining applications and broadly applied in industrial applications. In large data sets algorithms that are efficient in both time and space are required. One area where speed and storage costs can be reduced is via symbolization as a pre-processing step, additionally opening up the use of an array of discrete algorithms. With this common pre-processing step in mind, this work highlights that (1) existing symbolization approaches are designed to address problems other than outlier detection and are hence sub-optimal and (2) use of off-the-shelf symbolization techniques can therefore lead to significant unnecessary data corruption and potential performance loss when outlier detection is a key aspect of the data mining task at hand. Addressing this a novel symbolization method is motivated specifically targeting the end use application of outlier detection. The method is empirically shown to outperform existing approaches.
Citation
Smith, G., & Goulding, J. A novel symbolization technique for time-series outlier detection. Presented at 2015 IEEE International Conference on Big Data
Conference Name | 2015 IEEE International Conference on Big Data |
---|---|
Acceptance Date | Sep 30, 2015 |
Online Publication Date | Dec 28, 2015 |
Publication Date | Oct 29, 2015 |
Deposit Date | Jun 8, 2018 |
Publicly Available Date | Jun 8, 2018 |
Peer Reviewed | Peer Reviewed |
Book Title | 2015 IEEE International Conference on Big Data (Big Data) |
DOI | https://doi.org/10.1109/BigData.2015.7364037 |
Keywords | Detection; Preprocessing; Symbolization; Quantization; Optimization; Time series; Data mining |
Public URL | https://nottingham-repository.worktribe.com/output/762986 |
Publisher URL | https://ieeexplore.ieee.org/document/7364037/ |
Additional Information | © 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Contract Date | Jun 8, 2018 |
Files
noval symbolization.pdf
(325 Kb)
PDF
You might also like
Detecting iodine deficiency risks from dietary transitions using shopping data
(2024)
Journal Article
Bundle entropy as an optimized measure of consumers' systematic product choice combinations in mass transactional data
(2022)
Presentation / Conference Contribution
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search