Dr ISAAC TRIGUERO VELAZQUEZ I.TrigueroVelazquez@nottingham.ac.uk
ASSOCIATE PROFESSOR
A first attempt on global evolutionary undersampling for imbalanced big data
Triguero, Isaac; Galar, M.; Bustince, H.; Herrera, Francisco
Authors
M. Galar
H. Bustince
Francisco Herrera
Abstract
The design of efficient big data learning models has become a common need in a great number of applications. The massive amounts of available data may hinder the use of traditional data mining techniques, especially when evolutionary algorithms are involved as a key step. Existing solutions typically follow a divide-and-conquer approach in which the data is split into several chunks that are addressed individually. Next, the partial knowledge acquired from every slice of data is aggregated in multiple ways to solve the entire problem. However, these approaches are missing a global view of the data as a whole, which may result in less accurate models.
In this work we carry out a first attempt on the design of a global evolutionary undersampling model for imbalanced classification problems. These are characterised by having a highly skewed distribution of classes in which evolutionary models are being used to balance it by selecting only the most relevant data. Using Apache Spark as big data technology, we have introduced a number of variations to the well-known CHC algorithm to work very large chromosomes and reduce the costs associated to fitness evaluation. We discuss some preliminary results, showing the great potential of this new kind of evolutionary big data model.
Citation
Triguero, I., Galar, M., Bustince, H., & Herrera, F. A first attempt on global evolutionary undersampling for imbalanced big data. Presented at IEEE Congress on Evolutionary Computation (CEC 2017)
Conference Name | IEEE Congress on Evolutionary Computation (CEC 2017) |
---|---|
End Date | Jun 8, 2017 |
Acceptance Date | Mar 3, 2017 |
Publication Date | Jul 7, 2017 |
Deposit Date | Jul 10, 2017 |
Publicly Available Date | Jul 10, 2017 |
Peer Reviewed | Peer Reviewed |
Public URL | https://nottingham-repository.worktribe.com/output/871647 |
Publisher URL | http://ieeexplore.ieee.org/document/7969553/ |
Contract Date | Jul 10, 2017 |
Files
EUSglobal.pdf
(440 Kb)
PDF
You might also like
Machine Learning Pipeline for Energy and Environmental Prediction in Cold Storage Facilities
(2024)
Journal Article
Local-global methods for generalised solar irradiance forecasting
(2024)
Journal Article
Explaining time series classifiers through meaningful perturbation and optimisation
(2023)
Journal Article
Identifying bird species by their calls in Soundscapes
(2023)
Journal Article
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search