Skip to main content

Research Repository

Advanced Search

All Outputs (16)

Hyper-Stacked: Scalable and Distributed Approach to AutoML for Big Data (2023)
Conference Proceeding
Dave, R., Angarita-Zapata, J. S., & Triguero, I. (2023). Hyper-Stacked: Scalable and Distributed Approach to AutoML for Big Data. In Machine Learning and Knowledge Extraction (82-102). https://doi.org/10.1007/978-3-031-40837-3_6

The emergence of Machine Learning (ML) has altered how researchers and business professionals value data. Applicable to almost every industry, considerable amounts of time are wasted creating bespoke applications and repetitively hand-tuning models t... Read More about Hyper-Stacked: Scalable and Distributed Approach to AutoML for Big Data.

Feature Importance Identification for Time Series Classifiers (2022)
Conference Proceeding
Meng, H., Wagner, C., & Triguero, I. (2022). Feature Importance Identification for Time Series Classifiers. In 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC) (3293-3298). https://doi.org/10.1109/smc53654.2022.9945205

Time series classification is a challenging research area where machine learning techniques such as deep learning perform well, yet lack interpretability. Identifying the most important features for such classifiers provides a pathway to improving th... Read More about Feature Importance Identification for Time Series Classifiers.

A Local Search with a Surrogate Assisted Option for Instance Reduction (2020)
Conference Proceeding
Neri, F., & Triguero, I. (2020). A Local Search with a Surrogate Assisted Option for Instance Reduction. In Applications of Evolutionary Computation (578-594). https://doi.org/10.1007/978-3-030-43722-0_37

© 2020, Springer Nature Switzerland AG. In data mining, instance reduction is a key data pre-processing step that simplifies and cleans raw data, by either selecting or creating new samples, before applying a learning algorithm. This usually yields t... Read More about A Local Search with a Surrogate Assisted Option for Instance Reduction.

Fuzzy Hot Spot Identification for Big Data: An Initial Approach (2019)
Conference Proceeding
Triguero, I., Tickle, R., Figueredo, G. P., Mesgarpour, M., Ozcan, E., & John, R. I. (2019). Fuzzy Hot Spot Identification for Big Data: An Initial Approach. In 2019 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE). https://doi.org/10.1109/FUZZ-IEEE.2019.8858979

Hot spot identification problems are present across a wide range of areas, such as transportation, health care and energy. Hot spots are locations where a certain type of event occurs with high frequency. A recent big data approach is capable of iden... Read More about Fuzzy Hot Spot Identification for Big Data: An Initial Approach.

A Preliminary Approach for the Exploitation of Citizen Science Data for Fast and Robust Fuzzy k-Nearest Neighbour Classification (2019)
Conference Proceeding
Jimenez, M., Torres, M. T., John, R., & Triguero, I. (2019). A Preliminary Approach for the Exploitation of Citizen Science Data for Fast and Robust Fuzzy k-Nearest Neighbour Classification. In 2019 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) (1-6). https://doi.org/10.1109/FUZZ-IEEE.2019.8858830

Citizen science is becoming mainstream in a wide variety of real-world applications in astronomy or bioinformatics, in which, for example, classification tasks by experts are very time consuming. These projects engage amateur volunteers that are task... Read More about A Preliminary Approach for the Exploitation of Citizen Science Data for Fast and Robust Fuzzy k-Nearest Neighbour Classification.

Evolving Deep CNN-LSTMs for Inventory Time Series Prediction (2019)
Conference Proceeding
Xue, N., Triguero, I., Figueredo, G. P., & Landa-Silva, D. (2019). Evolving Deep CNN-LSTMs for Inventory Time Series Prediction. . https://doi.org/10.1109/CEC.2019.8789957

Inventory forecasting is a key component of effective inventory management. In this work, we utilise hybrid deep learning models for inventory forecasting. According to the highly nonlinear and non-stationary characteristics of inventory data, the mo... Read More about Evolving Deep CNN-LSTMs for Inventory Time Series Prediction.

A Simulation-based Optimisation Approach for Inventory Management of Highly Perishable Food (2019)
Conference Proceeding
Xue, N., Landa-Silva, D., Figueredo, G. P., & Triguero, I. (2019). A Simulation-based Optimisation Approach for Inventory Management of Highly Perishable Food. In Proceedings of the 8th International Conference on Operations Research and Enterprise Systems - Volume 1: ICORES (406-413). https://doi.org/10.5220/0007401304060413

The taste and freshness of perishable foods decrease dramatically with time. Effective inventory management requires understanding of market demand as well as balancing customers needs and references with products’ shelf life. The objective is to av... Read More about A Simulation-based Optimisation Approach for Inventory Management of Highly Perishable Food.

A preliminary study on Hybrid Spill-Tree Fuzzy k-Nearest Neighbors for big data classification (2018)
Conference Proceeding
Maillo, J., Luengo, J., Garcia, S., Herrera, F., & Triguero, I. (2018). A preliminary study on Hybrid Spill-Tree Fuzzy k-Nearest Neighbors for big data classification. In 2018 IEEE International Conference on Fuzzy Systems (FUXX-IEEE) (1-8). https://doi.org/10.1109/FUZZ-IEEE.2018.8491595

The Fuzzy k Nearest Neighbor (Fuzzy kNN) classifier is well known for its effectiveness in supervised learning problems. kNN classifies by comparing new incoming examples with a similarity function using the samples of the training set. The fuzzy ver... Read More about A preliminary study on Hybrid Spill-Tree Fuzzy k-Nearest Neighbors for big data classification.

A Preliminary Study of the Feasibility of Global Evolutionary Feature Selection for Big Datasets under Apache Spark (2018)
Conference Proceeding
Galar, M., Triguero, I., Bustince, H., & Herrera, F. (2018). A Preliminary Study of the Feasibility of Global Evolutionary Feature Selection for Big Datasets under Apache Spark. In 2018 IEEE Congress on Evolutionary Computation (CEC) - Proceedings (1-8). https://doi.org/10.1109/CEC.2018.8477878

Designing efficient learning models capable of dealing with tons of data has become a reality in the era of big data. However, the amount of available data is too much for traditional data mining techniques to be applicable. This issue is even more s... Read More about A Preliminary Study of the Feasibility of Global Evolutionary Feature Selection for Big Datasets under Apache Spark.

A genetic algorithm with composite chromosome for shift assignment of part-time employees (2018)
Conference Proceeding
Xue, N., Landa-Silva, D., Triguero, I., & Figueredo, G. P. (2018). A genetic algorithm with composite chromosome for shift assignment of part-time employees.

Personnel scheduling problems involve multiple tasks, including assigning shifts to workers. The purpose is usually to satisfy objectives and constraints arising from management, labour unions and employee preferences. The shift assignment problem is... Read More about A genetic algorithm with composite chromosome for shift assignment of part-time employees.

A preliminary study on automatic algorithm selection for short-term traffic forecasting (2018)
Conference Proceeding
Angarita-Zapata, J. S., Triguero, I., & Masegosa, A. D. (2018). A preliminary study on automatic algorithm selection for short-term traffic forecasting. In Intelligent Distributed Computing XII. , (204-214). https://doi.org/10.1007/978-3-319-99626-4_18

© 2018, Springer Nature Switzerland AG. Despite the broad range of Machine Learning (ML) algorithms, there are no clear baselines to find the best method and its configuration given a Short-Term Traffic Forecasting (STTF) problem. In ML, this is know... Read More about A preliminary study on automatic algorithm selection for short-term traffic forecasting.

Vehicle incident hot spots identification: An approach for big data (2017)
Conference Proceeding
Triguero, I., Figueredo, G. P., Mesgarpour, M., Garibaldi, J. M., & John, R. (2017). Vehicle incident hot spots identification: An approach for big data. In Proceedings - 16th IEEE International Conference on Trust, Security and Privacy in Computing and Communications; 11th IEEE International Conference on Big Data Science and Engineering (IEEE BigDataSE); and 14th IEEE International Conference on Embedded Software and Systems, (901-908). https://doi.org/10.1109/Trustcom/BigDataSE/ICESS.2017.329

In this work we introduce a fast big data approach for road incident hot spot identification using Apache Spark. We implement an existing immuno-inspired mechanism, namely SeleSup, as a series of MapReduce-like operations. SeleSup is composed of a nu... Read More about Vehicle incident hot spots identification: An approach for big data.

Exact fuzzy k-Nearest neighbor classification for big datasets (2017)
Conference Proceeding
Maillo, J., Luengo, J., García, S., Herrera, F., & Triguero, I. (2017). Exact fuzzy k-Nearest neighbor classification for big datasets.

The k-Nearest Neighbors (kNN) classifier is one of the most effective methods in supervised learning problems. It classifies unseen cases comparing their similarity with the training data. Nevertheless, it gives to each labeled sample the same import... Read More about Exact fuzzy k-Nearest neighbor classification for big datasets.

A first attempt on global evolutionary undersampling for imbalanced big data (2017)
Conference Proceeding
Triguero, I., Galar, M., Bustince, H., & Herrera, F. (2017). A first attempt on global evolutionary undersampling for imbalanced big data.

The design of efficient big data learning models has become a common need in a great number of applications. The massive amounts of available data may hinder the use of traditional data mining techniques, especially when evolutionary algorithms are i... Read More about A first attempt on global evolutionary undersampling for imbalanced big data.

From Big data to Smart Data with the K-Nearest Neighbours algorithm (2016)
Conference Proceeding
Triguero, I., Maillo, J., Luengo, J., García, S., & Herrera, F. (2016). From Big data to Smart Data with the K-Nearest Neighbours algorithm.

The k-nearest neighbours algorithm is one of the most widely used data mining models because of its simplicity and accurate results. However, when it comes to deal with big datasets, with potentially noisy and missing information, this technique beco... Read More about From Big data to Smart Data with the K-Nearest Neighbours algorithm.

Evolutionary undersampling for extremely imbalanced big data classification under apache spark (2016)
Conference Proceeding
Triguero, I., Galar, M., Merino, D., Maillo, J., Bustince, H., & Herrera, F. (2016). Evolutionary undersampling for extremely imbalanced big data classification under apache spark.

The classification of datasets with a skewed class distribution is an important problem in data mining. Evolutionary undersampling of the majority class has proved to be a successful approach to tackle this issue. Such a challenging task may become e... Read More about Evolutionary undersampling for extremely imbalanced big data classification under apache spark.