Deep neural networks are currently the main tool used in artificial intelligence to handle complex problems, because of their power and versatility. These networks depend on having enough data from which to learn to identify patterns, through a mathematical process that adjusts a set of parameters. For images, for example, they reduce the input to simpler representations so that objects, textures, unusual elements and so on can be detected. This process is called training. In that way, when new images are presented to a trained network, it compares them with the characteristics learned during training to identify which ones are most similar or what differences are present.
The performance of deep neural networks thus depends on their capacity to merge data in a suitable way, so as to identify the most relevant characteristics. In convolutional neural networks this fusion is usually done with two standard processes, convolution and pooling, which always use the same information merging mechanisms. To improve the results of the network, traditionally the number of parameters is increased, which makes those networks slow and costly to use.
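To make the contrast concrete, the two standard merging mechanisms can be sketched in a few lines. The following Python fragment is an illustrative sketch with function names of our own choosing, not code from the project; it shows that convolution always merges a window as a sum of products, and pooling always merges it as a maximum (or a mean):

    import numpy as np

    def conv2d_valid(image, kernel):
        # Convolution: the merge is always a sum of products.
        kh, kw = kernel.shape
        h, w = image.shape[0] - kh + 1, image.shape[1] - kw + 1
        out = np.empty((h, w))
        for i in range(h):
            for j in range(w):
                out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
        return out

    def max_pool2d(image, size=2):
        # Pooling: the merge is always the maximum of the window.
        h, w = image.shape[0] // size, image.shape[1] // size
        out = np.empty((h, w))
        for i in range(h):
            for j in range(w):
                out[i, j] = image[i * size:(i + 1) * size,
                                  j * size:(j + 1) * size].max()
        return out

Whatever the data look like, these two operators merge them in exactly the same fixed way; the only thing training adjusts is the kernel values.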
Because of that, in this project we set out to improve the information merging processes inside the network, so that its performance could be improved at a lower cost. To do that, we studied new information merging mechanisms to build a set of functions, beyond the typical ones, that can be adapted to specific problems depending on the needs of users, while trying not to increase the cost. In other words, we tried to replace brute-force guesswork with a more refined method that is aware of the data being merged. Specifically, the neural networks developed are intended to respond to new challenges in the industrial sector, in particular improving prediction for unknown data, or anomalies. An anomaly is an event that is not part of the system's past, that is, an event that cannot be found in the system's history of data. In an industrial setting, early detection of problems that, for example, make devices unavailable has a high impact on production, and can also lead to significant savings in maintenance costs. Anomalous events are identified with the goal of predicting them early enough, and with enough confidence, to plan interventions that incur lower costs.
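To make that definition operational, a minimal baseline (a sketch of the general idea only, not one of the project's neural detectors) flags a point of a univariate series as anomalous when its own recent past fails to explain it:

    import numpy as np

    def detect_anomalies(series, window=24, k=3.0):
        # A point is anomalous when it deviates from what its own
        # history predicts by more than k standard deviations
        # (the window size and k are illustrative choices).
        series = np.asarray(series, dtype=float)
        flags = np.zeros(len(series), dtype=bool)
        for t in range(window, len(series)):
            past = series[t - window:t]            # the system's history
            error = abs(series[t] - past.mean())   # surprise w.r.t. the past
            flags[t] = error > k * past.std()
        return flags

The neural networks studied in the project play the role of a far better predictor of the past than the rolling mean used here, but the detection principle is the same.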
To those ends, the main goals of the project were:
– Improve the applicability and interpretability of deep neural networks with the theoretical development of new information merging mechanisms to be applied in the convolution and pooling phases
– Analyse the potential of deep neural networks for detecting anomalies in univariate time series, and adapt them for AI inference on edge devices
The results were positive. Specifically, it has been shown that pooling processes can indeed be improved if functions are used that are aware of the relationships between the data, via suitable metrics. That has opened the door to developing neural network models that adapt specifically to the kind of anomalies under study, as well as to other image processing problems.
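The project's exact functions are not reproduced here, but the idea admits a minimal sketch (again with names of our own choosing): a pooling operator in which each value of the window is weighted by how strongly the other values support it under a chosen metric, so the merge is aware of the relationships between the data:

    import numpy as np

    def relation_aware_pool(window, scale=1.0):
        # Weight each value by its closeness to the others (a Gaussian
        # kernel on pairwise distances), then take the weighted average:
        # values backed by their neighbours dominate, isolated outliers
        # are damped. `scale` controls how local "closeness" is.
        v = window.ravel()
        dists = np.abs(v[:, None] - v[None, :])
        weights = np.exp(-(dists / scale) ** 2).sum(axis=1)
        return np.sum(weights * v) / weights.sum()

    def pool2d(image, size=2, pool=relation_aware_pool):
        # The same sliding-window scheme as standard pooling, but the
        # merge function is now a parameter adaptable to the problem.
        h, w = image.shape[0] // size, image.shape[1] // size
        out = np.empty((h, w))
        for i in range(h):
            for j in range(w):
                out[i, j] = pool(image[i * size:(i + 1) * size,
                                       j * size:(j + 1) * size])
        return out

Choosing the metric (here, absolute difference) and the kernel is precisely where knowledge about the kind of anomaly, or about the image problem at hand, can be injected.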
As for convolution, it strongly depends on how the hardware (the processors) of current computers is designed. Although it is theoretically possible to improve the results by considering more general merge functions, in practice those changes cannot be implemented without a change in the physical characteristics of the machines. That kind of modification is beyond the control of the companies involved, so it is not feasible.
Nevertheless, it is important to highlight that the new information merging mechanisms have been shown to extend beyond the convolutional neural networks initially considered, and can improve any kind of architecture. That is very significant, because neural networks evolve at high speed and new, more powerful architectures appear very quickly. The results of our study make it possible for companies to apply them to these new models as well.
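In practice, this means such a pooling can be dropped into an existing model without touching the rest of the architecture. The following PyTorch sketch is hedged: RelationAwarePool2d and swap_pooling are hypothetical names, and square, non-overlapping windows with an integer kernel size are assumed.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class RelationAwarePool2d(nn.Module):
        # Drop-in stand-in for nn.MaxPool2d using a data-aware merge.
        def __init__(self, kernel_size):
            super().__init__()
            self.k = int(kernel_size)  # assumes an integer kernel size

        def forward(self, x):                             # x: (N, C, H, W)
            n, c = x.shape[:2]
            p = F.unfold(x, self.k, stride=self.k).view(n, c, self.k * self.k, -1)
            d = (p.unsqueeze(2) - p.unsqueeze(3)).abs()   # pairwise distances per window
            w = torch.exp(-d ** 2).sum(dim=3)             # support each value receives
            out = (w * p).sum(dim=2) / w.sum(dim=2)       # relation-aware merge
            return out.view(n, c, x.shape[2] // self.k, -1)

    def swap_pooling(model):
        # Recursively replace every max-pooling layer, whatever the architecture.
        for name, child in model.named_children():
            if isinstance(child, nn.MaxPool2d):
                setattr(model, name, RelationAwarePool2d(child.kernel_size))
            else:
                swap_pooling(child)
        return model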
On the other hand, with the goal of defining a detection product valid for signals of different complexity (periodic, nearly periodic, aperiodic), the performance of several architectures was studied, as well as how to embed them in different hardware. That analysis showed that the simplest networks are capable of detecting the most complex univariate time series characteristics, for example the price of electricity on the Spanish daily market. That means hardware with a lower processing capacity can be considered and, consequently, lower-cost detection devices can be built to perform the inference.
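To give a sense of the scale this enables, the following sketch (the layer sizes are illustrative and not the architectures benchmarked in the project) is a one-dimensional convolutional forecaster with only about a hundred parameters; anomalies would then be flagged where its forecast error is large, as in the baseline above:

    import torch
    import torch.nn as nn

    class TinyForecaster(nn.Module):
        # Reads the last `window` values of a univariate series and
        # predicts the next one; 113 parameters with these sizes.
        def __init__(self, hidden=16):
            super().__init__()
            self.net = nn.Sequential(
                nn.Conv1d(1, hidden, kernel_size=5, padding=2),
                nn.ReLU(),
                nn.AdaptiveAvgPool1d(1),   # tolerates varying window lengths
                nn.Flatten(),
                nn.Linear(hidden, 1),
            )

        def forward(self, x):              # x: (batch, 1, window)
            return self.net(x)             # (batch, 1), the next value

    model = TinyForecaster()
    print(sum(p.numel() for p in model.parameters()))  # 113

A network this small can be quantised and exported to embedded runtimes, which is what makes low-cost detection devices for inference plausible.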