Publicación:
On the relevance of the metadata used in the semantic segmentation of indoor image spaces

No hay miniatura disponible
Fecha
2021
Autores
Vasquez-Espinoza L.
Castillo-Cara M.
Orozco-Barbosa L.
Título de la revista
Revista ISSN
Título del volumen
Editor
Elsevier Ltd
Proyectos de investigación
Unidades organizativas
Número de la revista
Abstracto
The study of artificial learning processes in the area of computer vision context has mainly focused on achieving a fixed output target rather than on identifying the underlying processes as a means to develop solutions capable of performing as good as or better than the human brain. This work reviews the well-known segmentation efforts in computer vision. However, our primary focus is on the quantitative evaluation of the amount of contextual information provided to the neural network. In particular, the information used to mimic the tacit information that a human is capable of using, like a sense of unambiguous order and the capability of improving its estimation by complementing already learned information. Our results show that, after a set of pre and post-processing methods applied to both the training data and the neural network architecture, the predictions made were drastically closer to the expected output in comparison to the cases where no contextual additions were provided. Our results provide evidence that learning systems strongly rely on contextual information for the identification task process. © 2021 The Author(s)
Descripción
This work has been partially funded by the Spanish Ministry of Sci-ence, Education and Universities, the European Regional DevelopmentFund and the State Research Agency [grant number RTI2018-098156-B-C52], and by FONDECYT / World Bank [grant number 026-2019FONDECYT-BM-INC.INV].
Palabras clave
U-net, Deep learning, Fully convolutional network, Indoor scenes, Metadata preprocessing, Semantic segmentation
Citación