The Peruvian Amazon forestry dataset: A leaf image classification corpus

No hay miniatura disponible
Alfaro-Cordova E.
Alfaro-Shigueto J.
Cañedo-Apolaya R.M.
Mangel J.C.
Ortiz-Alvarez C.
Ramirez J.L.
Siccha-Ramirez R.
Velez-Zuazo X.
Yamashiro C.
Título de la revista
Revista ISSN
Título del volumen
Elsevier B.V.
Proyectos de investigación
Unidades organizativas
Número de la revista
Forest census allows getting precise data for logging planning and elaboration of the forest management plan. Species identification blunders carry inadequate forest management plans and high risks inside forest concessions. Hence, an identification protocol prevents the exploitation of non-commercial or endangered timber species. The current Peruvian legislation allows the incorporation of non-technical experts, called “materos”, during the identification. Materos use common names given by the folklore and traditions of their communities instead of formal ones, which generally lead to misclassifications. In the real world, logging companies hire materos instead of botanists due to cost/time limitations. Given such a motivation, we explore an end-to-end software solution to automatize the species identification. This paper introduces the Peruvian Amazon Forestry Dataset, which includes 59,441 leaves samples from ten of the most profitable and endangered timber-tree species. The proposal contemplates a background removal algorithm to feed a pre-trained CNN by the ImageNet dataset. We evaluate the quantitative (accuracy metric) and qualitative (visual interpretation) impacts of each stage by ablation experiments. The results show a 96.64% training accuracy and 96.52% testing accuracy on the VGG-19 model. Furthermore, the visual interpretation of the model evidences that leaf venations have the highest correlation in the plant recognition task. © 2021
Palabras clave
Visual interpretation