Publicación:
WordNet-SHP: Towards the building of a lexical database for a Peruvian minority language

dc.contributor.author Maguiño-Valencia D. es_PE
dc.contributor.author Oncevay-Marcos A. es_PE
dc.contributor.author Sobrevilla Cabezudo M.A. es_PE
dc.date.accessioned 2024-05-30T23:13:38Z
dc.date.available 2024-05-30T23:13:38Z
dc.date.issued 2019
dc.description.abstract WordNet-like resources are lexical databases with highly relevance information and data which could be exploited in more complex computational linguistics research and applications. The building process requires manual and automatic tasks, that could be more arduous if the language is a minority one with fewer digital resources. This study focuses in the construction of an initial WordNetdatabase for a low-resourced and indigenous language in Peru: Shipibo-Konibo (shp). First, the stages of development from a scarce scenario (a bilingual dictionary shp-es) are described. Then, it is proposed a synset alignment method by comparing the definition glosses in the dictionary (written in Spanish) with the content of a Spanish WordNet. In this sense, word2vec similarity was the chosen metric for the proximity measure. Finally, an evaluation process is performed for the synsets, using a manually annotated Gold Standard inShipibo-Konibo. The obtained results are promising, and this resource is expected to serve well in further applications, such as word sense disambiguation and even machine translation in the shp-es language pair.
dc.description.sponsorship Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica - Concytec
dc.identifier.isbn urn:isbn:9791095546009
dc.identifier.scopus 2-s2.0-85059915834
dc.identifier.uri https://hdl.handle.net/20.500.12390/819
dc.language.iso eng
dc.publisher European Language Resources Association (ELRA)
dc.relation.ispartof LREC 2018 - 11th International Conference on Language Resources and Evaluation
dc.rights info:eu-repo/semantics/openAccess
dc.subject Wordnet
dc.subject Computational linguistics es_PE
dc.subject Database systems es_PE
dc.subject Natural language processing systems es_PE
dc.subject Ships es_PE
dc.subject Bilingual dictionary es_PE
dc.subject Digital resources es_PE
dc.subject Lexical database es_PE
dc.subject Machine translations es_PE
dc.subject Minority languages es_PE
dc.subject Research and application es_PE
dc.subject Word Sense Disambiguation es_PE
dc.subject Ontology es_PE
dc.subject.ocde https://purl.org/pe-repo/ocde/ford#6.02.06
dc.title WordNet-SHP: Towards the building of a lexical database for a Peruvian minority language
dc.type info:eu-repo/semantics/conferenceObject
dspace.entity.type Publication
Archivos