Publicación:
Ship-lemmatagger: Building an nlp toolkit for a peruvian native language

dc.contributor.author Pereira-Noriega J. es_PE
dc.contributor.author Mercado-Gonzales R. es_PE
dc.contributor.author Melgar A. es_PE
dc.contributor.author Sobrevilla-Cabezudo M. es_PE
dc.contributor.author Oncevay-Marcos A. es_PE
dc.date.accessioned 2024-05-30T23:13:38Z
dc.date.available 2024-05-30T23:13:38Z
dc.date.issued 2017
dc.description.abstract Natural Language Processing deals with the understanding and generation of texts through computer programs. There are many different functionalities used in this area, but among them there are some functions that are the support of the remaining ones. These methods are related to the core processing of the morphology of the language (such as lemmatization) and automatic identification of the part-of-speech tag. Thereby, this paper describes the implementation of a basic NLP toolkit for a new language, focusing in the features mentioned before, and testing them in an own corpus built for the occasion. The obtained results exceeded the expected results and could be used for more complex tasks such as machine translation.
dc.description.sponsorship Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica - Concytec
dc.identifier.doi https://doi.org/10.1007/978-3-319-64206-2_53
dc.identifier.isbn urn:isbn:9783319642055
dc.identifier.scopus 2-s2.0-85028645758
dc.identifier.uri https://hdl.handle.net/20.500.12390/773
dc.language.iso eng
dc.publisher Springer Verlag
dc.relation.ispartof Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.rights info:eu-repo/semantics/openAccess
dc.subject Text processing
dc.subject Automation es_PE
dc.subject Computational linguistics es_PE
dc.subject Ships es_PE
dc.subject Automatic identification es_PE
dc.subject Core processing es_PE
dc.subject Lemmatization es_PE
dc.subject Low resource languages es_PE
dc.subject Machine translations es_PE
dc.subject Native language es_PE
dc.subject Part of speech tagging es_PE
dc.subject Shipibo-konibo es_PE
dc.subject Natural language processing systems es_PE
dc.subject.ocde https://purl.org/pe-repo/ocde/ford#2.00.00
dc.title Ship-lemmatagger: Building an nlp toolkit for a peruvian native language
dc.type info:eu-repo/semantics/conferenceObject
dspace.entity.type Publication
Archivos