Publicación:
A deep learning approach for sentiment analysis in Spanish Tweets

dc.contributor.author Vizcarra G. es_PE
dc.contributor.author Mauricio A. es_PE
dc.contributor.author Mauricio L. es_PE
dc.date.accessioned 2024-05-30T23:13:38Z
dc.date.available 2024-05-30T23:13:38Z
dc.date.issued 2018
dc.description The present work was supported by grant 234-2015-FONDECYT (Master Program) from Cienciactiva of the National Council for Science, Technology and Technological Innovation (CONCYTEC-PERU) and the Office Research of Universidad Nacional de Ingeniería (VRI - UNI).
dc.description.abstract Sentiment Analysis at Document Level is a well-known problem in Natural Language Processing (NLP), being considered as a reference in NLP, over which new architectures and models are tested in order to compare metrics that are also referents in other issues. This problem has been solved in good enough terms for English language, but its metrics are still quite low in other languages. In addition, architectures which are successful in a language do not necessarily works in another. In the case of Spanish, data quantity and quality become a problem during data preparation and architecture design, due to the few labeled data available including not-textual elements (like emoticons or expressions). This work presents an approach to solve the sentiment analysis problem in Spanish tweets and compares it with the state of art. To do so, a preprocessing algorithm is performed based on interpretation of colloquial expressions and emoticons, and trivial words elimination. Processed sentences turn into matrices using the 3 most successful methods of word embeddings (GloVe, FastText and Word2Vec), then the 3 matrices merge into a 3-channels matrix which is used to feed our CNN-based model. The proposed architecture uses parallel convolution layers as k-grams, by this way the value of each word and their contexts are weighted, to predict the sentiment polarity among 4 possible classes. After several tests, the optimal tuple which improves the accuracy were <1, 2>. Finally, our model presents %61.58 and %71.14 of accuracy in InterTASS and General Corpus respectively.
dc.description.sponsorship Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica - Concytec
dc.identifier.doi https://doi.org/10.1007/978-3-030-01424-7_61
dc.identifier.isbn 9783030014230
dc.identifier.scopus 2-s2.0-85054825905
dc.identifier.uri https://hdl.handle.net/20.500.12390/485
dc.language.iso eng
dc.publisher Springer Verlag
dc.relation.ispartof Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.rights info:eu-repo/semantics/openAccess
dc.subject Spanish tweets
dc.subject Convolution es_PE
dc.subject Data mining es_PE
dc.subject Learning algorithms es_PE
dc.subject Matrix algebra es_PE
dc.subject Natural language processing systems es_PE
dc.subject Network architecture es_PE
dc.subject Neural networks es_PE
dc.subject Sentiment analysis es_PE
dc.subject Architecture designs es_PE
dc.subject Architectures and models es_PE
dc.subject Convolutional Neural Networks (CNN) es_PE
dc.subject English languages es_PE
dc.subject Learning approach es_PE
dc.subject Pre-processing algorithms es_PE
dc.subject Deep learning es_PE
dc.subject.ocde https://purl.org/pe-repo/ocde/ford#1.02.01
dc.title A deep learning approach for sentiment analysis in Spanish Tweets
dc.type info:eu-repo/semantics/conferenceObject
dspace.entity.type Publication
oairecerif.author.affiliation #PLACEHOLDER_PARENT_METADATA_VALUE#
oairecerif.author.affiliation #PLACEHOLDER_PARENT_METADATA_VALUE#
Archivos