Publication:
Emocnn: Encoding emotional expression from text to word vector and classifying emotions—a case study in thai social network conversation

dc.contributor.authorKonlakorn Wongpatikasereeen_US
dc.contributor.authorYongyos Kaewpitakkunen_US
dc.contributor.authorSumeth Yuenyongen_US
dc.contributor.authorSiriwon Matsuoen_US
dc.contributor.authorPanida Yomabooten_US
dc.contributor.otherSiriraj Hospitalen_US
dc.contributor.otherMahidol Universityen_US
dc.contributor.otherSirindhorn International Institute of Technology, Thammasat Universityen_US
dc.contributor.otherPORDEEKUM.AIen_US
dc.date.accessioned2022-08-04T08:38:20Z
dc.date.available2022-08-04T08:38:20Z
dc.date.issued2021-01-01en_US
dc.description.abstractWe present EmoCNN, a collection of specially-trained word embedding layer and convolutional neural network model for the classification of conversational texts into 4 types of emotion. This model is part of a chatbot for depression evaluation. The difficulty in classifying emotion from conversational text is that most word embeddings are trained with emotionally-neutral corpus such as Wikipedia or news articles, where emotional words do not appear very often or at all, and the language style is formal writing. We trained a new word embedding based on the word2vec architecture in an unsupervised manner and then fine-tuned it on soft-labelled data. The data was obtained from mining Twitter using emotion keywords. We show that this emotion word embedding can differentiate between words which have the same polarity and words which have opposite polarity, as well as find similar words with the same polarity, while the standard word embedding cannot. We then used this new embedding as the first layer of EmoCNN that classifies conversational text into the 4 emotions. EmoCNN achieved macro-averaged f1-score of 0.76 over the test set. We compared EmoCNN against three different models: a shallow fully-connected neural network, fine-tuning RoBERTa, and ULMFit. These got the best macro-averaged f1-score of 0.5556, 0.6402 and 0.7386 respectively.en_US
dc.identifier.citationEngineering Journal. Vol.25, No.7 (2021), 73-82en_US
dc.identifier.doi10.4186/ej.2021.25.7.73en_US
dc.identifier.issn01258281en_US
dc.identifier.other2-s2.0-85112480821en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/76979
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85112480821&origin=inwarden_US
dc.subjectEngineeringen_US
dc.titleEmocnn: Encoding emotional expression from text to word vector and classifying emotions—a case study in thai social network conversationen_US
dc.typeArticleen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85112480821&origin=inwarden_US

Files

Collections