EmoCNN: Encoding Emotional Expression from Text to Word Vector and Classifying Emotions—A Case Study in Thai Social Network Conversation

Authors

  • Konlakorn Wongpatikaseree Mahidol University
  • Yongyos Kaewpitakkun PORDEEKUM.AI
  • Sumeth Yuenyong Mahidol University
  • Siriwon Matsuo Thammasat University
  • Panida Yomaboot Mahidol University

DOI:

https://doi.org/10.4186/ej.2021.25.7.73

Keywords:

emotion classification, sentiment analysis, word embedding

Abstract

We present EmoCNN, a collection of specially-trained word embedding layer and convolutional neural network model for the classification of conversational texts into 4 types of emotion. This model is part of a chatbot for depression evaluation. The difficulty in classifying emotion from conversational text is that most word embeddings are trained with emotionally-neutral corpus such as Wikipedia or news articles, where emotional words do not appear very often or at all, and the language style is formal writing. We trained a new word embedding based on the word2vec architecture in an unsupervised manner and then fine-tuned it on soft-labelled data. The data was obtained from mining Twitter using emotion keywords. We show that this emotion word embedding can differentiate between words which have the same polarity and words which have opposite polarity, as well as find similar words with the same polarity, while the standard word embedding cannot. We then used this new embedding as the first layer of EmoCNN that classifies conversational text into the 4 emotions. EmoCNN achieved macro-averaged f1-score of 0.76 over the test set. We compared EmoCNN against three different models: a shallow fully-connected neural network, fine-tuning RoBERTa, and ULMFit. These got the best macro-averaged f1-score of 0.5556, 0.6402 and 0.7386 respectively.

Downloads

Download data is not yet available.

Author Biographies

Konlakorn Wongpatikaseree

Department of Computer Engineering, Faculty of Engineering, Mahidol University, 25/25 Salaya, Phuttamonthon, Nakhon Pathom 73170, Thailand

Yongyos Kaewpitakkun

PORDEEKUM.AI, 8/18 Pave Ramintra-wongwoen Village, Kanchanapisek Rd., O-nguen, Saimai, Bangkok 10220, Thailand

Sumeth Yuenyong

Department of Computer Engineering, Faculty of Engineering, Mahidol University, 25/25 Salaya, Phuttamonthon, Nakhon Pathom 73170, Thailand

Siriwon Matsuo

Sirindhorn International Institute of Technology, Thammasat University, 99 Moo 18, Km. 41 on Paholyothin Highway Khlong Luang, Pathum Thani 12120, Thailand

Panida Yomaboot

Department of Psychiatry, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Wanglang Road, Siriraj Sub-district, Bangkok-Noi District, Bangkok 10700, Thailand

Downloads

Published In
Vol 25 No 7, Jul 31, 2021
How to Cite
[1]
K. Wongpatikaseree, Y. Kaewpitakkun, S. Yuenyong, S. Matsuo, and P. Yomaboot, “EmoCNN: Encoding Emotional Expression from Text to Word Vector and Classifying Emotions—A Case Study in Thai Social Network Conversation”, Eng. J., vol. 25, no. 7, pp. 73-82, Jul. 2021.