Learning Discriminative Sentiment Chunk Vectors for Twitter Sentiment Analysis

Leiming Yan; Wenying Zheng; Huajie (Harry) Zhang; Hao Tao; Ming He

Learning Discriminative Sentiment Chunk Vectors for Twitter Sentiment Analysis

Leiming Yan,
Wenying Zheng,
Huajie (Harry) Zhang,
Hao Tao,
Ming He,

Abstract

Due to the informal and freely constructed sentence structures, it is a difficult classification task to detect the sentiment polarity of tweets, especially for multi-class cases. Extracting features with more valuable information from tweets is crucial for sentiment analysis. In this paper, to address this problem, a hybrid feature space combining bag-of-words and word embedding, named as Discriminative Sentiment Chunk (DSC) vector, is proposed. Then a semi-supervised method is proposed based on Autoencoder technique to learn discriminative sentiment chunk vectors, which convert a high dimensional bag-of-words vector into a continuous vector space with lower dimension without losing the chunk order. Our experimental results show that using discriminative sentiment chunks gains better accuracies and F1 scores on different twitter datasets and outperforms some popular bag-of-words oriented methods and a few deep network approaches

Keywords

Bag-of-words; Word embedding; Sentiment analysis; Deep learning

Citation Format:
Leiming Yan, Wenying Zheng, Huajie (Harry) Zhang, Hao Tao, Ming He, "Learning Discriminative Sentiment Chunk Vectors for Twitter Sentiment Analysis," Journal of Internet Technology, vol. 18, no. 7 , pp. 1605-1613, Dec. 2017.

Full Text:

PDF

Refbacks

There are currently no refbacks.

Published by Executive Committee, Taiwan Academic Network, Ministry of Education, Taipei, Taiwan, R.O.C
JIT Editorial Office, Office of Library and Information Services, National Dong Hwa University
No. 1, Sec. 2, Da Hsueh Rd., Shoufeng, Hualien 974301, Taiwan, R.O.C.
Tel: +886-3-931-7314　　E-mail: jit.editorial@gmail.com

Username
Password
Remember me