Optimizing Cross Domain Sentiment Analysis Using Hidden Markov Continual Progression

P. Manivannan,
C. S. Kanimozhi Selvi,

Abstract


With the rapid increase in internet users and customer reviews playing major role in social media gave rise to sentiment analysis. Pre-processing of input text during sentiment analysis eliminates incomplete and noisy data. Typically, sentiment is manifested separately and applying pre-processing model for optimizing cross-domain sentiment classification is highly required. In this paper, a method called Hidden Markov Continual Progression Cosine Similar (HM-CPCS) is proposed to explore the impact of pre-processing and optimize sentiment analysis. First, a measure of subsequent and antecedent probabilities of tags is made using HM-POS Tagger for the given input dataset. Subsequent and antecedent probabilities of tags are obtained by measuring the transition probabilities between states and observations ensuring feature extraction accuracy. Next, the Continual Progression Stemmer continuously stems the text by adding prefix and suffix to form structured words for the given shortcuts and therefore reduce Error Rate Relative to Truncation (ERRT). Finally a Cosine Similarity function is applied to remove stop word for cross-domain sentiment analysis and classification. Experimental analysis shows that HM-CPCS method is able to reduce the time to extract the opinions from reviewers by 46% and improve the accuracy by 9% compared to the state-of-the-art works.


Citation Format:
P. Manivannan, C. S. Kanimozhi Selvi, "Optimizing Cross Domain Sentiment Analysis Using Hidden Markov Continual Progression," Journal of Internet Technology, vol. 20, no. 3 , pp. 781-788, May. 2019.

Full Text:

PDF

Refbacks

  • There are currently no refbacks.





Published by Executive Committee, Taiwan Academic Network, Ministry of Education, Taipei, Taiwan, R.O.C
JIT Editorial Office, Library and Information Center, National Dong Hwa University
No. 1, Sec. 2, Da Hsueh Rd. Shoufeng, Hualien 97401, Taiwan, R.O.C.
Tel: +886-3-931-7017  E-mail: jit.editorial@gmail.com