Open Access Open Access  Restricted Access Subscription Access

Resource Construction and Ensemble Learning based Sentiment Analysis for the Low-resource Language Uyghur

Azragul Yusup,
Degang Chen,
Yifei Ge,
Hongliang Mao,
Nujian Wang,

Abstract


To address the problem of scarce low-resource sentiment analysis corpus nowadays, this paper proposes a sentence-level sentiment analysis resource conversion method HTL based on the syntactic-semantic knowledge of the low-resource language Uyghur to convert high-resource corpus to low-resource corpus. In the conversion process, a k-fold cross-filtering method is proposed to reduce the distortion of data samples, which is used to select high-quality samples for conversion; finally, the Uyghur sentiment analysis dataset USD is constructed; the Baseline of this dataset is verified under the LSTM model, and the accuracy and F1 values reach 81.07% and 81.13%, respectively, which can provide a reference for the construction of low-resource language corpus nowadays. The accuracy and F1 values reached 81.07% and 81.13%, respectively, which can provide a reference for the construction of today's low-resource corpus. Meanwhile, this paper also proposes a sentiment analysis model based on logistic regression ensemble learning, SA-LREL, which combines the advantages of several lightweight network models such as TextCNN, RNN, and RCNN as the base model, and the meta-model is constructed using logistic regression functions for ensemble, and the accuracy and F1 values reach 82.17% and 81.86% respectively in the test set, and the experimental results show that the method can effectively improve the performance of Uyghur sentiment analysis task.

Keywords


Low-resource language, Uyghur, HTL, Stacking ensemble learning, Sentiment analysis

Citation Format:
Azragul Yusup, Degang Chen, Yifei Ge, Hongliang Mao, Nujian Wang, "Resource Construction and Ensemble Learning based Sentiment Analysis for the Low-resource Language Uyghur," Journal of Internet Technology, vol. 24, no. 4 , pp. 1009-1016, Jul. 2023.

Full Text:

PDF

Refbacks

  • There are currently no refbacks.





Published by Executive Committee, Taiwan Academic Network, Ministry of Education, Taipei, Taiwan, R.O.C
JIT Editorial Office, Office of Library and Information Services, National Dong Hwa University
No. 1, Sec. 2, Da Hsueh Rd., Shoufeng, Hualien 974301, Taiwan, R.O.C.
Tel: +886-3-931-7314  E-mail: jit.editorial@gmail.com