![Open Access](https://jit.ndhu.edu.tw/lib/pkp/templates/images/icons/fulltext_open_medium.gif)
![Restricted Access](https://jit.ndhu.edu.tw/lib/pkp/templates/images/icons/fulltext_restricted_medium.gif)
A Novel Vector Representation Model for Text Mining Based on Enhancing Features
Abstract
Semantics relationships such as polysemy and synonymy etc. contained in text documents can be used to enhance the features in the vector representation. Previous feature representation methods rarely consider such semantic relations to improve mining precision, because taking semantic relations into account greatly depends on the learning process. In this paper, we present a novel document representation method which utilizes semantic relations to enhance the features. To obtain semantic relations, we use the external resource of Wikipedia. Moreover, a term-concept mapping structure is constructed to describe features. When concepts are extracted from document, Markov random walk model is used to handle the semantic relationships between the concepts and related concepts are voted to enhance the features. We conduct comprehensive experiment using 20-newsgroup corpus and Reuters corpus. Results show that the proposed method has good performance in text mining.
Keywords
Feature representation; Semantic relation; Semantic concept; Voting algorithm
Citation Format:
Heng Chen, Hai Jin, Feng Zhao, Han-Hua Chen, Fei Fang, "A Novel Vector Representation Model for Text Mining Based on Enhancing Features," Journal of Internet Technology, vol. 16, no. 3 , pp. 475-484, May. 2015.
Heng Chen, Hai Jin, Feng Zhao, Han-Hua Chen, Fei Fang, "A Novel Vector Representation Model for Text Mining Based on Enhancing Features," Journal of Internet Technology, vol. 16, no. 3 , pp. 475-484, May. 2015.
Refbacks
- There are currently no refbacks.
Published by Executive Committee, Taiwan Academic Network, Ministry of Education, Taipei, Taiwan, R.O.C
JIT Editorial Office, Office of Library and Information Services, National Dong Hwa University
No. 1, Sec. 2, Da Hsueh Rd., Shoufeng, Hualien 974301, Taiwan, R.O.C.
Tel: +886-3-931-7314 E-mail: jit.editorial@gmail.com