Improving the Performance of Wikipedia Based on the Entry Relationship between Articles

Lin-Chih Chen,

Abstract


Wikipedia is the largest online encyclopedia in the world. It is free to access by anyone and its main advantage is that it can also be edited by any person at any time. On the one hand, this caused a rapid growth to its number of available articles and languages. It is likely to cause that most users are difficult to differentiate various synonymy and polysemy terms from the millions of articles in Wikipedia. On the other hand, traditional semantic analysis models are mainly focus on to deal with the semantic relationships between terms, or terms and documents. However, these models are lacking to deal with the semantic relationships between documents.
In this paper, to enhance the semantic relationships between documents, we use the entry relationship between any two Wikipedia articles to design our Latent Entry Analysis (LEA) model. The advantages of LEA have the following several aspects: (1) it can effectively deal with the problems of synonymy and polysemy; (2) it is a good model to find the semantic relationships between terms, terms and documents, or documents; (3) it is a good model with a high-performance and low-cost compared to other semantic analysis models; (4) it is a suitable model to effectively handle big data sets in Wikipedia.


Citation Format:
Lin-Chih Chen, "Improving the Performance of Wikipedia Based on the Entry Relationship between Articles," Journal of Internet Technology, vol. 19, no. 3 , pp. 711-723, May. 2018.

Full Text:

PDF

Refbacks

  • There are currently no refbacks.





Published by Executive Committee, Taiwan Academic Network, Ministry of Education, Taipei, Taiwan, R.O.C
JIT Editorial Office, Office of Library and Information Services, National Dong Hwa University
No. 1, Sec. 2, Da Hsueh Rd., Shoufeng, Hualien 974301, Taiwan, R.O.C.
Tel: +886-3-931-7314  E-mail: jit.editorial@gmail.com