A Novel Judge Mechanism to Enhance the Performance of Google Blog Search

Lin-Chih Chen,

Abstract


In recent years, the online blogging community is growing bigger as the social network service. When it is growing, the blog posts are increasing day by day. Generally speaking, people were using the blog search engines to search and recommend potentially interesting blog posts. When people search from the blog search engines, they were faced with two major problems: synonymy (two different terms with the same meaning) and polysemy (a term with different meanings). In this paper, we use two semantic analysis methods, Latent Semantic Indexing (LSI) and Probabilistic LSI (PLSI), to solve these two problems. LSI uses singular value decomposition as the fundamental method to capture the synonymous relationship between terms. PLSI uses the Expectation-Maximization algorithm for parameter estimation to additionally deal with the problem of polysemy. Although PLSI can gracefully deal with these two semantic problems, it needs a huge computing time. To solve the problem of computing time, in this paper, we propose a novel termination mechanism to dynamically determine the required number of iterations for PLSI. According to the experiment results, the result derived from our mechanism can not only deal with these two semantic problems but also reach a cost-effective solution.


Citation Format:
Lin-Chih Chen, "A Novel Judge Mechanism to Enhance the Performance of Google Blog Search," Journal of Internet Technology, vol. 19, no. 4 , pp. 981-993, Jul. 2018.

Full Text:

PDF

Refbacks

  • There are currently no refbacks.





Published by Executive Committee, Taiwan Academic Network, Ministry of Education, Taipei, Taiwan, R.O.C
JIT Editorial Office, Office of Library and Information Services, National Dong Hwa University
No. 1, Sec. 2, Da Hsueh Rd., Shoufeng, Hualien 974301, Taiwan, R.O.C.
Tel: +886-3-931-7314  E-mail: jit.editorial@gmail.com