Open Access Open Access  Restricted Access Subscription Access

Splog Detection Using Structural Similarity between Posts and URL Biasedness in Posts

Soo-Cheol Kim,
Su-Won Lee,
Kyoung-Jun Sung,
Sung Kwon Kim,

Abstract


Blogs are highly popular media of the Web 2.0. Spam blogs (splogs), however, interrupt normal information retrieval and waste network resources. Previous studies for detecting splogs are not always effective in coping with massively-generated splogs. In this paper, we propose a new method for detecting splogs. Our method aims to detect posts generated by machines. It is based on relational properties between posts. The key idea is that a splog has a structure similar in appearances to other posts and contains links that collectively direct to a specific target page or pages. Structural similarity between posts and URL biasedness in the posts of a blog will be used to decide whether or not the blog is a splog.

Keywords


Splog detection; HTML tags; Structural similarity; URL biasedness

Citation Format:
Soo-Cheol Kim, Su-Won Lee, Kyoung-Jun Sung, Sung Kwon Kim, "Splog Detection Using Structural Similarity between Posts and URL Biasedness in Posts," Journal of Internet Technology, vol. 13, no. 5 , pp. 767-772, Sep. 2012.

Full Text:

PDF

Refbacks

  • There are currently no refbacks.





Published by Executive Committee, Taiwan Academic Network, Ministry of Education, Taipei, Taiwan, R.O.C
JIT Editorial Office, Office of Library and Information Services, National Dong Hwa University
No. 1, Sec. 2, Da Hsueh Rd., Shoufeng, Hualien 974301, Taiwan, R.O.C.
Tel: +886-3-931-7314  E-mail: jit.editorial@gmail.com