Open Access Open Access  Restricted Access Subscription Access

Generating Behavior-based Classification Rules for Spam Filtering Using Enhanced Induction Trees

Chih-Hung Wu,
Chi-Yuan Yeh,
Chih-Chin Lai,

Abstract


We present in this paper a novel featuring method for rule-based spam filtering. Instead of classifying emails according to keywords, this study analyzes the spamming behaviors and extracts the representative ones as features for describing the characteristics of emails. An enhanced decision tree algorithm with weighted information gain is proposed, which builds decision trees by considering the importance of behavior-based features revealed from emails. Since spamming behaviors are infrequently changed, compared with the changing frequency of keywords used in spams, behavior-based features are more robust with respect to the change of time; so that the behavior-based filtering rules outperforms keyword-based filtering ones. The experimental results indicate that our method is more useful in distinguishing spam emails than that of keyword-based comparison.

Keywords


Spam Mail; Decision Trees; Induction; Rule-based Classification

Citation Format:
Chih-Hung Wu, Chi-Yuan Yeh, Chih-Chin Lai, "Generating Behavior-based Classification Rules for Spam Filtering Using Enhanced Induction Trees," Journal of Internet Technology, vol. 7, no. 4 , pp. 387-398, Oct. 2006.

Full Text:

PDF

Refbacks

  • There are currently no refbacks.





Published by Executive Committee, Taiwan Academic Network, Ministry of Education, Taipei, Taiwan, R.O.C
JIT Editorial Office, Office of Library and Information Services, National Dong Hwa University
No. 1, Sec. 2, Da Hsueh Rd., Shoufeng, Hualien 974301, Taiwan, R.O.C.
Tel: +886-3-931-7314  E-mail: jit.editorial@gmail.com