ISS Library

A MACHINE LEARNING APPROACH TO SERVER-SIDE ANTI-SPAM E-MAIL FILTERING

Mashechkin I. and Petrovskiy M. and Rozinkin A. and Gerasimov S. (2006) A MACHINE LEARNING APPROACH TO SERVER-SIDE ANTI-SPAM E-MAIL FILTERING. In: УкрПРОГ, 23-25 травня 2006 р., м. Київ, Україна.

[img]MS Word
271Kb

Abstract

Spam-detection systems based on traditional methods have several obvious disadvantages like low detection rate, necessity of regular knowledge bases’ updates, impersonal filtering rules. New intelligent methods for spam detection, which use statistical and machine learning algorithms, solve these problems successfully. But these methods are not widespread in spam filtering for enterprise-level mail servers, because of their high resources consumption and insufficient accuracy regarding false-positive errors. The developed solution offers precise and fast algorithm. Its classification quality is better than the quality of Naïve-Bayes method that is the most widespread machine learning method now. The problem of time efficiency that is typical for all learning based methods for spam filtering is solved using multi-agent architecture. It allows easy system scaling and building unified corporate spam detection system based on heterogeneous enterprise mail systems. Pilot program implementation and its experimental evaluation for standard data sets and for real mail flows have demonstrated that our approach outperforms existing learning and traditional spam filtering methods. That allows considering it as a promising platform for constructing enterprise spam filtering systems.

Item Type:Conference or Workshop Item (Paper)
Subjects:J. Computer Applications
ID Code:62
Deposited By:G.U. Volkova
Deposited On:12 Mar 2007 16:48
Last Modified:12 Mar 2007 16:48

Repository Staff Only: item control page