Version française
Home     About     Download     Resources     Contact us    

This site is updated infrequently. For up-to-date information, please visit the new OCaml website at

Browse thread
[Caml-list] posting policy and spam
[ Home ] [ Index: by date | by threads ]
[ Search: ]

[ Message by date: previous | next ] [ Message in thread: previous | next ] [ Thread: previous | next ]
Date: 2004-01-04 (12:39)
From: Vitaly Lugovsky <vsl@o...>
Subject: Re: [Caml-list] posting policy and spam

On Sun, 4 Jan 2004, Sven Luther wrote:

> Well, on a similar subject, is there any chance of
> implementing a
> workaround in spamoracle to counter those spams specifically
> designed to
> fool the bayesian filters ? You know, those who have 4 lines
> of random
> words in a text attachement, and then some html spam.

 It's possible to calculate an entropy of a text. If a words
aren't correlated, and a correlation weights distribution is
plain enough - then it's a random text without any meaning
(information content). It's a way how an advanced search engines works.

 I'd be glad to implement this approach if I'd have some free
time. :(

To unsubscribe, mail Archives:
Bug reports: FAQ:
Beginner's list: