English version
Accueil     À propos     Téléchargement     Ressources     Contactez-nous    

Ce site est rarement mis à jour. Pour les informations les plus récentes, rendez-vous sur le nouveau site OCaml à l'adresse ocaml.org.

Browse thread
[Caml-list] posting policy and spam
[ Home ] [ Index: by date | by threads ]
[ Search: ]

[ Message by date: previous | next ] [ Message in thread: previous | next ] [ Thread: previous | next ]
Date: 2004-01-04 (12:39)
From: Vitaly Lugovsky <vsl@o...>
Subject: Re: [Caml-list] posting policy and spam

On Sun, 4 Jan 2004, Sven Luther wrote:

> Well, on a similar subject, is there any chance of
> implementing a
> workaround in spamoracle to counter those spams specifically
> designed to
> fool the bayesian filters ? You know, those who have 4 lines
> of random
> words in a text attachement, and then some html spam.

 It's possible to calculate an entropy of a text. If a words
aren't correlated, and a correlation weights distribution is
plain enough - then it's a random text without any meaning
(information content). It's a way how an advanced search engines works.

 I'd be glad to implement this approach if I'd have some free
time. :(

To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr
Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners