[Caml-list] posting policy and spam
| Date: | -- (:) |
| From: | Vitaly Lugovsky <vsl@o...> |
| Subject: | Re: [Caml-list] posting policy and spam |
On Sun, 4 Jan 2004, Sven Luther wrote:

> Well, on a similar subject, is there any chance of implementing a
> workaround in spamoracle to counter those spams specifically designed to
> fool the bayesian filters? You know, those that have 4 lines of random
> words in a text attachment, and then some html spam.

It's possible to calculate the entropy of a text. If the words aren't correlated, and the distribution of correlation weights is flat enough, then it's a random text without any meaning (information content). That's how advanced search engines work.

I'd be glad to implement this approach if I had some free time. :(
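As a rough illustration of the entropy idea, here is a minimal sketch in Python. It is not spamoracle code and not the correlation-weight method described above. It only computes the Shannon entropy of the word frequencies and normalizes it by the maximum possible for that many words. The function names, the tokenizing regex, and the use of normalized unigram entropy as a stand-in for "uncorrelated words" are all assumptions made for the example.

```python
import math
import re
from collections import Counter


def _words(text: str) -> list[str]:
    # Hypothetical tokenizer: lowercase runs of letters and apostrophes.
    return re.findall(r"[a-z']+", text.lower())


def word_entropy(text: str) -> float:
    """Shannon entropy, in bits per word, of the word frequencies in text."""
    words = _words(text)
    if not words:
        return 0.0
    total = len(words)
    counts = Counter(words)
    # H = sum over distinct words of p * log2(1 / p), with p = n / total.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


def normalized_entropy(text: str) -> float:
    """Entropy divided by log2(word count), giving a value from 0.0 to 1.0."""
    words = _words(text)
    if len(words) < 2:
        return 0.0
    return word_entropy(text) / math.log2(len(words))
```

A block of random dictionary words rarely repeats a word, so its normalized entropy sits close to 1.0, while ordinary prose repeats common words and scores lower. Any cutoff between the two would have to be tuned on real mail, and short messages score high regardless, so this could at best be one extra signal next to the Bayesian filter.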