On Thu, May 29, 2003 at 12:00:22AM -0400, Mike A. Harris wrote:
> On Wed, 28 May 2003, Sven Luther wrote:
> 
> >> > I was being sarcastic, his message was encoded with koi8-r, which, along
> >> > with being html, is one of the indescriminate reasons people block email
> >> > (and get a good number of false positives)
> >> 
> >> however, foreign language encoding is separate from html email.
> >> 
> >> blocking based on foreign language encodings is not such a good idea.
> >> blocking html is not so bad, though.
> >
> >You need to block multi-part mails with only one html part too though,
> >which is not so easy to do, i think.
> 
> This filter doesn't catch *everything*, but for the last 6 years 
> or so, it has had zero false positives for me while subscribed to 
> limitless numbers of mailing lists.
> 
> :0:
> * ^Content-Type:.*text/html
> HTML

Yep, i have this too, but half the html spam i get pass trough this, and
because it is :

Content-Type: multipart/alternative;
        boundary="E_BBFDE6F0B.95CA_CC.D7."
...
This is a multi-part message in MIME format.

--E_BBFDE6F0B.95CA_CC.D7.
Content-Type: text/html
Content-Transfer-Encoding: quoted-printable
...
--E_BBFDE6F0B.95CA_CC.D7.--

On the other hand i don't want to catch the emails which have a text and
an html section, since they are mostly valid ones.

Anyway, i have almost managed to write a sed script doing this, but i am
not sure if it is possible to get the value of the boundary and match on
it in the address pattern when using sed.

> I go through the HTML folder occasionally, and all of the stuff 
> is junk spam.  There may be the opportunity for false positives, 
> but in practice over the years, I've yet to see any with my mail 
> load.  Different people's mail usage may vary however...

Yes, i agree. My problem is that i use a bayesian spamfilter
(spamoracle) which learned all the html tags and thus catched valid
mails which had an html part. This has been corrected since though, but
i will have to retrain my database anyway.

Friendly,

Sven Luther


-------------------------------------------------------
This SF.net email is sponsored by: eBay
Get office equipment for less on eBay!
http://adfarm.mediaplex.com/ad/ck/711-11697-6916-5
_______________________________________________
Dri-devel mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/dri-devel

Reply via email to