On Thu, May 29, 2003 at 11:53:32AM -0400, David Dawes wrote: > On Thu, May 29, 2003 at 07:34:28AM +0200, Sven Luther wrote: > >On Thu, May 29, 2003 at 12:00:22AM -0400, Mike A. Harris wrote: > >> On Wed, 28 May 2003, Sven Luther wrote: > >> > >> >> > I was being sarcastic, his message was encoded with koi8-r, which, along > >> >> > with being html, is one of the indescriminate reasons people block email > >> >> > (and get a good number of false positives) > >> >> > >> >> however, foreign language encoding is separate from html email. > >> >> > >> >> blocking based on foreign language encodings is not such a good idea. > >> >> blocking html is not so bad, though. > >> > > >> >You need to block multi-part mails with only one html part too though, > >> >which is not so easy to do, i think. > >> > >> This filter doesn't catch *everything*, but for the last 6 years > >> or so, it has had zero false positives for me while subscribed to > >> limitless numbers of mailing lists. > >> > >> :0: > >> * ^Content-Type:.*text/html > >> HTML > > > >Yep, i have this too, but half the html spam i get pass trough this, and > >because it is : > > > >Content-Type: multipart/alternative; > > boundary="E_BBFDE6F0B.95CA_CC.D7." > >... > >This is a multi-part message in MIME format. > > > >--E_BBFDE6F0B.95CA_CC.D7. > >Content-Type: text/html > >Content-Transfer-Encoding: quoted-printable > >... > >--E_BBFDE6F0B.95CA_CC.D7.-- > > > >On the other hand i don't want to catch the emails which have a text and > >an html section, since they are mostly valid ones. > > The XFree86 mailing list filtering checks for a few different types of > html-only messages, including a few levels deep of nesting (which I've > seen in some spam). It does catch the occasional false-positive, but > it's fairly rare, and a reasonable tradeoff given its effectiveness.
Are they available somewhere so i can take a look ? > >Anyway, i have almost managed to write a sed script doing this, but i am > >not sure if it is possible to get the value of the boundary and match on > >it in the address pattern when using sed. > > If you're prepared to use perl, there are packages for breaking out the > mime structure. I would rather not use perl, if anything, i would write a small ocaml program to do it or maybe extend spamoracle which i already call. The execution cose per mail would be lower this way. Friendly, Sven Luther > > David > -- > David Dawes > Founder/committer/developer The XFree86 Project > www.XFree86.org/~dawes > > > ------------------------------------------------------- > This SF.net email is sponsored by: eBay > Get office equipment for less on eBay! > http://adfarm.mediaplex.com/ad/ck/711-11697-6916-5 > _______________________________________________ > Dri-devel mailing list > [EMAIL PROTECTED] > https://lists.sourceforge.net/lists/listinfo/dri-devel ------------------------------------------------------- This SF.net email is sponsored by: eBay Get office equipment for less on eBay! http://adfarm.mediaplex.com/ad/ck/711-11697-6916-5 _______________________________________________ Dri-devel mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/dri-devel
