Package: crm114
Version: 20090807-1
Severity: normal

I updated to crm114 20090807-BlameThorstenAndJenny today. Since cssutil complained:

  Minor Caution - this file has the learncount slot in use.
  This is not a problem for Markovian classification, but it will have some
  issues with an OSB classfier.

and I use an OSB classifier, and also because of the debconf warning, I recreated the files from scratch.

It appears to learn:

mar...@shambhala:~/.crm114> ./stats.sh
Marked as SPAM: 3

 Sparse spectra file spam.css statistics:

 Total available buckets          : 1048577
 Total buckets in use             : 1481
 Total in-use zero-count buckets  : 0
 Total buckets with value >= max  : 0
 Total hashed datums in file      : 1565
 Documents learned                : 40
 Features learned                 : 1566
 Average datums per bucket        : 1.06
 Maximum length of overflow chain : 2
 Average length of overflow chain : 1.00
 Average packing density          : 0.00
Marked as HAM: 20

 Sparse spectra file nonspam.css statistics:

 Total available buckets          : 1048577
 Total buckets in use             : 8187
 Total in-use zero-count buckets  : 0
 Total buckets with value >= max  : 0
 Total hashed datums in file      : 8389
 Documents learned                : 41
 Features learned                 : 8390
 Average datums per bucket        : 1.02
 Maximum length of overflow chain : 2
 Average length of overflow chain : 1.01
 Average packing density          : 0.01

The script is:

mar...@shambhala:~/.crm114> cat ./stats.sh
#!/bin/sh
echo "Marked as SPAM: $(find reaver_cache/known_spam/ | wc -l)"
cssutil -rb spam.css
echo "Marked as HAM: $(find reaver_cache/known_good/ | wc -l)"
cssutil -rb nonspam.css

Yet CRM114 doesn't appear to recognize some mails correctly after learning. I train a message as good:

mar...@shambhala:~/Zeit> cat Aufräumpolicy\ _tmp.mbox| crm -u ~/.crm114 mailreaver.crm --good | grep CRM
/bin/ln: creating hard link `reaver_cache/known_good/20091103_160046_476304_CDA7397D': File exists
X-CRM114-Version: 20090807-BlameThorstenAndJenny ( TRE 0.7.6 (BSD) ) MR-27CA1CFB
X-CRM114-CacheID: sfid-20091103_160046_476304_CDA7397D
X-CRM114-Notice: Please train this message.
X-CRM114-Action: LEARNED AND CACHED GOOD

This is what CRM114 thinks about it afterwards:

mar...@shambhala:~/Zeit> cat Aufräumpolicy\ _tmp.mbox| crm -u ~/.crm114 mailreaver.crm | grep CRM
X-CRM114-Version: 20090807-BlameThorstenAndJenny ( TRE 0.7.6 (BSD) ) MR-27CA1CFB
X-CRM114-CacheID: sfid-20091103_160046_476304_CDA7397D
X-CRM114-Status: UNSURE ( 8.14 )
X-CRM114-Notice: Please train this message.

These are my differences from the original configuration:

mar...@shambhala:~/.crm114> diff -u /usr/share/crm114/mailfilter.cf mailfilter.cf
--- /usr/share/crm114/mailfilter.cf  2009-08-07 17:22:37.000000000 +0200
+++ mailfilter.cf  2009-11-03 16:11:45.779852564 +0100
@@ -169,8 +169,8 @@
 # --------- will be inserted at the front of the subject if we think the
 # --------- mail is spam.
 #
-# :spam_flag_subject_string: //
-:spam_flag_subject_string: /ADV:/
+:spam_flag_subject_string: //
+#:spam_flag_subject_string: /ADV:/
 
 # --------- Do we want to insert a "flagging" string on the subject line
 # --------- for good email? Usually we don't.... so we set this to the
@@ -180,13 +180,13 @@
 # ------------Similarly, do we want to insert a "flagging" string on
 # -------------the subject line of an "unsure" email? This way we know
 # --------------we need to train it even if "headers" is turned off.
-# :unsure_flag_subject_string: //
-:unsure_flag_subject_string: /UNS:/
+:unsure_flag_subject_string: //
+# :unsure_flag_subject_string: /UNS:/
 
 # ------------- Do we want Training ConFirmation flags on the results of
 # ------------- a message to be learned? Default is "TCF:".
-:confirm_flag_subject_string: /TCF:/
-#:confirm_flag_subject_string: //
+#:confirm_flag_subject_string: /TCF:/
+:confirm_flag_subject_string: //
 
 
 # --------- Do we want to do any "rewrites" to increase generality and
@@ -194,16 +194,16 @@
 # --------- NOTE: this option is somewhat slow. If your mailserver is
 # --------- maxed out on CPU, you might want to turn this off.
 #
-:rewrites_enabled: /yes/
-#:rewrites_enabled: /no/
+#:rewrites_enabled: /yes/
+:rewrites_enabled: /no/
 
 
 # --------- Do we copy incoming text into allmail.txt ? default is yes, but
 # --------- experienced users will probably set this to 'no' after testing
 # --------- their configuration for functionality.
 #
-:log_to_allmail.txt: /yes/
-# :log_to_allmail.txt: /no/
+# :log_to_allmail.txt: /yes/
+:log_to_allmail.txt: /no/
 
 # ------- Another logging option - log all mail to somewhere else
 # ------- entirely. Whatever pathname is given here will be prefixed

-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (450, 'testing'), (400, 'unstable'), (101, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.31.5-tp42-toi-3.0.1-04850-g4eddd0d (PREEMPT)
Locale: LANG=de_DE.UTF-8, LC_CTYPE=de_DE.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages crm114 depends on:
ii  debconf [debconf-2.0]   1.5.28    Debian configuration management sy
ii  libc6                   2.9-25    GNU C Library: Shared libraries
ii  libtre4                 0.7.6-2   regexp matching library with appro

Versions of packages crm114 recommends:
ii  metamail                2.7-54    implementation of MIME

crm114 suggests no packages.

-- debconf information:
* crm114/cssupgrade: true
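
To make the train-then-classify check above easier to repeat, here is a minimal sketch of a helper that pulls the verdict and score out of the filtered message. It is not part of crm114: the function name crm_status is hypothetical, and it assumes only the "X-CRM114-Status: VERDICT ( SCORE )" header layout seen in the output above.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with crm114): read a filtered message on
# stdin and print "VERDICT SCORE" taken from its X-CRM114-Status header.
# Prints nothing if the header is absent, e.g. in the output of a --good run.
crm_status() {
    sed -n 's/^X-CRM114-Status: *\([A-Za-z]*\) *( *\(-\{0,1\}[0-9.]*\) *).*$/\1 \2/p'
}

# Assumed usage, mirroring the commands in this report:
#   crm -u ~/.crm114 mailreaver.crm < message.mbox | crm_status
```

For the message in this report, the second run would be expected to give "UNSURE 8.14", which is the value to compare before and after training.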