Package: crm114
Version: 20090807-1
Severity: normal

I updated to crm114, 20090807, BlameThorstenAndJenny today. Since cssutil 
complained:

 Minor Caution - this file has the learncount slot in use.
 This is not a problem for Markovian classification, but it will have some
 issues with an OSB classfier.

and I use a OSB classifier and also due to the debconf warning, I recreated the 
files from scratch.

It appears to learn:

mar...@shambhala:~/.crm114> ./stats.sh
Als SPAM markiert: 3

 Sparse spectra file spam.css statistics:

 Total available buckets          :      1048577
 Total buckets in use             :         1481
 Total in-use zero-count buckets  :            0
 Total buckets with value >= max  :            0
 Total hashed datums in file      :         1565
 Documents learned                :           40
 Features learned                 :         1566
 Average datums per bucket        :         1.06
 Maximum length of overflow chain :            2
 Average length of overflow chain :         1.00
 Average packing density          :         0.00

Als HAM markiert: 20

 Sparse spectra file nonspam.css statistics:

 Total available buckets          :      1048577
 Total buckets in use             :         8187
 Total in-use zero-count buckets  :            0
 Total buckets with value >= max  :            0
 Total hashed datums in file      :         8389
 Documents learned                :           41
 Features learned                 :         8390
 Average datums per bucket        :         1.02
 Maximum length of overflow chain :            2
 Average length of overflow chain :         1.01
 Average packing density          :         0.01


Script is:

mar...@shambhala:~/.crm114> cat ./stats.sh
#!/bin/sh

echo "Als SPAM markiert: $(find reaver_cache/known_spam/ | wc -l)"
cssutil -rb spam.css

echo "Als HAM markiert: $(find reaver_cache/known_good/ | wc -l)"
cssutil -rb nonspam.css


Yet CRM114 doesn't appear to recognize some mails correctly after learning.

I learn it:

mar...@shambhala:~/Zeit> cat Aufräumpolicy\ _tmp.mbox| crm -u ~/.crm114 
mailreaver.crm --good | grep CRM
/bin/ln: Erzeuge harte Verknüpfung 
„reaver_cache/known_good/20091103_160046_476304_CDA7397D“: Die Datei existiert 
bereits
X-CRM114-Version: 20090807-BlameThorstenAndJenny ( TRE 0.7.6 (BSD) ) MR-27CA1CFB
X-CRM114-CacheID: sfid-20091103_160046_476304_CDA7397D
X-CRM114-Notice: Please train this message.
X-CRM114-Action: LEARNED AND CACHED GOOD

This is what CRM114 thinks about it afterwards:

mar...@shambhala:~/Zeit> cat Aufräumpolicy\ _tmp.mbox| crm -u ~/.crm114 
mailreaver.crm | grep CRM
X-CRM114-Version: 20090807-BlameThorstenAndJenny ( TRE 0.7.6 (BSD) ) MR-27CA1CFB
X-CRM114-CacheID: sfid-20091103_160046_476304_CDA7397D
X-CRM114-Status: UNSURE (   8.14  )
X-CRM114-Notice: Please train this message.


These are my differences to the original configuration:

mar...@shambhala:~/.crm114> diff -u /usr/share/crm114/mailfilter.cf 
mailfilter.cf
--- /usr/share/crm114/mailfilter.cf     2009-08-07 17:22:37.000000000 +0200     
 
+++ mailfilter.cf       2009-11-03 16:11:45.779852564 +0100                     
 
@@ -169,8 +169,8 @@                                                             
 
 #  ---------  will be inserted at the front of the subject if we think the     
 
 #  ---------  mail is spam.                                                    
 
 #                                                                              
 
-# :spam_flag_subject_string: //                                                
 
-:spam_flag_subject_string: /ADV:/                                              
 
+:spam_flag_subject_string: //                                                  
 
+#:spam_flag_subject_string: /ADV:/                                             
 
                                                                                
 
 #  ---------  Do we want to insert a "flagging" string on the subject line     
 
 #  ---------  for good email?  Usually we don't.... so we set this to the      
 
@@ -180,13 +180,13 @@                                                           
 
 #  ------------Similarly, do we want to insert a "flagging" string on          
 
 #  -------------the subject line of an "unsure" email?  This way we know       
 
 #  --------------we need to train it even if "headers" is turned off.          
 
-# :unsure_flag_subject_string: //                                              
 
-:unsure_flag_subject_string: /UNS:/
+:unsure_flag_subject_string: //
+# :unsure_flag_subject_string: /UNS:/

 # ------------- Do we want Training ConFirmation flags on the results of
 # ------------- a message to be learned?  Default is "TCF:".
-:confirm_flag_subject_string: /TCF:/
-#:confirm_flag_subject_string: //
+#:confirm_flag_subject_string: /TCF:/
+:confirm_flag_subject_string: //


 # ---------  Do we want to do any "rewrites" to increase generality and
@@ -194,16 +194,16 @@
 #    --------- NOTE: this option is somewhat slow.  If your mailserver is
 #      --------- maxed out on CPU, you might want to turn this off.
 #
-:rewrites_enabled: /yes/
-#:rewrites_enabled: /no/
+#:rewrites_enabled: /yes/
+:rewrites_enabled: /no/


 #  ---------  Do we copy incoming text into allmail.txt ?  default is yes, but
 #   ---------  experienced users will probably set this to 'no' after testing
 #    ---------  their configuration for functionality.
 #
-:log_to_allmail.txt:  /yes/
-# :log_to_allmail.txt: /no/
+# :log_to_allmail.txt:  /yes/
+:log_to_allmail.txt: /no/

 #   -------  Another logging option - log all mail to somewhere else
 #    -------  entirely.  Whatever pathname is given here will be prefixed

-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (450, 'testing'), (400, 'unstable'), (101, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.31.5-tp42-toi-3.0.1-04850-g4eddd0d (PREEMPT)
Locale: LANG=de_DE.UTF-8, LC_CTYPE=de_DE.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages crm114 depends on:
ii  debconf [debconf-2.0]         1.5.28     Debian configuration management sy
ii  libc6                         2.9-25     GNU C Library: Shared libraries
ii  libtre4                       0.7.6-2    regexp matching library with appro

Versions of packages crm114 recommends:
ii  metamail                      2.7-54     implementation of MIME

crm114 suggests no packages.

-- debconf information:
* crm114/cssupgrade: true



--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org

Reply via email to