I was staring at the comment database, and noticed

mysql> select COUNT(*) from longdescs where length(thetext) > 100000;
+----------+
| COUNT(*) |
+----------+
|      271 |
+----------+
1 row in set (0.48 sec)

mysql> select COUNT(*) from longdescs where length(thetext) > 500000;
+----------+
| COUNT(*) |
+----------+
|       35 |
+----------+
1 row in set (0.51 sec)

mysql> select COUNT(*) from longdescs where length(thetext) > 750000;
+----------+
| COUNT(*) |
+----------+
|       12 |
+----------+
1 row in set (0.49 sec)

I can't imagine *any* of these comments are useful to human beings to read.

If you dump the comment text from the >100k comments into a file, it's
actually 71 meg of comments.

These 271 comments make up ~40% of the space used in the entire comment
database of 26k bugs.

(All these are from RESOLVED bugs, BTW)


Would anyone object to me just telling bugzilla to reject comments >
100k in length?

(i've chosen 100k to keep arguments about what the "right" length is.  I
believe *everyone* can agree that 100k is over that length :P )


Reply via email to