[Gossip] thoughts for right now and the future

2020-10-25 Thread Jeff Breidenbach
Dear Friends, As I've mentioned in the past, Mail Archive is alive and well in most respects. The one exceptions is customer support, for which I am regretfully and woefully behind. The service is hosted in the United States, and you might have heard about an upcoming election in this country. Th

Re: [Gossip] unic...@unicode.org no longer archived ..

2020-07-24 Thread Jeff Breidenbach
Hi Steffen, I checked and for whatever reason, they simply aren't sending email to arch...@mail-archive.com. No idea why. If you can help make that happen, archiving will work again. While we are talking, I wants folks to know that the Mail Archive itself is running fine (knock on wood). But I ca

Re: [Gossip] Partial match search

2019-02-03 Thread Jeff Breidenbach
>What type of partial match search is supported on mail-archive? Here's the short and long explanation of search syntax. https://www.mail-archive.com/faq.html#search https://www.mail-archive.com/searching.html ___ Gossip mailing list https://www.mail-ar

Re: [Gossip] The Mail Archive transitioning away from a small business

2019-01-27 Thread Jeff Breidenbach
So far, so good with this transition. Happy 2019 to everyone. Jeff > ___ Gossip mailing list https://www.mail-archive.com/gossip@mail-archive.com https://www.mail-archive.com/cgi-bin/mailman/options/gossip

[Gossip] The Mail Archive transitioning away from a small business

2018-09-03 Thread Jeff Breidenbach
Dear Friends, The Mail Archive has been running for 20 years. It started as hobby and grew into a small business 14 years ago. We have now come full circle. The business will end on December 31, 2018 and the service will revert to a hobby. What happened? Well, traffic has steadily declined for ma

[Gossip] The Mail Archive, 2017

2018-01-07 Thread Jeff Breidenbach
Next month will be the 20th anniversary of The Mail Archive. We had a peaceful, sane year which is more than I can say for much of the world. No major changes to the service. Uptime was by far the best ever, at 99.996%. Our previous best was just shy of 99.9% in 2016. A few disks failed here and t

Re: [Gossip] rcpp-devel: Can't reply via email

2017-09-17 Thread Jeff Breidenbach
Thanks for the report. There was a bug in list name validation. Should be fixed now. Jeff ___ Gossip mailing list https://www.mail-archive.com/gossip@mail-archive.com https://www.mail-archive.com/cgi-bin/mailman/options/gossip

[Gossip] uptime

2017-08-30 Thread Jeff Breidenbach
I know nobody cares, but I noticed that The Mail Archive achieved over 99.99% uptime for a year. Kind of neat. We'll see if this message jinxes it. For comparison, if you count the recent total eclipse in the United States as a failure, the sun had about 99.9995% uptime. https://www.mail-archive.c

Re: [Gossip] exporting mailing lists in mbox format

2017-06-04 Thread Jeff Breidenbach
Sorry for missing this earlier. There is no self service mechanism for downloading raw mail, in pat to prevent address harvesting. The normal procedure is to contact support team with a request. Unfortunately this is an 11 year old mailing list, and a bunch of the raw mail has been moved to very c

Re: [Gossip] Replacing StartCom certificate

2017-04-24 Thread Jeff Breidenbach
The Mail Archive has switched certificate authorities, and dropped extended validation. Please let me know if you notice any problems. ___ Gossip mailing list https://www.mail-archive.com/gossip@mail-archive.com https://www.mail-archive.com/cgi-bin/mailma

Re: [Gossip] doing simple data analysis on list archives

2017-04-22 Thread Jeff Breidenbach
Glad you were able to get things working. For search experts out there, The Mail Archive uses Lucene search syntax. See the following documentation. http://www.mail-archive.com/faq.html#search http://www.mail-archive.com/searching.html More specifically, the site uses the default Lucene query par

Re: [Gossip] Some mails missing from us...@open-mpi.org archive

2017-03-19 Thread Jeff Breidenbach
We use MHonArc to render emails for the web, which is open source. So any impatient programmers who are really hungry for this feature may want to dust off their Perl programming skills. Jeff ___ Gossip mailing list https://www.mail-archive.com/gossip@ma

Re: [Gossip] Some mails missing from us...@open-mpi.org archive

2017-02-15 Thread Jeff Breidenbach
No it's fine, this is a topic of general interest. Missing mail typically means a false positive from spam filtering. We previously used Postini, and when they shut down switched to Spam Hero. There's definitely room for improvement and I'm always open to suggestions if anyone knows of something be

[Gossip] The Mail Archive, 2016

2017-01-02 Thread Jeff Breidenbach
A lot of crazy stuff happened in 2016. Looking back, what parts of that touched The Mail Archive? Let's start with mundane computer stuff. Uptime was great, the service was online for the entire year except for 9 hours, 12 minutes. Some additional hard drives were converted to SSD, and some RAM wa

[Gossip] policy change for message deletion

2016-11-14 Thread Jeff Breidenbach
We’ve decided to make a policy change for The Mail Archive regarding message removal. For the last eighteen years (time flies!) list administrators have been responsible for decisions on removing content from publicly archived lists. We have had this policy in place in order to preserve the integri

Re: [Gossip] Replacing StartCom certificate

2016-10-20 Thread Jeff Breidenbach
Thanks for the heads up. Highly appreciated. I'm impressed that you know the certificate vendor for The Mail Archive. I was not aware of the drama going on with StartCom. Is it correct that the removal only applies to new certificates, and therefore the deadline for action is May 3, 2017 when the c

Re: [Gossip] Archiving stopped after server and system changed

2016-08-27 Thread Jeff Breidenbach
Ezmlm defaults to "X-No-Archive: Yes" in the header, which prohibits archiving. I checked and confirmed that the following message-id has the anti-archiving header. You'll need to turn that header off to use The Mail Archive. ae39af47dd7f5a4a96234b8e6bcac1050112122...@ox-ex.oxid-esales.local _

Re: [Gossip] Start of Thread Email was Wrongly Threaded.

2016-08-05 Thread Jeff Breidenbach
You are correct, we match subject lines in absence of explicit headers. The matching window is the most recent 3000 messages. This was more important two decades ago when mail user agents were less fastidious with headers. Will think about it. Thanks for the feedback and sorry for the misthread. If

[Gossip] Gmane

2016-08-01 Thread Jeff Breidenbach
Today I want to express my support and appreciation for Lars Magne Ingebrigtsen. For those not familiar, Lars created and has been running the utterly terrific email archiving service Gmane for 14 years. I've long admired both Gmane's engineering excellence and integrity. Unfortunately, one downsi

Re: [Gossip] Outage?

2016-05-31 Thread Jeff Breidenbach
Thanks for asking. We had some problems on May 15 during a cascading failure of hard drives. Archiving was paused for about a day and there was also some downtime during investigation. No data was lost. Today archiving was paused for about an hour while the failed hard disks were being replaced. T

Re: [Gossip] certificate chain is incomplete

2016-01-09 Thread Jeff Breidenbach
Thanks for the detailed report. I made some changes and now get a 'A' rating on the online test. Does this fix the Android problems? ___ Gossip mailing list https://www.mail-archive.com/gossip@mail-archive.com https://www.mail-archive.com/cgi-bin/mailman/

[Gossip] 2015 end of year

2015-12-30 Thread Jeff Breidenbach
Once upon a time, an ancient fish crawled out of the water and into the mud. That was an important moment in the grand journey of life. Its children learned to live on the land and eventually became us. On December 21, I watched the a young company called SpaceX successfully land a rocket booster s

Re: [Gossip] not getting emails from redhat.com lists

2015-11-15 Thread Jeff Breidenbach
Our inbound spam filtering service, SpamHero, decided that the confirmation message was phishing. Sending it through now, and we'll discuss this with the SpamHero folks. Suggestions for alternate spam filtering services are always welcome. ___ Gossip mail

Re: [Gossip] archive not updated for board-disc...@documentfoundation.org mailinglist

2015-06-30 Thread Jeff Breidenbach
All the quarantined TDF messages were from you, except for one from bugzilla. No idea why. Anyway, I've whitelisted both the entire opendocumentfoundation.org domain, and also you specifically. Yell if you see any more trouble. Best thing you can do to help is continue keeping TDF lists from being

Re: [Gossip] archive not updated for board-disc...@documentfoundation.org mailinglist

2015-06-30 Thread Jeff Breidenbach
The spam filtering service we use (SpamHero) quarantined that message along with some others. I've released the messages from quarantine and also adjusted the whitelist to hopefully reduce or prevent this from happening in the future. I'm sorry about this and we would definitely consider switching

Re: [Gossip] Porting digested new list archives to mail-archive

2015-05-04 Thread Jeff Breidenbach
A more detailed response was sent over private mail, but the short answers are (1) yes, as per FAQ (2) thanks for the suggestion, will add it to the list of things to think about. ___ Gossip mailing list https://www.mail-archive.com/gossip@mail-archive.co

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-18 Thread Jeff Breidenbach
Yes, you can safely leave out To, Message-id, and Received. Consequences are what you'd expect, like the inability to do a message-id search and find that particular message. You are correct. Posting address is manually assigned during the bulk import process, and automatically determined from hea

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-17 Thread Jeff Breidenbach
The only things indexed for search are: message-id, subject, date (usually extracted from the Recieved: header), sender name (extracted from From: header), posting address (for example, gossip@mail-archive.com), archival message number, and message body. Every message is sorted and organized accord

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-17 Thread Jeff Breidenbach
1. Yes, we override list name on import. 2. Search will have no concept of alternative list names. There is no reasonable way to overcome this. 3. Why not use the tool that Earl mentioned? 4. We always merge into the new list name and set up an HTTP redirect so that the old URLs are not broken. Mer

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-14 Thread Jeff Breidenbach
Statute of limitations is typically 3 kilomessages on a normal non-import list, but should (I think) be unlimited on bulk import. Conversion to unix newlines is required and is manual; doesn't matter who does it. Still prefer to do whole import at once especially if tricky; less labor, also less l

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-14 Thread Jeff Breidenbach
* I recommend doing the import all at once, rather than in stages. Not for technical reasons, it just saves manual labor. * Happy to make a tarball of the HTML after the import. It will look like basic MHonArc output and cosmetically differ quite a bit from what is served, because there is signifi

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-13 Thread Jeff Breidenbach
First, it is very common and super easy to directly import from a mailman (pipermail) archive. If the pipermail archive is publicly online, just supply the URL to the support team. The Mail Archive does not split digests back into individual messages. That's way too scary. If a digest is presente

[Gossip] experimental search interface, feedback requested

2014-04-27 Thread Jeff Breidenbach
We're experimenting with a new user interface for search. It works a little differently, what do people think? To try this on your own list, just replace "search" in the URL with "searchdev". Cheers, Jeff === OLD http://www.mail-archive.com/search?q=squirrel&l=cayugabirds-l%40cornell.edu NEW

[Gossip] 2014 spring update

2014-04-13 Thread Jeff Breidenbach
Happy Spring everyone. Here are some updates for The Mail Archive. Search is about 10 times faster than before. This is due to a complete rewrite that shaves off a ton of initialization time. I'm really happy about this. As an experiment, we're changing the way we serve ads. Previously, direct vi

Re: [Gossip] Temporary archive dysfunction

2014-01-05 Thread Jeff Breidenbach
After investigation, this turned out to be an issue with an "X-No-Archive: Yes" header on the list itself. Cheers, Jeff ___ Gossip mailing list https://www.mail-archive.com/gossip@mail-archive.com http://mail-archive.com/cgi-bin/mailman/options/gossip

[Gossip] test message #1

2013-12-06 Thread Jeff Breidenbach
___ Gossip mailing list https://www.mail-archive.com/gossip@mail-archive.com http://mail-archive.com/cgi-bin/mailman/options/gossip

[Gossip] archiving suspension over

2013-11-24 Thread Jeff Breidenbach
You may have noticed that archiving was suspended at The Mail Archive recently. Things are fine now, read on if you want gory details. We are hosted at a professional datacenter, complete with building wide uninterruptible power supply (UPS) and backup generators. About 5 years ago, the datacenter

[Gossip] change of mailing list name

2013-11-10 Thread Jeff Breidenbach
The gossip mailing list has a new name: gossip@mail-archive.com It's been over fifteen years, but I've finally got around to moving this list to where it should have been the whole time. All subscribers have been migrated. If there are any issues, please let me know. http://www.mail-archive.com/fa

[Gossip] test #7

2013-11-09 Thread Jeff Breidenbach
___ Gossip mailing list Gossip@mail-archive.com http://mail-archive.com/cgi-bin/mailman/listinfo/gossip

Re: Archived-At links - another non-working example

2013-09-02 Thread Jeff Breidenbach
I've now had some time to spruce up the search feature. Amazing what progress Lucene has made in the last few years. Search is slightly faster now, there is less code on our end, and I found and fixed a couple of rare bugs involving HTML escaping. What I can't do is reproduce your problem; it work

SpamHero, HTTPS

2013-07-28 Thread Jeff Breidenbach
The Mail Archive has long used a spam filter for inbound email. For many years, we've used a service called Postini. Over the last month we've switched to a different service, called SpamHero. Overall it appears to be working better, but nothing is perfect. If your list is experiencing any issues w

Re: Search returning 404

2013-05-31 Thread Jeff Breidenbach
Individual list search is very important. Thank you for reporting the problem. This turned out to be a configuration mistake on the webserver, involving the MultiViews configuration directive. Search should be working now. Let us know if you see any anomalies, and in the meantime we're looking into

Postini alternative

2013-04-14 Thread Jeff Breidenbach
Dear List Admins, This is a question for some of the expert spam-fighters out there. As you might imagine, The Mail Archive is on the receiving end of a fair amount of email spam. We currently reduce some of that inbound spam using a service called Postini Message Filtering. Unfortunately that se

Happy 2013

2013-01-05 Thread Jeff Breidenbach
Happy New Year. As The Mail Archive enters its 15th year of operation, let's take a quick look back. This year we had a record uptime percentage of 99.69%. That number jumps to 99.96% if you forgive the day we were deliberately dark in protest of the proposed SOPA law in the United States. There a

Re: New look to Mail Archive message pages

2012-12-09 Thread Jeff Breidenbach
Okay, so here's a quick status update. There are have been suggestions about fonts. Font weight. Line spacing. Typeface. Line spacing is interesting, a big design goal was to make more information available with less scrolling. However, we've found several references that suggest 1.4 is good in ter

Re: New look to Mail Archive message pages

2012-12-01 Thread Jeff Breidenbach
>What I dislike is that visited links are indistinguishable from non >visited ones. The difference in color just is way too little. I didn't notice this until you mentioned it. Now it is driving me crazy. Thank you for the feedback. -- To unsubscribe, send mail to gossip-unsubscr...@jab.org.

Re: New look to Mail Archive message pages

2012-12-01 Thread Jeff Breidenbach
Thank you for the feedback, keep it coming. Interesting screenshots. By the way, my everyday platform is also Ubuntu (10.04 and 12.04). So far I can't reproduce font problems with courier. As for the ordering of font-family, that's a good question. Let me check with graphic designer. -Jeff --

Re: New look to Mail Archive message pages

2012-11-27 Thread Jeff Breidenbach
Ralph: Can you share your browser + operating system including version numbers? If it is not too hard, a screenshot of a message page as well? Do you know what your browser zoom level is set to? Feel free to reply privately if you prefer. http://www.whatismybrowser.com/ http://www.smartsheet.com/

Re: site design

2012-08-28 Thread Jeff Breidenbach
Thanks everyone for public and private feedback. Graphical summation here for the moment. Did I miss or misunderstand anything? http://mail-archive.com/feedback.png >> Access keys should >> also be defined to do similar. > >And that would be really nice. (but they should also be advertized, >mayb

site design

2012-08-27 Thread Jeff Breidenbach
Hello all, The Mail Archive is now 14 years old (that's a long time in dog years) and we've been thinking about some design updates. The mockup below is intended for visitors from global search engines. Direct visitors will continue to have no advertisements. I hope that the proposed design is an

Re: Archived-At links for new mails not working anymore (since June-10)

2012-07-02 Thread Jeff Breidenbach
Okay, found the bug. On June 10th I made a change to deal with archives containing more than one million messages. A piece of code had decided that one million was a really big number and was starting to write scientific notation to some internal log files. Unfortunately my change was flawed and I

Re: arch...@mail-archive.com Slow to List New Posts?

2012-07-01 Thread Jeff Breidenbach
> Fair enough, I found how to have http://cr.yp.to/ezmlm.html subscribe > something other than the envelope or header from address. If you think this would be useful for others, please consider sharing either here, or we can put in the http://mail-archive.com/faq.html if appropriate. > I was a bi

Re: Archived-At links for new mails not working anymore (since June-10)

2012-06-25 Thread Jeff Breidenbach
Thanks for the note, we'll take a look. By the way, there's a discussion about using a shorter URL. Would that be useful to you? http://www.mail-archive.com/mailman-developers@python.org/msg12770.html

infrastructure update

2012-05-24 Thread Jeff Breidenbach
For many years, The Mail Archive has run with a high replication factor for storage - usually 7X. This helps with performance; more drives help serve more requests. It helps with reliability; disk failure is no big deal when there are many more just like it. And we splice full backups right off the

Re: Dark for 24 hours, starting now

2012-01-17 Thread Jeff Breidenbach
The only change is all web pages are serving HTTP 503 (Service Temporarily Unavailable) for the next 24 hours. There is no disruption to email archival. We noticed at least one other email archival service (marc.info) is also participating. -Jeff

Dark for 24 hours, starting now

2012-01-17 Thread Jeff Breidenbach
The Mail Archive is now dark. We will be offline for 24 hours in protest of proposed US censorship laws. -Jeff

SOPA blackout Jan 18, 2012

2012-01-16 Thread Jeff Breidenbach
The Mail Archive will be participating in SOPA Blackout Day. On January 18th, 2012 we will be dark. The Stop Online Piracy Act H.R.3261 (and PIPA, its sister bill S.968) is a proposed United States law, please read it yourself. The basic goal of the bills is to censor all access to non-US websites

Re: Happy 2012

2012-01-14 Thread Jeff Breidenbach
> Finally, one of the charter goals of Mail Archive, Inc. is to have fun Mission accomplished. http://www.airshipventures.com/sightings/1290/2012/01/08 -- To unsubscribe, send mail to gossip-unsubscr...@jab.org.

Happy 2012

2012-01-01 Thread Jeff Breidenbach
Happy New Year! As 2011 ends, it is once again time to look back at The Mail Archive. First, let's talk hardware. We broke the 100 million message barrier this year, which is getting close to the design limit for our current hardware. Next year will almost certainly require a new storage system, a

100 million messages

2011-11-19 Thread Jeff Breidenbach
I'm pleased to announce The Mail Archive has passed the 100 million message mark this past week. It took 13 years and eight generations of hardware, but we made it. -Jeff -- To unsubscribe, send mail to gossip-unsubscr...@jab.org.

Re: Re: [libreoffice-website] Archived-At: links not working since May 31th?

2011-06-26 Thread Jeff Breidenbach
On Sun, Jun 5, 2011 at 9:27 PM, Jeff Breidenbach wrote: > > I assume the indexing is not running at their end for some reason. > > That is exactly correct. If I run by hand it works fine, and that go > link will resolve now. Still looking into why the cron job that kicks >

June weirdness

2011-06-19 Thread Jeff Breidenbach
June has been a weird month. Some of the automatic software programs stopped running, resulting in many search indexes falling behind. Plus we had a crash this morning causing about 90 minutes downtime. Crazy! I think I've finally traced the problem to a May 30 operating system update. It disrupte

mailing list not archived

2011-06-16 Thread Jeff Breidenbach
Sharing response with group. -- Forwarded message -- From: MAISONOBE Luc Date: Thu, Jun 16, 2011 at 1:45 PM Subject: Re: mailing list not archived To: Jeff Breidenbach Jeff Breidenbach a écrit : Sympa sets an X-No-Archive: Yes header by default. > Oh, thanks for the hint

Re: search failure

2011-06-12 Thread Jeff Breidenbach
On Sun, Jun 12, 2011 at 3:11 PM, e-letter wrote: > Unable to use the search feature, e.g. search for "text" or "a" > returns 0 results, clearly incorrect. Works for me, from both the home page and in an individual archive. Can you please supply the exact search URL that is giving trouble? http:/

Re: Re: [libreoffice-website] Archived-At: links not working since May 31th?

2011-06-05 Thread Jeff Breidenbach
 > I assume the indexing is not running at their end for some reason. That is exactly correct. If I run by hand it works fine, and that go link will resolve now. Still looking into why the cron job that kicks off indexing had trouble. Thanks for problem report. -- To unsubscribe, send mail to go

Re: Incorrect mailto: links for list replies.

2011-04-04 Thread Jeff Breidenbach
> I agree, and David pointed it out originally.  Are you implicitly saying > it shouldn't and won't? I'm uncomfortable with the idea of mucking with the subject line, for several reasons (parsing chanllenges, unclear etiquette, internationalization, ...) I would feel a more comfortable if the pra

Re: Incorrect mailto: links for list replies.

2011-04-03 Thread Jeff Breidenbach
> When you add prefixing the subject with "Re: " [...] The Mail Archive doesn't add any prefixing. So I've tried a few more systems (mostly Apple) and none of them are working. That means either I screwed up, or mail user agents don't support this feature, or both. Can a few volunteers use the r

Incorrect mailto: links for list replies.

2011-04-02 Thread Jeff Breidenbach
Hi all, I'm moving this discussion to gossip because it is of general interest. There are about 1000 uses of The Mail Archive's reply feature every day. Based on David's request, I've put together preliminary support for In-Reply-To to help with threading. You can see it in action with curl and I'

Re: What happened to search? (only google custom search with crippled possibilities?)

2011-02-12 Thread Jeff Breidenbach
Oops, sorry for the more-or-less duplicate message. The extra factor of 2 in time is because the temporary files are turning out twice as big as I was expecting. Earl, good suggestion, and no we haven't explored it (yet). -- To unsubscribe, send mail to gossip-unsubscr...@jab.org.

Re: What happened to search? (only google custom search with crippled possibilities?)

2011-02-12 Thread Jeff Breidenbach
I was doing some experiments today, and managed to briefly knock over a server in the process. I looked at searching multiple indexes (Lucene's MultiSearcher) and merging (IndexWriter.addIndexesNoOptimize). The former is unusably slow. The latter seems to be on track for about 6 hours if the desti

Re: What happened to search? (only google custom search with crippled possibilities?)

2011-02-12 Thread Jeff Breidenbach
>Perhaps it's time for us to revisit [Lucene for site-wide search] A few weeks ago I tested Lucene's ability to search across multiple indexes (MultiSearcher) and it is hopelessly slow; queries take 5 seconds across just a few hundred indexes. Right now I'm trying index merging (addIndexesNoOptim

Re: List not appearing (from a google apps 'group')

2011-01-23 Thread Jeff Breidenbach
Fixed faster than expected. Only a few hundred messages affected across the entire corpus as far as I can tell, and they are all being dealt with. One question: the gg1 address is really designed for Google Groups, not for other stuff. Did you try the regular archival address first and get some sor

Re: List not appearing (from a google apps 'group')

2011-01-23 Thread Jeff Breidenbach
Progress report: We are receiving your messages but our system is failing to recognize them as belonging to a new list. It took me a while to find them since gg1 is not logged the same way as regular inbound messages. I will take the sorting engine out back into the parking lot and try to "talk" s

Re: List not appearing (from a google apps 'group')

2011-01-19 Thread Jeff Breidenbach
I'm seeing contradictory data in our inbound mail logs; give us a few days to get this sorted. Thanks for problem report. Jeff -- To unsubscribe, send mail to gossip-unsubscr...@jab.org.

season's greetings

2010-12-28 Thread Jeff Breidenbach
Season's greetings. Thank you all for sticking with The Mail Archive as we close out the decade. Here's a quick rundown of the trials, tribulations, and triumphs over the past year. First, let's talk about infrastructure. Our uptime was 99.57% which is similar to previous years. This year's main

Re: mail-archive search function

2010-12-22 Thread Jeff Breidenbach
On Wed, Dec 22, 2010 at 4:46 AM, e-letter wrote: > Please offer the ability to search using international date format > (-mm-dd) as a search criterion Done. (Actually, this has always worked).

Re: Urls containing "@" screwed up in archived Mails

2010-12-12 Thread Jeff Breidenbach
>The interesting question: is it possible? Are the originating > mails stored so that the visible html can be repaired? A mathematician, a physicist, and an engineer walk into a bar. The mathematician says "The raw mail exists even the old stuff is in offline cold storage. It can be matched by me

Re: Urls containing "@" screwed up in archived Mails

2010-12-11 Thread Jeff Breidenbach
> ((?:cid:)?|(?:(?:http://(?:www\.)?)?mail-archive\.com/)?)($AddrExp) Tested and live. Thanks guys.

Re: Urls containing "@" screwed up in archived Mails

2010-12-10 Thread Jeff Breidenbach
There are two cases that are very common and handled poorly. One is an http hyperlink to mail-archive.com as we've been discussing. The other is a a mailto hyperlink. These are correctly obfuscated, but end up as a broken link. Better to not have a link than a broken one. http://www.mail-archive.

Re: Urls containing "@" screwed up in archived Mails

2010-12-09 Thread Jeff Breidenbach
The Mail Archive does have to be very aggressive to obfuscate email addresses, otherwise a lot of people go bonkers. But yes, it is dumb to break a hyperlink, especially a hyperlink to The Mail Archive. Your feature request is valid and if you are feeling eager, feel free to send in a patch. Otherw

Re: Problem with search by message-id / Archived-At hashes

2010-11-09 Thread Jeff Breidenbach
You found a bug. In The Mail Archive's hash calculation, there is an incorrect urlib.unquote run on the message id. This is escaping the minus sign, and therefore calculating based on an incorrect message id. We're going to have to regenerate the entire message-id index after the bug is fixed. That

Re: Problem with search by message-id / Archived-At hashes

2010-11-08 Thread Jeff Breidenbach
The gossip malling list uses a somewhat obscure and very limited list server called Enemies of Carlotta. Another fun fact is it runs on a 5 watt NSLU2, which has a grand total of 32 megabytes of memory. That's less memory than the very first hardware iteration of The Mail Archive, which was a 90 me

degraded search on home page

2010-11-07 Thread Jeff Breidenbach
If you visit the home page on The Mail Archive, you may notice search is broken. You can not currently search the entire corpus. We're working on it but it will take some time. The other search features still work, e.g. search works fine for an individual archive, and you can still search list nam

request for assistance, Indonesian spam

2010-11-07 Thread Jeff Breidenbach
There are currently 1291 Indonesian language lists stored on The Mail Archive. I want someone to look them over and point out which lists are mostly or entirely spam. Details: - Person needs to speak Indonesian, obviously. - Person must be in the United States. Why? It hard to send payment t

Special test message - please ignore

2010-08-28 Thread Jeff Breidenbach
Testing obfuscation, data below: j...@jab.org http://j...@jab.org mailto:j...@jab.org http://www.mail-archive.com/gossip@jab.org/msg01358.html http://www.mail-archive.com/gossip@jab.org/ http://www.mail-archive.com/gossip@jab.org http://mail-archive.com/gossip@jab.org/msg01358.html http://mail-arc

Re: Advanced search...

2010-08-15 Thread Jeff Breidenbach
I've finally completed localization of advanced search. If you speak a language other than English, now is a great time to click around the user interface and see if there are any silly language mistakes. (To get to advanced search, do a regular search first, then you'll see a link) Cheers, Jeff

Advanced search...

2010-06-06 Thread Jeff Breidenbach
This is general interest, so we are responding publicly. We've discovered that many of the advanced query results have been leaking out through a fractured fiber optic line in the Gulf of Mexico. It is hard to get a precise measurement, but we believe 13 to 20 thousand bits of information per day

Re: [Gossip] search experiment, feedback requested

2010-04-18 Thread Jeff Breidenbach
Ok, it only took ... 4 years ... but we now have sort-by-date available in the advanced search interface. Enjoy. -Jeff > I wasn't clear. Is there some way to organize the search results? > > When I used the trial search engine on the sundial list and typed in > "oglesby" the 550 results were all

Re: Improvements

2010-02-21 Thread Jeff Breidenbach
Hi Andrius, I wasn't aware that there were adult content ads being served at all. Can you please supply a URL so I can take a look? If you don't want to supply to the entire group, send to M-A staff at themailarch...@gmail.com. I'm not keen on adjusting aspects of individual messages because that

Re: Search syntax...

2010-02-18 Thread Jeff Breidenbach
An "advanced search" form would basically have a bunch of fields like "subject" and "date" and "from" then string them together into a query syntax described below. Then redirect that query to the standard search URL. Implementation would most likely be in PHP. Hard part isn't the programming, it i

Re: Improvements

2010-02-15 Thread Jeff Breidenbach
> > The list name links to the info page; optionally not displayed if made > redundant by the nature of the logo. > This one requires too much per-list thinking; we'll only consider changes that are fully automatic. The other parts sound reasonable to me, but it would be nice to know if other peop

Re: Header for list web site?

2010-02-04 Thread Jeff Breidenbach
> I have a web site for the list rules and, very soon, a link to the > archives. Is there a header I can add to my messages that will let the > archiver pick up and display my site URL on its own line? > We'll honor RFC2369 headers. A quick glance suggests List-Help is most appropriate. > > Also

Re: risks of reply to button

2010-02-01 Thread Jeff Breidenbach
>"Manually" disable that button Implemented. List admins can ask to kill off the reply-to button (for new messages only) from M-A support staff. If there is significant demand we'll consider exposing more easily.

risks of reply to button

2010-01-31 Thread Jeff Breidenbach
With Mike's permission, I am moving this over to gossip because it is of general interest.. Like everything else, I think this boils down to cost versus benefit. The reply to button in message pages was used 610 times yesterday. I don't know a better way to commiserate with someone whose pet bunny

Re: A few "pre-purchase" questions...

2010-01-31 Thread Jeff Breidenbach
I meant to say "on the order of 100 queries per second" although right now it is less. http://www.mail-archive.com/faq.html#scale

Re: A few "pre-purchase" questions...

2010-01-31 Thread Jeff Breidenbach
Bring it on. The Mail Archive gets tens of thousands of inbound messages daily, and currently serves on the order of queries per second. So I don't think there is any concern about swamping the service given the numbers mentioned. Doesn't matter to use how many mbox files are involved for imports.

Re: Improvements

2010-01-27 Thread Jeff Breidenbach
Hi Randy, Bad news first.It is not such a great idea for mail-archive.com to re-send mail. That's asking for trouble with respect to abuse and spammers. On the good side, we can get the same effect if the mail server is in cahoots with the archiving service. A specification (RFC5604) is in place a

Re: deletion of old items and archive on Palm dev forum

2010-01-27 Thread Jeff Breidenbach
Hi Zakaria, Deletion policy is outlined in the FAQ. General questions like this one are good for gossip; for a specific case it's probably best to go to the support address. http://www.mail-archive.com/faq.html#delete Cheers, Jeff On Wed, Jan 27, 2010 at 12:44 PM, Zakaria Ousmaal wrote: > >

Re: Welcome to gossip@jab.org

2010-01-13 Thread Jeff Breidenbach
Ok, I checked; the message was dropped because it was bigger than our size limit. We limit inbound message size for a variety of reasons; one is historically attachment heavy archives consume a lot of resources and - statistically speaking - are much more likely to be spammy. (There were some horr

Re: Welcome to gossip@jab.org

2010-01-10 Thread Jeff Breidenbach
About how big was that PNG in bytes?

  1   2   3   4   5   6   7   >