Re: [Gossip] Porting digested new list archives to mail-archive

2015-05-04 Thread Jeff Breidenbach
A more detailed response was sent over private mail, but the short answers are (1) yes, as per FAQ (2) thanks for the suggestion, will add it to the list of things to think about. ___ Gossip mailing list https://www.mail-archive.com/gossip@mail-archive.co

Re: [Gossip] Porting digested new list archives to mail-archive

2015-05-03 Thread Shahrukh Merchant
OK, I'm ready with the archives to upload, with 2 questions before I do so (actually, only the first question is related to the upload): 1. Shall I create the mail-archive entry first and have it start auto-archiving new mail and wait until that's up and running before sending you the old arch

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-18 Thread Shahrukh Merchant
This is probably going to work fine, and let's go ahead and give it a try. If it doesn't work, we'll discuss, figure it out, and try again. Yeah, I'm probably over-analyzing but getting to this point was definitely useful since I now have more confidence what my scripts need to do and what the

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-18 Thread Jeff Breidenbach
Yes, you can safely leave out To, Message-id, and Received. Consequences are what you'd expect, like the inability to do a message-id search and find that particular message. You are correct. Posting address is manually assigned during the bulk import process, and automatically determined from hea

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-18 Thread Shahrukh Merchant
Jeff, Thanks again--more things are clear now. But your response raises more questions in my mind as well. Please bear with me, we're almost there I think. Keeping in mind that I am splitting Digests into individual messages, I have to fake whatever headers are not already there within the

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-17 Thread Jeff Breidenbach
The only things indexed for search are: message-id, subject, date (usually extracted from the Recieved: header), sender name (extracted from From: header), posting address (for example, gossip@mail-archive.com), archival message number, and message body. Every message is sorted and organized accord

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-17 Thread Shahrukh Merchant
1. Yes, we override list name on import. OK, so they are threaded and paginated independently of what's in the "To:" line. 2. Search will have no concept of alternative list names. There is no reasonable way to overcome this. Hmm, I don't understand this, given your answer to (1) above. If

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-17 Thread Jeff Breidenbach
1. Yes, we override list name on import. 2. Search will have no concept of alternative list names. There is no reasonable way to overcome this. 3. Why not use the tool that Earl mentioned? 4. We always merge into the new list name and set up an HTTP redirect so that the old URLs are not broken. Mer

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-17 Thread Shahrukh Merchant
Still working on processing the old email digests to convert them to individual emails in mbox format for import. Meanwhile, though, I thought of a new issue which has to do with identifying the list name from the email headers, given that the list name (and consequently the email address in t

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-14 Thread Jeff Breidenbach
Statute of limitations is typically 3 kilomessages on a normal non-import list, but should (I think) be unlimited on bulk import. Conversion to unix newlines is required and is manual; doesn't matter who does it. Still prefer to do whole import at once especially if tricky; less labor, also less l

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-14 Thread Shahrukh Merchant
On 4/14/2015 9:25 PM, Jeff Breidenbach wrote: * I recommend doing the import all at once, rather than in stages. Not for technical reasons, it just saves manual labor. OK, I may do it in 2 stages, since 1/2 the archives are in mbox format that can be imported instantly. The other half are in D

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-14 Thread Jeff Breidenbach
* I recommend doing the import all at once, rather than in stages. Not for technical reasons, it just saves manual labor. * Happy to make a tarball of the HTML after the import. It will look like basic MHonArc output and cosmetically differ quite a bit from what is served, because there is signifi

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-14 Thread Earl Hood
On Mon, Apr 13, 2015 at 11:19 AM, Matt Morgan wrote: >> 2. Now the harder one. From Sep 1994 (inception) to Apr 2006, the lists >> were hosted using L-Soft's LISTSERV software, which did not keep archives. >> However, I have a complete set of all traffic from that time period, but >> they are all

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-13 Thread Shahrukh Merchant
Thanks Matt and Jeff for your answers--they were very helpful. So, as I understand it: - Sending link to mailman raw archives will take care of all posts from 2006 to present (for which the mailman archives exist). - Listserv digest format to individual email mbox format conversion (for the

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-13 Thread Jeff Breidenbach
First, it is very common and super easy to directly import from a mailman (pipermail) archive. If the pipermail archive is publicly online, just supply the URL to the support team. The Mail Archive does not split digests back into individual messages. That's way too scary. If a digest is presente

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-13 Thread Matt Morgan
On 04/13/2015 12:57 AM, Shahrukh Merchant wrote: I have two discussion lists on the Argentine Tango that are probably going to be suspended going forward owing to lack of activity in the face of many competing technologies in recent years, but that have a treasure of information dating back fro

[Gossip] Porting digested new list archives to mail-archive

2015-04-12 Thread Shahrukh Merchant
I have two discussion lists on the Argentine Tango that are probably going to be suspended going forward owing to lack of activity in the face of many competing technologies in recent years, but that have a treasure of information dating back from 1994. I would like to get these onto mail-archi