Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-17 Thread Jeff Breidenbach
The only things indexed for search are: message-id, subject, date (usually extracted from the Received: header), sender name (extracted from the From: header), posting address (for example, gossip@mail-archive.com), archival message number, and message body. Every message is sorted and organized accord
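
As a rough illustration of the fields listed above, here is a minimal sketch that pulls them out of a raw message with Python's standard email package. It is not the archive's actual indexing code. The function name extract_index_fields, the dictionary keys, the choice of the topmost Received: header, and the fallback to the Date: header are all assumptions made for the example.

```python
from email import message_from_string
from email.utils import parseaddr, parsedate_to_datetime


def _parse_date(text):
    """Return a datetime for an RFC 2822 date string, or None if unparsable."""
    try:
        return parsedate_to_datetime(text)
    except (TypeError, ValueError):
        return None


def extract_index_fields(raw, posting_address, archive_number):
    """Collect the searchable fields for one message (hypothetical helper)."""
    msg = message_from_string(raw)

    # Assumption: use the topmost Received: header; its timestamp follows
    # the last semicolon.
    date = None
    received = msg.get_all("Received") or []
    if received:
        date = _parse_date(received[0].rsplit(";", 1)[-1].strip())
    # Assumption: fall back to the Date: header when Received: is unusable.
    if date is None and msg["Date"]:
        date = _parse_date(msg["Date"])

    sender_name, _address = parseaddr(msg.get("From", ""))
    body = "" if msg.is_multipart() else msg.get_payload()

    return {
        "message_id": msg.get("Message-ID"),
        "subject": msg.get("Subject"),
        "date": date,
        "sender_name": sender_name,
        "posting_address": posting_address,  # supplied by the importer
        "archive_number": archive_number,    # supplied by the importer
        "body": body,
    }
```

Note that the posting address and archival number are passed in rather than read from the message, which matches the point made later in the thread that the list name is overridden on import.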

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-17 Thread Shahrukh Merchant
1. Yes, we override list name on import. OK, so they are threaded and paginated independently of what's in the "To:" line. 2. Search will have no concept of alternative list names. There is no reasonable way to overcome this. Hmm, I don't understand this, given your answer to (1) above. If

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-17 Thread Jeff Breidenbach
1. Yes, we override list name on import. 2. Search will have no concept of alternative list names. There is no reasonable way to overcome this. 3. Why not use the tool that Earl mentioned? 4. We always merge into the new list name and set up an HTTP redirect so that the old URLs are not broken. Mer

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-17 Thread Shahrukh Merchant
Still working on processing the old email digests to convert them to individual emails in mbox format for import. Meanwhile, though, I thought of a new issue which has to do with identifying the list name from the email headers, given that the list name (and consequently the email address in t
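
For the digest-to-mbox step mentioned above, here is a minimal sketch using Python's standard mailbox module. It is not the tool referred to elsewhere in the thread. It assumes an RFC 1153-style digest, where a line of 70 hyphens follows the preamble and a line of 30 hyphens separates messages; real digests vary, so the separator pattern would likely need adjusting. The names split_digest and write_mbox are hypothetical.

```python
import mailbox
import re
from email import message_from_string

# Assumption: RFC 1153-style separators (70 hyphens after the preamble,
# 30 hyphens between messages), each alone on its line.
SEPARATOR = re.compile(r"^(?:-{70}|-{30})[ \t]*$", re.MULTILINE)


def split_digest(digest_body):
    """Split a digest body into individual email.message.Message objects."""
    messages = []
    for chunk in SEPARATOR.split(digest_body):
        text = chunk.strip("\n")
        if not text:
            continue
        msg = message_from_string(text + "\n")
        # The preamble and trailer carry no From:/Subject: headers; skip them.
        if msg["From"] and msg["Subject"]:
            messages.append(msg)
    return messages


def write_mbox(messages, path):
    """Append the messages to an mbox file at the given path."""
    box = mailbox.mbox(path)
    box.lock()
    try:
        for msg in messages:
            box.add(msg)
    finally:
        box.close()  # flushes pending writes and releases the lock
```

Because each message is parsed before it is written, headers such as To: could be inspected or rewritten at this point if the list name needs to be normalized before import.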