Alastair Rankine wrote:

Seems simple enough, but there's almost certainly some complexity that I'm overlooking.

Yep. What if the images aren't subdirectories of the blog. e.g.

Say we had a blog at http://example.com/foo/bar/blog which in turn referenced images at http://example.com/images/* and enclosures at http://example.com/enclosures/*, the resulting zipfile would contain:

foo/bar/blog/index.xml
images/1.jpg
images/2.jpg
...
enclosures/1.mp3
enclosures/2.mp3


Either you need to rewrite URLs are you need one new file at a know location, not present in the blog itself, to identify where the master index is.

Or maybe not. xml:base attributes in the index.xml might be sufficient to straighten this all out. Indeed that might be necessary because different blog entries live at different URLs themselves. e.g. images/dog.png means one thing at http://example.com/blog/2006/Hello and another at http://example.com/blog/2005/Hello

Perhaps it would be helpful to back up a step. Rather than starting with format design let's try to get a list of everything we need to include in this format. E.g.

1. Complete source text for every blog entry.
2. Metadata for each blog entry
     A. Title
     B. Catgeory
     C. Tags
3. All media referenced by relative URLs from blog entry
4. URL of blog entry

What else?


--
Elliotte Rusty Harold  [EMAIL PROTECTED]
Java I/O 2nd Edition Just Published!
http://www.cafeaulait.org/books/javaio2/
http://www.amazon.com/exec/obidos/ISBN=0596527500/ref=nosim/cafeaulaitA/

Reply via email to