A one-line change to doc2html.pl is needed to add this MIME-type.

Change

  # Microsoft Excel spreadsheet
  if ($XLS2HTML) {
    $mime_type = "application/msexcel|application/vnd.ms-excel";
    $cmd = $XLS2HTML;

to

  # Microsoft Excel spreadsheet
  if ($XLS2HTML) {
    $mime_type =
      "application/msexcel|application/vnd.ms-excel|application/excel";
    $cmd = $XLS2HTML;
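
For anyone wondering why appending "|application/excel" is enough: the
string is a '|'-separated list of alternatives. The following is only a
minimal sketch of that idea, not the actual lookup code in doc2html.pl;
the helper name mime_matches is hypothetical, and I am assuming an exact,
case-insensitive comparison against each alternative.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not taken from doc2html.pl): returns 1 when $type
# is one of the '|'-separated alternatives in $mime_list, otherwise 0.
sub mime_matches {
    my ($mime_list, $type) = @_;
    # Split on the literal '|' and compare each entry exactly, ignoring
    # case, so '.' and '-' are never treated as regex metacharacters.
    foreach my $candidate (split /\|/, $mime_list) {
        return 1 if lc($candidate) eq lc($type);
    }
    return 0;
}

# The two values of $mime_type from the change described above.
my $old = "application/msexcel|application/vnd.ms-excel";
my $new = "application/msexcel|application/vnd.ms-excel|application/excel";

# Report, for each content type, whether the old and new lists accept it.
foreach my $type ("application/msexcel", "application/excel") {
    printf "%-22s old: %-3s new: %s\n", $type,
        (mime_matches($old, $type) ? "yes" : "no"),
        (mime_matches($new, $type) ? "yes" : "no");
}
```

Under that assumption, "application/excel" is rejected by the old list and
accepted by the new one, which is the behaviour the one-line change is
meant to produce.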

My apologies for being too lazy to make that into a patch.

Can anybody confirm that "application/excel" is a standard MIME-type?

--
David Adams
Computing Services
Southampton University


----- Original Message -----
From: "Adrian Bolzan" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Tuesday, October 08, 2002 7:41 AM
Subject: [htdig] Parsing Excel and Powerpoint files


> Hello,
>
> I am having trouble parsing Excel and Powerpoint files.
>
> I am using "xlhtml" and "ppthtml".
>
> Using application/msexcel (or application/vnd.ms-excel) from the
> command line converts the documents.  The command I used was:
>
> $ /usr/local/bin/doc2html.pl /var/httpd/aotnet/htdocs/PDFs/test.xls \
>     application/msexcel http://URL/test.xls \
>     /home/httpd/aotnet/htdocs/search/conf/htdig.conf
>
> However, when I use these content types in htdig.conf, the Excel and
> PowerPoint documents are not parsed.
>
> I see that the content type is reported as application/excel when running
> htdig with the -vvv option.  However, the .xls files are not parsed with
> this content type, either at the command prompt or in htdig.conf.
>
> Similar effects are seen for PowerPoint documents.
>
> Word docs (catdoc), PDFs (pdf2html.pl) and RTFs (rtf2html.pl) are fine.
>
> Any help would be appreciated.
>
> cheers,
>
> Adrian
>
>
> -------------------------------------------------------
> This sf.net email is sponsored by:ThinkGeek
> Welcome to geek heaven.
> http://thinkgeek.com/sf
> _______________________________________________
> htdig-general mailing list <[EMAIL PROTECTED]>
> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
> subject of unsubscribe
> FAQ: http://htdig.sourceforge.net/FAQ.html
>


