Hello,

I am having trouble parsing Excel and PowerPoint files.

I am using "xlhtml" and "ppthtml".

Running doc2html.pl from the command line with application/msexcel (or 
application/vnd.ms-excel) as the content type converts the documents.  The 
command I used was:

$ /usr/local/bin/doc2html.pl /var/httpd/aotnet/htdocs/PDFs/test.xls \
    application/msexcel http://URL/test.xls \
    /home/httpd/aotnet/htdocs/search/conf/htdig.conf

However, when I use these content types in htdig.conf, the Excel and 
PowerPoint documents are not parsed.

With htdig's -vvv option I can see that the content type is 
application/excel.  However, the .xls files are not parsed with this 
content type either, whether at the command prompt or in htdig.conf.

PowerPoint documents behave the same way.

Word docs (catdoc), PDFs (pdf2html.pl) and RTFs (rtf2html.pl) are fine.

Any help would be appreciated.

cheers,

Adrian


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
