Re: Extract footer/header text out of Word docs

2012-08-31 Thread Lance Norskog
l message- >>> From:Otis Gospodnetic >>> Sent: Thu 30-Aug-2012 15:30 >>> To: solr-user@lucene.apache.org >>> Subject: Re: Extract footer/header text out of Word docs >>> >>> Hi Alex, >>> >>> I think you may get better hel

Re: Extract footer/header text out of Word docs

2012-08-31 Thread Erick Erickson
gt; Sent: Thu 30-Aug-2012 15:30 >> To: solr-user@lucene.apache.org >> Subject: Re: Extract footer/header text out of Word docs >> >> Hi Alex, >> >> I think you may get better help on the Tika mailing list - Solr uses Tika to >> parse rich text docs and extrac

RE: Extract footer/header text out of Word docs

2012-08-30 Thread Markus Jelsma
30-Aug-2012 15:30 > To: solr-user@lucene.apache.org > Subject: Re: Extract footer/header text out of Word docs > > Hi Alex, > > I think you may get better help on the Tika mailing list - Solr uses Tika to > parse rich text docs and extract text from them.  I don't know

Re: Extract footer/header text out of Word docs

2012-08-30 Thread Otis Gospodnetic
Hi Alex, I think you may get better help on the Tika mailing list - Solr uses Tika to parse rich text docs and extract text from them.  I don't know if Tika can figure out what's from a header and a footer... Otis  Performance Monitoring for Solr / ElasticSearch / HBase - http://sematext.