:07:57 PM
> Subject: Re: Improving Readability of Hit Highlighting
To answer your questions specifically, here is an example of the raw OCR output:
"CONTRACTORINMPRIMENTAYIVE : mom Ale ACCEPT INFORMATIONON TOUR SHEET TO ea"
to which I would like to see:
"mom ale access tour sheet to"
in the hit highlight. My schema for this field is pretty much
standard, as f
----- Original Message -----
> From: Terence Gannon
> To: solr-user@lucene.apache.org
> Sent: Monday, January 12, 2009 11:00:31 AM
> Subject: Improving Readability of Hit Highlighting
>
I'm indexing text from an OCR of an old document. Many words get read
perfectly, but they're typically embedded in a lot of junk. I would
like the hit highlighting to show only the 'good' words, in the order
in which they appeared in the original document. Is it possible to
use output of the fil
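
For illustration only, here is a minimal sketch of the "keep only the good words, in original order" idea, done as a plain Python pre-processing step rather than inside Solr's highlighter. The word list KNOWN_WORDS and the function keep_known_words are hypothetical names made up for this example; a real setup would need a proper dictionary. Note that the raw sample contains "ACCEPT", so this sketch yields "accept" at that position.

```python
import re

# Hypothetical vocabulary (an assumption for this sketch); in practice this
# would be loaded from a real dictionary or domain word list.
KNOWN_WORDS = {"mom", "ale", "accept", "tour", "sheet", "to"}


def keep_known_words(text, vocabulary=KNOWN_WORDS):
    """Return the tokens of `text` found in `vocabulary`, lowercased,
    in the order in which they appear in the original text."""
    # Split into runs of ASCII letters; punctuation such as ":" is dropped.
    tokens = re.findall(r"[A-Za-z]+", text)
    return [t.lower() for t in tokens if t.lower() in vocabulary]


raw = "CONTRACTORINMPRIMENTAYIVE : mom Ale ACCEPT INFORMATIONON TOUR SHEET TO ea"
print(" ".join(keep_known_words(raw)))
```

Run-together junk such as "INFORMATIONON" is dropped whole here because the sketch only matches complete tokens; recovering words embedded inside a longer token would need a different approach.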