y, December 30, 2013 11:46 AM
> To: solr-user@lucene.apache.org
> Subject: Re: How to use Solr in my project
>
> On 30 December 2013 11:27, Fatima Issawi wrote:
> > Hi again,
> >
> > We have another program that will be extracting the text, and it will be
> extr
On 30 December 2013 11:27, Fatima Issawi wrote:
> Hi again,
>
> We have another program that will be extracting the text, and it will be
> extracting the top right and bottom left corners of the words. You are right,
> I do expect to have a lot of data.
>
> When would solr start experiencing iss
; Sent: Sunday, December 29, 2013 2:48 PM
> To: solr-user@lucene.apache.org
> Subject: Re: How to use Solr in my project
>
> On 29 December 2013 11:10, Fatima Issawi wrote:
> [...]
> > We will have the full text stored, but we want to highlight the text in the
> original imag
On 29 December 2013 11:10, Fatima Issawi wrote:
[...]
> We will have the full text stored, but we want to highlight the text in the
> original image. I expect to process the image after retrieval. We do plan on
> storing the (x, y) coordinates of the words in a database - I suspected that
> it
Hello,
Our pages are images of handwritten text in Arabic so OCR'ing is not possible.
We will be extracting the text during pre-processing and storing the words and
(x, y) coordinates in a database. Would your process apply to our images?
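
As a rough illustration of storing words with their coordinates, here is a minimal sketch using SQLite from the Python standard library. The table name `word_box`, its columns, and both helper functions are hypothetical and only for illustration; the thread does not say which database or schema is used.

```python
import sqlite3


def build_word_index(rows):
    """Create an in-memory table of word positions (hypothetical schema).

    Each row is (page_id, word, left_x, bottom_y, right_x, top_y).
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE word_box ("
        "page_id TEXT, word TEXT, "
        "left_x INTEGER, bottom_y INTEGER, right_x INTEGER, top_y INTEGER)"
    )
    conn.executemany("INSERT INTO word_box VALUES (?, ?, ?, ?, ?, ?)", rows)
    conn.commit()
    return conn


def boxes_for(conn, page_id, word):
    """Return every stored box for `word` on the given page, in insert order."""
    cur = conn.execute(
        "SELECT left_x, bottom_y, right_x, top_y FROM word_box "
        "WHERE page_id = ? AND word = ? ORDER BY rowid",
        (page_id, word),
    )
    return cur.fetchall()
```

Under this assumption, a search hit would supply the page id and the matched word, and `boxes_for` would return the regions to highlight on that page image.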
> Step 1:
> For sending the extracted text content from
> What do you mean by "word location"? The number on the page? What
> purpose would this serve?
I mean the (x, y) coordinates of the word on the page. We want to be able to
highlight the image of the word that was extracted from the text.
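
To make this concrete, here is a minimal sketch of turning two opposite corners of a word into a rectangle that an image viewer could draw. It assumes pixel coordinates with the origin at the top left of the page image, and that each word is stored as a top-right and a bottom-left corner; the function name `highlight_rect` and the `padding` parameter are hypothetical.

```python
def highlight_rect(top_right, bottom_left, padding=0):
    """Convert two opposite corners into (left, top, width, height).

    Assumes image pixel coordinates (origin at top left, y grows downward).
    `padding` widens the box on every side so the highlight does not clip
    the strokes of the word.
    """
    x_right, y_top = top_right
    x_left, y_bottom = bottom_left
    left = min(x_left, x_right) - padding
    top = min(y_top, y_bottom) - padding
    width = abs(x_right - x_left) + 2 * padding
    height = abs(y_bottom - y_top) + 2 * padding
    return (left, top, width, height)
```

For example, a word with top-right corner (120, 30) and bottom-left corner (40, 60) gives the rectangle (40, 30, 80, 30).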
> I think that you might be confusing things:
> * If you
Highlighting can be done as a three-step process:
Prerequisite: Get the text PDF produced by OCR of the image PDF.
Step 1:
For sending the extracted text content from the text PDF to Solr, use a
low-level PDF converter such as poppler-utils (pdftotext or pdftohtml) to
correctly get the coordinates
On 26 December 2013 15:44, Fatima Issawi wrote:
> Hi,
>
> I should clarify. We have another application extracting the text from the
> document. The full text from each document will be stored in a database
> either at the document level or page level (this hasn't been decided yet). We
> will a
make more sense?
Fatima
-----Original Message-----
From: Gora Mohanty [mailto:g...@mimirtech.com]
Sent: Thursday, December 26, 2013 1:00 PM
To: solr-user@lucene.apache.org
Subject: Re: How to use Solr in my project
On 26 December 2013 10:54, Fatima Issawi wrote:
> Hello,
>
> First off, I apolo
On 26 December 2013 10:54, Fatima Issawi wrote:
> Hello,
>
> First off, I apologize if this was sent twice. I was having issues
> subscribing to the list.
>
> I'm a complete noob in Solr (and indexing), so I'm hoping someone can help me
> figure out how to implement Solr in my project. I have go