Re: Get page number of searchresult of a pdf in solr

2013-03-02 Thread Upayavira

page number as a payload on each term? James Dyer, Ingram Content Group, (615) 213-4311 -----Original Message----- From: Michael Della Bitta [mailto:michael.della.bi...@appinions.

Re: Get page number of searchresult of a pdf in solr

2013-03-02 Thread Anirudha Jadhav

e- From: Michael Della Bitta [mailto:michael.della.bi...@appinions.com] Sent: Thursday, February 28, 2013 3:33 PM To: solr-user@lucene.apache.org Subject: Re: Get page number of searchresult of a pdf in solr My guess is the best way

Re: Get page number of searchresult of a pdf in solr

2013-03-01 Thread Aloke Ghoshal

bi...@appinions.com] Sent: Thursday, February 28, 2013 3:33 PM To: solr-user@lucene.apache.org Subject: Re: Get page number of searchresult of a pdf in solr My guess is the best way to do this is to index each page separately and to store a link to the PDF/page in each doc

RE: Get page number of searchresult of a pdf in solr

2013-03-01 Thread Dyer, James

@lucene.apache.org Subject: Re: Get page number of searchresult of a pdf in solr My guess is the best way to do this is to index each page separately and to store a link to the PDF/page in each document. That would probably require you to preprocess the PDFs to turn each one into a single page per

Re: Get page number of searchresult of a pdf in solr

2013-03-01 Thread dev

Is it possible to write a plugin that converts each page separately with Tika and saves all pages in one document (maybe in a dynamic field like "page_*")? I would like to have only one document stored in Solr for each PDF (it fits better with the way my web application is managing the
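
To make the "one document per PDF, one field per page" idea concrete, here is a rough client-side sketch in Python. It is not a Solr plugin and does not call Tika or Solr; it assumes the per-page text has already been extracted, that the schema declares a stored dynamic field matching `page_*`, and that highlighting was requested on those fields. The function names `build_single_doc` and `pages_with_hits` are made up for illustration.

```python
import re

# Matches dynamic field names such as "page_1", "page_27".
PAGE_FIELD = re.compile(r"^page_(\d+)$")


def build_single_doc(pdf_id, page_texts):
    """Build one document per PDF with one page_N field per page."""
    doc = {"id": pdf_id}
    for page_number, text in enumerate(page_texts, start=1):
        doc[f"page_{page_number}"] = text
    return doc


def pages_with_hits(highlighting, doc_id):
    """Return the sorted page numbers that have highlight snippets.

    `highlighting` is the "highlighting" section of a Solr JSON response:
    a mapping of document id -> field name -> list of snippets.
    """
    fields = highlighting.get(doc_id, {})
    pages = []
    for field_name, snippets in fields.items():
        match = PAGE_FIELD.match(field_name)
        if match and snippets:
            pages.append(int(match.group(1)))
    return sorted(pages)
```

With this layout the page number is recovered from the name of the field that produced the snippet, so the application still sees a single Solr document per PDF.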

RE: Get page number of searchresult of a pdf in solr

2013-02-28 Thread Swati Swoboda

You can get the paragraph of the search result via highlights. You'd have to mark your field as stored (re-indexing required) and then specify it in the highlighting parameters. http://wiki.apache.org/solr/HighlightingParameters#hl As for getting the page number, I am not sure if there is more
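
As a rough illustration of the highlighting parameters mentioned above, the following Python sketch only builds a query URL; it does not contact Solr. `hl`, `hl.fl`, `hl.snippets` and `hl.fragsize` are standard Solr highlighting parameters, while the base URL, the field name `text` and the helper name `highlight_query_url` are assumptions for the example.

```python
from urllib.parse import urlencode


def highlight_query_url(base_url, query, field="text"):
    """Build a /select URL that asks Solr for highlighted snippets.

    `field` must be a stored field, otherwise Solr has nothing to highlight.
    """
    params = {
        "q": query,
        "wt": "json",
        "hl": "true",        # turn highlighting on
        "hl.fl": field,      # field(s) to highlight
        "hl.snippets": 3,    # up to three snippets per field
        "hl.fragsize": 200,  # approximate snippet length in characters
    }
    return f"{base_url}/select?{urlencode(params)}"
```

For example, `highlight_query_url("http://localhost:8983/solr", "solr pdf")` produces a URL whose response should contain a `highlighting` section keyed by document id.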

Re: Get page number of searchresult of a pdf in solr

2013-02-28 Thread Michael Della Bitta

My guess is the best way to do this is to index each page separately and to store a link to the PDF/page in each document. That would probably require you to preprocess the PDFs to turn each one into a single page per PDF, or to extract the text per page another way. Michael Della Bitta
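
A minimal Python sketch of the page-per-document approach described above, assuming the text of each page has already been extracted by some other tool. The field names (`pdf_id`, `page`, `text`, `url`) and the helper `build_page_docs` are hypothetical, and the `#page=N` URL fragment is a convention many PDF viewers honor rather than something Solr provides.

```python
def build_page_docs(pdf_id, pdf_url, page_texts):
    """Return one Solr-style document (a dict) per page of a PDF."""
    docs = []
    for page_number, text in enumerate(page_texts, start=1):
        docs.append({
            "id": f"{pdf_id}_p{page_number}",        # unique per page
            "pdf_id": pdf_id,                        # groups pages of one PDF
            "page": page_number,                     # the page number to return
            "text": text,                            # searchable page content
            "url": f"{pdf_url}#page={page_number}",  # link to the PDF/page
        })
    return docs
```

Each search hit then carries its own `page` value and link, at the cost of storing several Solr documents per PDF.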
