Re: Get page number of searchresult of a pdf in solr

2013-03-02 Thread Upayavira
page number as a > > > payload on each term? > > > > > > James Dyer > > > Ingram Content Group > > > (615) 213-4311 > > > > > > -Original Message- > > > From: Michael Della Bitta [mailto:michael.della.bi...@appinions.

Re: Get page number of searchresult of a pdf in solr

2013-03-02 Thread Anirudha Jadhav
e- > > From: Michael Della Bitta [mailto:michael.della.bi...@appinions.com] > > Sent: Thursday, February 28, 2013 3:33 PM > > To: solr-user@lucene.apache.org > > Subject: Re: Get page number of searchresult of a pdf in solr > > > > My guess is the best way

Re: Get page number of searchresult of a pdf in solr

2013-03-01 Thread Aloke Ghoshal
bi...@appinions.com] > Sent: Thursday, February 28, 2013 3:33 PM > To: solr-user@lucene.apache.org > Subject: Re: Get page number of searchresult of a pdf in solr > > My guess is the best way to do this is to index each page separately > and to store a link to the PDF/page in each doc

RE: Get page number of searchresult of a pdf in solr

2013-03-01 Thread Dyer, James
@lucene.apache.org Subject: Re: Get page number of searchresult of a pdf in solr My guess is the best way to do this is to index each page separately and to store a link to the PDF/page in each document. That would probably require you to preprocess the PDFs to turn each one into a single page per

Re: Get page number of searchresult of a pdf in solr

2013-03-01 Thread dev
Is it possible to write a plugin that is converting each page separately with Tika and saving all pages in one document (maybe in a dynamic field like "page_*")? I would like to have only one document stored in SOLR for each pdf (it fit's better to the way my web application is managing the

RE: Get page number of searchresult of a pdf in solr

2013-02-28 Thread Swati Swoboda
re is more you can do than what Michael suggested... -Original Message- From: d...@geschan.de [mailto:d...@geschan.de] Sent: Thursday, February 28, 2013 3:27 PM To: solr-user@lucene.apache.org Subject: Get page number of searchresult of a pdf in solr Hello, I'm building a web applica

Re: Get page number of searchresult of a pdf in solr

2013-02-28 Thread Michael Della Bitta
My guess is the best way to do this is to index each page separately and to store a link to the PDF/page in each document. That would probably require you to preprocess the PDFs to turn each one into a single page per PDF, or to extract the text per page another way. Michael Della Bitta

Get page number of searchresult of a pdf in solr

2013-02-28 Thread dev
Hello, I'm building a web application where users can search for pdf documents and view them with pdf.js. I would like to display the search results with a short snippet of the paragraph where the search term where found and a link to open the document at the right page. So what I need is