I have extracted text from .pdf files and also inserted
the page numbers of the .pdf file into the text. My
document looks something like this:

  <content>
   <page no="2"> ..Some Text..</page>
   <page no="3"> ..Some Text..</page>
   ..................................
   ...........................</page>
  </content>

I indexed my data using Solr and I am running
highlighted queries (hl.fragsize=200&hl.snippets=5).
Currently I display just the snippets to the user, but
I also want to capture the page number of each
snippet, so that I can give a link that jumps to that
page in the original pdf file.
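
For reference, the request I send looks roughly like the
sketch below. The Solr URL, the field name "content", and
the query term are placeholders for illustration, not my
real setup; only the two highlighting values come from
above.

```python
from urllib.parse import urlencode

# Assumed placeholder: adjust host, port, and path to the actual Solr instance.
SOLR_SELECT_URL = "http://localhost:8983/solr/select"


def build_highlight_query(term, field="content"):
    """Build a select URL that asks Solr for highlighted snippets.

    `field` is a hypothetical field name holding the extracted PDF text.
    """
    params = {
        "q": f"{field}:{term}",
        "hl": "true",          # turn highlighting on
        "hl.fl": field,        # field to take snippets from
        "hl.fragsize": 200,    # approximate snippet size in characters
        "hl.snippets": 5,      # maximum snippets per field
        "wt": "json",
    }
    return SOLR_SELECT_URL + "?" + urlencode(params)


url = build_highlight_query("example")
print(url)
```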

Is there a way to find out which page a snippet was
extracted from, by using the <page> tags?
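
One fallback I can imagine, if Solr cannot report this
directly, is to match each snippet back against the
page-tagged text on the client side. This is only a rough
sketch: it assumes I still have the tagged text available,
that highlighting wraps matches in markup such as <em>, and
that a snippet does not span two pages (in that case it
finds nothing). The function names are my own.

```python
import re

# Matches one <page no="N">...</page> element in the tagged text.
PAGE_RE = re.compile(r'<page no="(\d+)">(.*?)</page>', re.DOTALL)
# Matches any markup tag, e.g. the <em>...</em> added by highlighting.
TAG_RE = re.compile(r"<[^>]+>")


def normalize(text):
    """Collapse all runs of whitespace so line breaks do not block a match."""
    return " ".join(text.split())


def find_page(document_xml, snippet):
    """Return the number of the first page containing the snippet, or None."""
    needle = normalize(TAG_RE.sub("", snippet))
    for number, body in PAGE_RE.findall(document_xml):
        if needle in normalize(body):
            return int(number)
    return None


doc = (
    '<content>'
    '<page no="2">first page text</page>'
    '<page no="3">more text about solr here</page>'
    '</content>'
)
print(find_page(doc, "text about <em>solr</em>"))  # prints 3
```

This scans every page for every snippet, so I would
prefer a way to get the page number from Solr itself.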

Any ideas and help are appreciated.

Thank you...


      