I have extracted text from .pdf files and I also inserted page numbers of the .pdf file to the text. My document looks something like:
<content> <page no="2"> ..Some Text..</page> <page no="3"> ..Some Text..</page> .................................. ...........................</page> </content> I indexed my data using solr and I am making highlighted queries.(hl.fragsize=200&hlsnippets=5). Currently I am displaying just snippets to the user, however I also want to capture the page number of the corresponding snippet. I will give a link to jump that page in the original pdf file. Is there a way that I can find out which page the snipped was extracted from, by using the <page> tags? Any ideas and help is appreciated. Thank you... ____________________________________________________________________________________ Be a better friend, newshound, and know-it-all with Yahoo! Mobile. Try it now. http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ