This feels somewhat backwards. It's very hard to extract Line-Number information out of MSWord and next to impossible from PDF. So, it's not whether the Solr is a good fit or not here is that maybe your whole architecture has a major issue. Can you do this/what you want by hand at least once? Down to the precision you want?
If you can, then yes you probably can automate the searching with Solr, though you will still have serious issues (sentence crossing line-boundaries, etc). But I suspect your whole approach will change once you try to do this manually. Regards, Alex. Personal website: http://www.outerthoughts.com/ Current project: http://www.solr-start.com/ - Accelerating your Solr proficiency On Thu, Mar 27, 2014 at 11:46 PM, Saurabh Agarwal <sagarwal1...@gmail.com> wrote: > Can anyone help me please. > > Hi All, > > I am new to Solr and from initial reading i am quite convinced Solr > will be of great help. Can anyone help in making that decision. > > Usecase: > 1. I will have PDF,Word docs generated daily/weekly ( lot of them ) > which kinds of get overwritten frequently. > 2. I have a dictionary kind of thing ( having a list of which > words/small sentences should be part of above docs , words which > cannot be and alternatives for some ). > 3. Now i want Solr to search my Docs produced in step 1 to be searched > for words/small sentences from step 2 and give me my Doc Name/line no > in which they exist. > > Will Solr be a good help to me, If anybody can help giving some > examples that will be great. > > Appreciate your help and patience. > > Thanks > Saurabh