Re: Need help with indexing names in a pdf

2013-06-26 Thread Walter Underwood
This kind of text processing is called entity extraction. I'm not up to date on what is available in Solr, but search on that. wunder On Jun 26, 2013, at 10:26 AM, Warren H. Prince wrote: > We receive about 100 documents a day of various sizes. The documents > could pertain to any of 40

Need help with indexing names in a pdf

2013-06-26 Thread Warren H. Prince
We receive about 100 documents a day of various sizes. The documents could pertain to any of 40,000 contacts stored in our database, and could include more than one. For each file we have, we maintain a list of contacts that are related to or involved in that file. I know it will nev