Hi Vinay,

Perhaps you could say more about what you are looking for? Which use cases, for example? Have you seen the book _Taming Text_?

Thanks
Jack

On Fri, Feb 22, 2013 at 8:48 AM, Vinay B, <vybe3...@gmail.com> wrote:
> Hi,
>
> A few questions, some specific to UIMA, others more general.
> 1. The SOLR/UIMA example employs 3rd party (some of which are
> commercial) semantic APIs such as AlchemyAPI and OpenCalais. This
> won't do for our application (semantic analysis of large numbers of
> plain text files). Are there any open source alternatives that work
> with SOLR and can achieve the same results? OpenNLP can extract parts
> of speech and extract names etc. but isn't really meant for concept
> extraction.
> 2. Regardless of the caveat mentioned above, can someone illustrate a
> use case for UIMA annotations, i.e. what kind of queries can be
> performed once a document has been processed via the UIMA plugin?
> 3. Does (or can) SOLR have any disambiguation functionality (either
> native or via a 3rd party plugin) and if so, how can I leverage it?
> Once again, OpenNLP has a part of speech tagger that could possibly be
> used for this.
> e.g. if doc 1 contains text "This pipe is made of lead" (lead is a
> noun) and doc 2 contains text "Lincoln lead by example" (lead is a
> verb), how would I phrase a query intended to return docs that
> contain the term "lead" as a verb? If there's a link that explains
> how to do this, please do post it.
>
> Apparently SIREN (http://siren.sindice.com/index.html) has some of
> this functionality (and more) built in, but the documentation and use
> cases are a bit sketchy. It also hasn't been updated in a year. Does
> anyone know if it will be compatible with future SOLR / Lucene
> releases?
>
> Thanks for your responses.