Re: Can Solr analyze content and find dates and places

2013-02-11 Thread SUJIT PAL
Cool! Thanks for the update, this will help if I ever go all the way with UIMA and Solr. -sujit On Feb 11, 2013, at 12:13 PM, jazz wrote: > Hi Sujit, > > Thanks for your help! I moved the RoomNumberAnnotator.xml to the top level of > the jar and used the same solrconfig.xml (with the /). Now

Re: Can Solr analyze content and find dates and places

2013-02-11 Thread jazz
Hi Sujit, Thanks for your help! I moved the RoomNumberAnnotator.xml to the top level of the jar and used the same solrconfig.xml (with the /). Now it works perfect. Best regards, Bart On 11 Feb 2013, at 20:13, SUJIT PAL wrote: > Hi Bart, > > Like I said, I didn't actually hook my UIMA stuff

Re: Can Solr analyze content and find dates and places

2013-02-11 Thread SUJIT PAL
Hi Bart, Like I said, I didn't actually hook my UIMA stuff into Solr, content and queries are annotated before they reach Solr. What you describe sounds like a classpath problem (but of course you already knew that :-)). Since I haven't actually done what you are trying to do, here are some sug

Re: Can Solr analyze content and find dates and places

2013-02-11 Thread jazz
Hi Sujit and others who answered my question, I have been working on the UIMA path which seems great with the available Eclipse tooling and this: http://sujitpal.blogspot.nl/2011/03/smart-query-parsing-with-uima.html Now I worked through the UIMA tutorial of the RoomNumberAnnotator: http://uim

Re: Can Solr analyze content and find dates and places

2013-02-10 Thread Otis Gospodnetic
Lance did some work on opennlp integration. Check the Wiki. Otis Solr & ElasticSearch Support http://sematext.com/ On Feb 8, 2013 4:12 PM, "SUJIT PAL" wrote: > Hi Bart, > > I did some work with UIMA but this was to annotate the data before it goes > to Lucene/Solr, ie not built as a UpdateReque

Re: Can Solr analyze content and find dates and places

2013-02-08 Thread SUJIT PAL
Hi Bart, I did some work with UIMA but this was to annotate the data before it goes to Lucene/Solr, ie not built as a UpdateRequestProcessor. I just looked through the SolrUima wiki page [http://wiki.apache.org/solr/SolrUIMA] and I believe you will have to set up your own aggregate analysis cha

RE: Can Solr analyze content and find dates and places

2013-02-08 Thread Markus Jelsma
rowse/NUTCH-1414 If you're doing multiple languages locale information is very imporant. That counts for an UIMA annotator as well. Cheers, Markus -Original message- > From:Bart Rijpers > Sent: Fri 08-Feb-2013 17:51 > To: solr-user@lucene.apache.org > Subject: Re: Ca

Re: Can Solr analyze content and find dates and places

2013-02-08 Thread Bart Rijpers
Hi Alex, Indeed that is exactly what I am trying to achieve using wordcities. Date will be simple: 16-Jan becomes 16-Jan-2013 in a new dynamic field. But how do I integrate the Java library as UIMA? The documentation about changing schema.xml and solr.xml is not very detailed. Regards, Bart

Re: Can Solr analyze content and find dates and places

2013-02-08 Thread Alexandre Rafalovitch
Hi Bart, I haven't done any UIMA work (I used other stuff for my NLP phase), so not sure I can help much further. But in general, you are venturing into pure research territory here. Even for dates, what do you actually mean? Just fixed expression? Relative dates (e.g. last tuesday?). What about

Re: Can Solr analyze content and find dates and places

2013-02-08 Thread jazz
Thanks Alex, I checked the documentation but it seems there is only a webservice (OpenCalais) available to extract dates and places. http://uima.apache.org/sandbox.html Do you know is there is a Solr Compatible UIMA add-on which detects dates and places (cities) without a webservice? If not, h

Re: Can Solr analyze content and find dates and places

2013-02-08 Thread Alexandre Rafalovitch
Yes, it is possible. You are looking at UIMA or OpenNLP integration, most probably in Update Request Processor pipeline. Have a look here as a start: https://wiki.apache.org/solr/SolrUIMA You will have to put some serious work into this, it is not all tied together and packaged. Mostly because th