Writing a Solr custom analyzer to post content to Stanbol {was: Need additional data processing in Data Import Handler prior to indexing}

2013-11-02 Thread Dileepa Jayakody
Hi All, I went through possible solutions for my requirement of triggering a Stanbol enhancement during Solr indexing, and I got the requirement simplified. I only need to process the field named "content" to perform the Stanbol enhancement to extract Person and Organizations. So I think it will

Re: Need additional data processing in Data Import Handler prior to indexing

2013-10-29 Thread Dileepa Jayakody
Thanks guys for your ideas. I will go through them and come back with questions. Regards, Dileepa On Wed, Oct 30, 2013 at 7:00 AM, Erick Erickson wrote: > Third time tonight I've been able to paste this link > > Also, you can consider just moving to SolrJ and > taking DIH out of the proces

Re: Need additional data processing in Data Import Handler prior to indexing

2013-10-29 Thread Erick Erickson
Third time tonight I've been able to paste this link Also, you can consider just moving to SolrJ and taking DIH out of the process, see: http://searchhub.org/2012/02/14/indexing-with-solrj/ Whichever approach fits your needs of course. Best, Erick On Tue, Oct 29, 2013 at 7:15 PM, Alexandre

Re: Need additional data processing in Data Import Handler prior to indexing

2013-10-29 Thread Alexandre Rafalovitch
It's also possible to combine Update Request Processor with DIH. That way if a debug entry needs to be inserted it could go through the same Stanbol process. Just define a processing chain the DIH handler and write custom URP to call out to Stanbol web service. You have access to a full record in

Re: Need additional data processing in Data Import Handler prior to indexing

2013-10-29 Thread Michael Della Bitta
Hi Dileepa, You can write your own Transformers in Java. If it doesn't make sense to run Stanbol calls in a Transformer, maybe setting up a web service that grabs a record out of MySQL, sends the data to Stanbol, and displays the results could be used in conjunction with HttpDataSource rather than

RE: Need additional data processing in Data Import Handler prior to indexing

2013-10-29 Thread Dyer, James
13 3:48 PM To: solr-user@lucene.apache.org Subject: Need additional data processing in Data Import Handler prior to indexing Hi All, I'm a newbie to Solr, and I have a requirement to import data from a mysql database; enhance the imported content to identify Persons mentioned and index i

Need additional data processing in Data Import Handler prior to indexing

2013-10-29 Thread Dileepa Jayakody
Hi All, I'm a newbie to Solr, and I have a requirement to import data from a mysql database; enhance the imported content to identify Persons mentioned and index it as a separate field in Solr along with the other fields defined for the original db query. I'm using Apache Stanbol [1] for the co