Hi Chris, 2012/3/28 chris3001 <chrislia...@hotmail.com>
> I am having a hard time integrating UIMA with Solr. I have downloaded the > Solr 3.5 dist and have it successfully running with nutch and tika on > windows 7 using solrcell and curl via cygwin. To begin, I copied the 6 jars > from solr/contrib/uima/lib to the working /lib in solr. Next, I read the > readme.txt file in solr/contrib/uima/lib and edited both my solrconfig.xml > and schema.xml accordingly to no avail. I then found this link which seemed > a bit more applicable since I didnt care to use Alchemy or OpenCalais: > http://code.google.com/a/apache-extras.org/p/rondhuit-uima/?redir=1 Still- > when I run a curl command that imports a pdf via solrcell I do not get the > additional UIMA fields nor do I get anything on my logs. The test.pdf is > parsed though and I see the pdf in Solr using: > curl > ' > http://localhost:8080/solr/update/extract?fmap.content=content&literal.id=doc1&commit=true > ' > -F "file=@test.pdf" > > What I added to my SolrConfig.XML: > > /<updateRequestProcessorChain name="uima"> > <processor > class="org.apache.solr.uima.processor.UIMAUpdateRequestProcessorFactory"> > <lst name="uimaConfig"> > <lst name="runtimeParameters"> > </lst> > <str > > name="analysisEngine">C:\web\solrcelluimacrawler\com\rondhuit\uima\desc\KeyphraseExtractAnnotatorDescriptor.xml</str> > <bool name="ignoreErrors">true</bool> > <str name="logField">id</str> > <lst name="analyzeFields"> > <bool name="merge">false</bool> > <arr name="fields"> > <str>content</str> > </arr> > </lst> > <lst name="fieldMappings"> > <lst name="type"> > <str name="name">com.rondhuit.uima.yahoo.Keyphrase</str> > <lst name="mapping"> > <str name="feature">keyphrase</str> > <str name="field">UIMAname</str> > </lst> > </lst> > </lst> > </lst> > </processor> > <processor class="solr.LogUpdateProcessorFactory" /> > <processor class="solr.RunUpdateProcessorFactory" /> > </updateRequestProcessorChain> > / > I also adjusted my requestHander: > > /<requestHandler name="/update" class="solr.XmlUpdateRequestHandler"> > <lst name="defaults"> > <str name="update.processor">uima</str> > </lst> > </requestHandler>/ > > Finally, my added entries in my Schema.xml > > / > <field name="UIMAname" type="string" indexed="true" stored="true" > multiValued="true" required="false"/> > <dynamicField name="*_sm" type="string" indexed="true" stored="true"/> > / > > All I am trying to do is have test *any* UIMA AE in Solr and cannot figure > out what I am doing wrong. Thank you in advance for reading this. > > if I understood things correctly the problem is that you're using the /update/extract call which uses the SolrCell ExtractingRequestHandler while the UIMA update processor chain is available via the /update path, see: <requestHandler name="/update" class="solr.XmlUpdateRequestHandler"> <lst name="defaults"> <str name="update.processor">uima</str> </lst> </requestHandler>/ HTH Tommaso > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Solr-with-UIMA-tp3863324p3863324.html > Sent from the Solr - User mailing list archive at Nabble.com. >