my pleasure!
2014-05-06 16:43 GMT+08:00 Victor Pascual [via Lucene] < ml-node+s472066n413488...@n3.nabble.com>: > Thank you very much Ahmet for your help. > It finally worked! > > For anyone interested, all your hints where more than useful. I basically > had two problems: > - Didn't have my language detection chain in the update/json > requestHandler > - Didn't create the field where the detected language should be stored > > Again, thanks for your help! > > > On Mon, May 5, 2014 at 5:19 PM, Ahmet Arslan <[hidden > email]<http://user/SendEmail.jtp?type=node&node=4134885&i=0>> > wrote: > > > Hi Victor, > > > > I don't know mysolr, I assume you are using /update/json, lets add your > > chain to defaults section. > > > > <requestHandler name="/update/json" class="solr.UpdateRequestHandler"> > > > > <lst name="defaults"> > > <str name="stream.contentType">application/json</str> > > <str name="update.chain">langid</str> > > </lst> > > </requestHandler> > > > > > > > > > > On Monday, May 5, 2014 4:06 PM, Victor Pascual < > > [hidden email] <http://user/SendEmail.jtp?type=node&node=4134885&i=1>> > wrote: > > Hi there, > > > > I'm indexing my documents using mysolr. I mainly generate a lost of json > > objects and the run: solr.update(documents_array,'json') > > > > > > > > On Mon, May 5, 2014 at 1:08 PM, Ahmet Arslan <[hidden > > email]<http://user/SendEmail.jtp?type=node&node=4134885&i=2>> > wrote: > > > > > Hi Victor, > > > > > > How do you index your documents? Your last config looks correct. > However > > > for example if you use data import handler you need to add > update.chain > > > there too. Same as extraction request hadler if you are using > sole-cell. > > > > > > <requestHandler name="/dataimport" > > > class="org.apache.solr.handler.dataimport.DataImportHandler"> > > > <lst name="defaults"> > > > <str name="config">/home/username/data-config.xml</str> > > > <str name="update.chain">langid</str> > > > </lst> > > > </requestHandler> > > > > > > By the way The URL > > > http://localhost:8080/solr/update?commit=true&update.chain=langid was > > > just an example and meant to feed xml update messages by POST method. > Not > > > to use in a browser. > > > > > > Ahmet > > > > > > On Monday, May 5, 2014 11:04 AM, Victor Pascual < > > > [hidden email] <http://user/SendEmail.jtp?type=node&node=4134885&i=3>> > wrote: > > > > > > Thank you very much for you help Ahmet. > > > > > > However the language detection is still not workin. :( > > > My solrconfig.xml didn't contain that lst section inside the update > > > requestHandler. > > > That's the content I added: > > > > > > <requestHandler name="/update" > > > > class="solr.XmlUpdateRequestHandler"> > > > > <lst name="defaults"> > > > > <str name="update.chain">langid</str> > > > > </lst> > > > > </requestHandler> > > > > > > > > > > <updateRequestProcessorChain name="langid"> > > > > <processor > > > > > > class="org.apache.solr.update.processor.LangDetectLanguageIdentifierUpdateProcessorFactory"> > > > > > <lst name="defaults"> > > > > <str name="langid.fl">text</str> > > > > <str name="langid.langField">lang</str> > > > > </lst> > > > > </processor> > > > > <processor class="solr.LogUpdateProcessorFactory" /> > > > > <processor class="solr.RunUpdateProcessorFactory" /> > > > > </updateRequestProcessorChain> > > > > > > Now, your suggested query > > > > http://localhost:8080/solr/update?commit=true&update.chain=langidreturns > > > > > > <response> > > > ><lst name="responseHeader"> > > > ><int name="status">0</int> > > > ><int name="QTime">14</int> > > > ></lst> > > > ></response> > > > And there is still no lang field in my documents. > > > Any idea what am I doing wrong? > > > > > > > > > > > > > > > On Tue, Apr 29, 2014 at 5:33 PM, Ahmet Arslan <[hidden > > > email]<http://user/SendEmail.jtp?type=node&node=4134885&i=4>> > wrote: > > > > > > Hi, > > > > > > > >solr/update should be used, not /solr/select > > > > > > > >curl ' > http://localhost:8983/solr/update?commit=true&update.chain=langid > > ' > > > > > > > >By the way don't you have following definition in your > solrconfig.xml? > > > > > > > > <requestHandler name="/update" class="solr.UpdateRequestHandler"> > > > > > > > > <lst name="defaults"> > > > > <str name="update.chain">langid</str> > > > > </lst> > > > > </requestHandler> > > > > > > > > > > > > > > > > > > > >On Tuesday, April 29, 2014 4:50 PM, Victor Pascual < > > > [hidden email] <http://user/SendEmail.jtp?type=node&node=4134885&i=5>> > wrote: > > > >Hi Ahmet, > > > > > > > >thanks for your reply. Adding &update.chain=langid to my query > doesn't > > > >work: IP:8080/solr/select/?q=*%3A*&update.chain=langid > > > >Regarding defining the chain in an UpdateRequestHandler... sorry for > the > > > >lame question but shall I paste those three lines to solrconfig.xml, > or > > > >shall I add them somewhere else? > > > > > > > >There is not UpdateRequestHandler in my solrconfig. > > > > > > > >Thanks! > > > > > > > > > > > > > > > >On Tue, Apr 29, 2014 at 3:13 PM, Ahmet Arslan <[hidden > > > >email]<http://user/SendEmail.jtp?type=node&node=4134885&i=6>> > > > wrote: > > > > > > > >> Hi, > > > >> > > > >> Did you attach your chain to a UpdateRequestHandler? > > > >> > > > >> You can do it by adding &update.chain=langid to the URL or defining > it > > > in > > > >> a defaults section as follows > > > >> > > > >> <lst name="defaults"> > > > >> <str name="update.chain">langid</str> > > > >> </lst> > > > >> > > > >> > > > >> > > > >> On Tuesday, April 29, 2014 3:18 PM, Victor Pascual < > > > >> [hidden email]<http://user/SendEmail.jtp?type=node&node=4134885&i=7>> > wrote: > > > >> Dear all, > > > >> > > > >> I'm a new user of Solr. I've managed to index a bunch of documents > (in > > > >> fact, they are tweets) and everything works quite smoothly. > > > >> > > > >> Nevertheless it looks like Solr doesn't detect the language of my > > > documents > > > >> nor remove stopwords accordingly so I can extract the most frequent > > > terms. > > > >> > > > >> I've added this piece of XML to my solrconfig.xml as well as the > Tika > > > lib > > > >> jars. > > > >> > > > >> <updateRequestProcessorChain name="langid"> > > > >> <processor > > > >> > > > >> > > > > > > class="org.apache.solr.update.processor.LangDetectLanguageIdentifierUpdateProcessorFactory"> > > > > >> <lst name="defaults"> > > > >> <str name="langid.fl">text</str> > > > >> <str name="langid.langField">lang</str> > > > >> </lst> > > > >> </processor> > > > >> <processor class="solr.LogUpdateProcessorFactory" /> > > > >> <processor class="solr.RunUpdateProcessorFactory" /> > > > >> </updateRequestProcessorChain> > > > >> > > > >> There is no error in the tomcat log file, so I have no clue of why > > this > > > >> isn't working. > > > >> Any hint on how to solve this problem will be much appreciated! > > > >> > > > > > > > > > > > > > > > > > > ------------------------------ > If you reply to this email, your message will be added to the discussion > below: > > http://lucene.472066.n3.nabble.com/Solr-does-not-recognize-language-tp4133711p4134885.html > To unsubscribe from Solr does not recognize language, click > here<http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_code&node=4133711&code=ZmluYWx4Y29kZUBnbWFpbC5jb218NDEzMzcxMXwyMDg1ODE1Mzg4> > . > NAML<http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml> > -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-does-not-recognize-language-tp4133711p4134897.html Sent from the Solr - User mailing list archive at Nabble.com.