Solr alternates returning different versions of the same document
I have the following problem with Solr 4.5.1, running as a cloud install with 4 shards, no replication, and the built-in ZooKeeper on one Solr node. I updated a document via the Solr admin console (select a core, then select "Documents"). I used the CSV format to upload the document, including the document ID. When I query for the document ID from the Solr console (simple query: id:"the-id-of-the-doc-I-updated"), I alternately get the old document (with the values before the update and a given _version_ number) or the new document (with the values after the update and a different _version_). There are no log messages in the Solr console about updating the document or anything else. Any idea what might be going on, and how to fix it? Thanks in advance, Yann -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-alternates-returning-different-versions-of-the-same-document-tp4143006.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Solr alternates returning different versions of the same document
Hi Erik, thanks for your answer. I didn't manually assign docs to shards; I indexed all docs on one server, which then assigned them to shards (using the default Solr behaviour, which I believe is based on the document ID). If I understood you correctly, this means the update section of the admin console should be avoided on a sharded install, because it doesn't guarantee that a given document ID will be sent to the same shard as the previous version of the same document? Thanks, Yann
Re: Solr alternates returning different versions of the same document
Hi Erik, thanks. If it helps, I eventually fixed the problem by deleting the documents by ID (via an HTTP request), which apparently deleted all the versions everywhere, and then re-creating the documents via the admin interface (update, CSV). This seems to have left only one version of each document. Yann
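One possible form of the delete-by-ID request mentioned above is an XML message posted to Solr's update handler. The sketch below only builds the URL and the body; it does not send anything. The base URL, the port, the collection name `collection1`, and the class and method names are assumptions made for illustration, and the message does not say which request format was actually used.

```java
// Minimal sketch: build a Solr XML delete-by-id update message.
// Host, port and collection name are placeholders (assumptions).
class DeleteByIdRequest {

    // Escape the characters that are special inside XML text content.
    static String escapeXml(String s) {
        return s.replace("&", "&amp;")
                .replace("<", "&lt;")
                .replace(">", "&gt;");
    }

    // Body of the update message that deletes one document by its ID.
    static String deleteBody(String id) {
        return "<delete><id>" + escapeXml(id) + "</id></delete>";
    }

    // URL of the update handler; commit=true makes the delete visible at once.
    static String updateUrl(String baseUrl, String collection) {
        return baseUrl + "/" + collection + "/update?commit=true";
    }

    public static void main(String[] args) {
        System.out.println("POST " + updateUrl("http://localhost:8983/solr", "collection1"));
        System.out.println("Content-Type: text/xml");
        System.out.println(deleteBody("the-id-of-the-doc-I-updated"));
    }
}
```

The printed URL and body can be sent with any HTTP client; re-adding the document afterwards then leaves a single copy to query.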
Use of solr.ASCIIFoldingFilterFactory
Hi,

I have defined this type in my schema.xml file:

Fields definition:

I imported my documents with DataImportHandler (my original documents are in an RDBMS).

I tested this query string in the Solr web application: all:chateau.
Results (content of the field "all"):
CHATEAU D'AMBOISE
[CHATEAU EN FRANCE, BABELON]
ope dvd rene chateau
CHATEAU DE LA LOIRE
DE CHATEAU EN CHATEAU ENTRE LA LOIRE ET LE CHER
[LE CHATEAU AMBULANT, HAYAO MIYAZAKI]
[Chambres d'hôtes au château, Moreau]
[ARCHIMEDE, LA VIE DE CHATEAU, KRAHENBUHL]
[NEUF, NAISSANCE D UN CHATEAU FORT, MACAULAY]
[ARCHIMEDE, LA VIE DE CHATEAU, KRAHENBUHL]

Now I try this query string: all:château.
No results :(

I don't understand. I expected the second query to return the same results as the first, but it does not.

I use Solr 1.4 (Solr Implementation Version: 1.4.0 833479 - grantingersoll - 2009-11-06 12:33:40).
Java, 32-bit: Java(TM) SE Runtime Environment (build 1.6.0_17-b04)
OS: Windows 7, 64-bit

Regards,
--
Yann
Re: Use of solr.ASCIIFoldingFilterFactory
On Fri, Feb 5, 2010 at 4:00 PM, Ahmet Arslan wrote:

> > I test query this query string on SOLR web
> > application : all:chateau.
> > Results (content of the field "all") :
> > CHATEAU D'AMBOISE
> > [CHATEAU EN FRANCE, BABELON]
> > ope dvd rene chateau
> > CHATEAU DE LA LOIRE
> > DE CHATEAU EN CHATEAU ENTRE LA LOIRE ET LE CHER
> > [LE CHATEAU AMBULANT, HAYAO MIYAZAKI]
> > [Chambres d'hôtes au château, Moreau]
> > [ARCHIMEDE, LA VIE DE CHATEAU, KRAHENBUHL]
> > [NEUF, NAISSANCE D UN CHATEAU FORT, MACAULAY]
> > [ARCHIMEDE, LA VIE DE CHATEAU, KRAHENBUHL]
> >
> > Now i try this query string : all:château.
> > No result :(
> >
> > I don't understand. I think the second query respond the
> > same result of the
> > first query but it is not the case.
>
> Probably château isn't reduced to chateau. You can confirm this with
> /admin/analysis.jsp.
>
> If that's the case you can use:
>
> mapping="mapping-ISOLatin1Accent.txt"/>
>
> If mapping-ISOLatin1Accent.txt does not contain â, you can easily add this
> entry to it. â => a

château is reduced to chateau. I tested it on /admin/analysis.jsp; result:

Index Analyzer: château chateau chateau
Query Analyzer: château chateau chateau

--
Yann
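To make the "â => a" reduction discussed above concrete, here is a minimal sketch of accent folding in plain Java. It is not Solr's ASCIIFoldingFilter or its mapping file. It swaps in a different technique, Unicode decomposition (NFD) followed by removal of combining marks, which gives the same result for the French words in this thread. The class and method names are hypothetical.

```java
import java.text.Normalizer;

// Illustration only: fold accented letters to their base letters.
// This approximates the effect discussed in the thread; it is not Solr code.
class AccentFoldingSketch {

    static String fold(String s) {
        // NFD splits "â" into "a" followed by a combining circumflex (U+0302).
        String decomposed = Normalizer.normalize(s, Normalizer.Form.NFD);
        // Drop every combining mark, keeping only the base characters.
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // \u00e2 is "â"; the escape avoids depending on the source file encoding.
        System.out.println(fold("ch\u00e2teau")); // prints "chateau"
    }
}
```

If the indexed term and the query term both pass through the same folding, `château` and `chateau` end up as the same token, which is what the analysis page output above shows.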
Re: Use of solr.ASCIIFoldingFilterFactory
On Fri, Feb 5, 2010 at 4:53 PM, Ahmet Arslan wrote:

> > Just for your information: since you are using
> > whitespacetokenizer château won't retrieve documents
> > containing (comma) château,
>
> That's the problem. I just saw that your matched (multivalued) field all
> contains chateau; that's why all:chateau is matching. And it has château,
> (with comma), so all:château is not matching. Probably q=all:château, will
> return that document. It is better to use StandardTokenizerFactory in your
> case.

I use StandardTokenizerFactory. No change.

I don't think the problem is with the comma. The comma is added by Solr. In the XML result no comma appears:

Tout voir, tout savoir
Le château féodal
Harris Bruce Dennis
Harris Bruce Dennis
Tout voir, tout savoir
Le château féodal
84907

I did other tests. I typed the URL directly into my browser's address bar:

http://localhost:8080/solr/select/?q=all:château

and I get results! The URL is now:

http://localhost:8080/solr/select/?q=all:ch%E2teau

The character â is replaced by %E2. I use Firefox 3.6. I tried it with IE, same result.

Then I tested with SolrJ. Java code:

    SolrServer server = getServer();
    SolrQuery query = new SolrQuery();
    query.setQuery("all:château");
    QueryResponse rsp = server.query(query);
    SolrDocumentList docs = rsp.getResults();
    // Print the "all" field of every matching document.
    for (SolrDocument solrDocument : docs) {
        System.out.println(" " + solrDocument.getFieldValue("all"));
    }

No results.

--
Yann
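The %E2 seen in the browser URL is the ISO-8859-1 encoding of â, whereas a client that encodes the query as UTF-8 sends %C3%A2 for the same character. The sketch below shows both encodings side by side using only the JDK. The class and method names are hypothetical, and it assumes (without this being stated in the message) that the client returning no results encodes its parameters as UTF-8.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

// Illustration: the same query term percent-encoded in two charsets.
class QueryEncodingDemo {

    static String encode(String term, String charset) {
        try {
            return URLEncoder.encode(term, charset);
        } catch (UnsupportedEncodingException e) {
            // Both charsets used here are guaranteed to exist on every JVM.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // \u00e2 is "â"; the escape avoids depending on the source file encoding.
        String term = "ch\u00e2teau";
        System.out.println(encode(term, "UTF-8"));      // prints "ch%C3%A2teau"
        System.out.println(encode(term, "ISO-8859-1")); // prints "ch%E2teau"
    }
}
```

A server that decodes the query string with one charset will only understand requests encoded with that same charset, which would be consistent with one client getting results and the other getting none.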
Re: Use of solr.ASCIIFoldingFilterFactory
Hello,

Thanks, your response solved my problem. Thanks for everything.

On Sun, Feb 7, 2010 at 4:00 PM, Sven Maurmann wrote:

> Hi,
>
> you might have run into an encoding problem. If you use Tomcat as
> the container for Solr you should probably consult the following
>
> http://wiki.apache.org/solr/SolrTomcat#URI_Charset_Config
>
> Cheers,
> Sven

--
Yann
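The encoding problem described above can be reproduced on the decoding side with the JDK alone. Older Tomcat versions decode the request URI as ISO-8859-1 unless the connector is configured otherwise (the `URIEncoding` attribute covered by the linked wiki section), so a UTF-8 encoded query term arrives garbled. This is a minimal sketch of that mismatch, not Tomcat's or Solr's code; the class and method names are hypothetical.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;

// Illustration: decode a UTF-8 percent-encoded term with two charsets.
class UriDecodingDemo {

    static String decode(String encoded, String charset) {
        try {
            return URLDecoder.decode(encoded, charset);
        } catch (UnsupportedEncodingException e) {
            // Both charsets used here are guaranteed to exist on every JVM.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String sent = "ch%C3%A2teau"; // "château" encoded as UTF-8

        // Decoded with the matching charset: the original word comes back.
        System.out.println(decode(sent, "UTF-8").equals("ch\u00e2teau")); // prints "true"

        // Decoded as ISO-8859-1: each UTF-8 byte becomes its own character,
        // giving "ch" + U+00C3 + U+00A2 + "teau", which matches nothing in the index.
        System.out.println(decode(sent, "ISO-8859-1").equals("ch\u00c3\u00a2teau")); // prints "true"
    }
}
```

Once the container decodes URIs as UTF-8, the term that reaches Solr is `château` again and the query analyzer can fold it as usual.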