SolrCloud and configuration files on ZooKeeper
Hi. I setted up a SolrCloud cluster with 3 nodes and configuration files on Zookeeper. I read the document at https://cwiki.apache.org/confluence/display/solr/Using+ZooKeeper+to+Manage+Configuration+Files and saw that configuration files are stored on ZooKeeper of course. But now I have to update my custom schema.xml and I cannot realize how I can download the current config set to update it. Could you help me please? Thank you very much! Bye
Re: SolrCloud and configuration files on ZooKeeper
2015-06-08 21:42 GMT+02:00 Erick Erickson : > See "downconfig" here: > https://cwiki.apache.org/confluence/display/solr/Command+Line+Utilities Thank you Erick for your answer! I've already seen that page, and I was trying with the following command: root@index1:/opt/zookeeper/bin# ./zkCli.sh -z zk1 -cmd downconfig -confdir /tmp/zkconfig/ (zk1 is the first node of the Zookeeper's cluster). But no files are written in /tmp/zkconfig and I see this output: root@index1:/opt/zookeeper/bin# ./zkCli.sh -z zk1 -cmd downconfig -confdir /tmp/zkconfig/ Connecting to localhost:2181 2015-06-08 15:44:55,952 [myid:] - INFO [main:Environment@100] - Client environment:zookeeper.version=3.4.6-1569965, built on 02/20/2014 09:09 GMT 2015-06-08 15:44:55,955 [myid:] - INFO [main:Environment@100] - Client environment:host.name=index1.dc.ems.network 2015-06-08 15:44:55,955 [myid:] - INFO [main:Environment@100] - Client environment:java.version=1.8.0_45 2015-06-08 15:44:55,957 [myid:] - INFO [main:Environment@100] - Client environment:java.vendor=Oracle Corporation 2015-06-08 15:44:55,957 [myid:] - INFO [main:Environment@100] - Client environment:java.home=/usr/lib/jvm/java-8-oracle/jre 2015-06-08 15:44:55,958 [myid:] - INFO [main:Environment@100] - Client environment:java.class.path=/opt/zookeeper/bin/../build/classes:/opt/zookeeper/bin/../build/lib/*.jar:/opt/zookeeper/bin/../lib/slf4j-log4j12-1.6.1.jar:/opt/zookeeper/bin/../lib/slf4j-api-1.6.1.jar:/opt/zookeeper/bin/../lib/netty-3.7.0.Final.jar:/opt/zookeeper/bin/../lib/log4j-1.2.16.jar:/opt/zookeeper/bin/../lib/jline-0.9.94.jar:/opt/zookeeper/bin/../zookeeper-3.4.6.jar:/opt/zookeeper/bin/../src/java/lib/*.jar:/opt/zookeeper/bin/../conf: 2015-06-08 15:44:55,958 [myid:] - INFO [main:Environment@100] - Client environment:java.library.path=/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib 2015-06-08 15:44:55,958 [myid:] - INFO [main:Environment@100] - Client environment:java.io.tmpdir=/tmp 2015-06-08 15:44:55,958 [myid:] - INFO [main:Environment@100] - Client environment:java.compiler= 2015-06-08 15:44:55,958 [myid:] - INFO [main:Environment@100] - Client environment:os.name=Linux 2015-06-08 15:44:55,958 [myid:] - INFO [main:Environment@100] - Client environment:os.arch=amd64 2015-06-08 15:44:55,958 [myid:] - INFO [main:Environment@100] - Client environment:os.version=2.6.32-39-pve 2015-06-08 15:44:55,958 [myid:] - INFO [main:Environment@100] - Client environment:user.name=root 2015-06-08 15:44:55,959 [myid:] - INFO [main:Environment@100] - Client environment:user.home=/root 2015-06-08 15:44:55,959 [myid:] - INFO [main:Environment@100] - Client environment:user.dir=/opt/zookeeper-3.4.6/bin 2015-06-08 15:44:55,960 [myid:] - INFO [main:ZooKeeper@438] - Initiating client connection, connectString=localhost:2181 sessionTimeout=3 watcher=org.apache.zookeeper.ZooKeeperMain$MyWatcher@3eb07fd3 I'm guessing you started your cloud with one of the example (i.e. -e > cloud). Under the covers, > it did an "upconfig" to put the configs up in ZK in the first place, > this is the inverse. > I created a new collection with the following SOLR command: ./solr create_collection -c test -replicationFactor 3 I also tried to list the configuration in the ZooKeeper's CLI: [zk: 127.0.0.1:2181(CONNECTED) 4] ls /configs/test [currency.xml, protwords.txt, managed-schema, solrconfig.xml, synonyms.txt, elevate.xml, stopwords.txt, params.json, lang] But... Where is the schema.xml file?
Re: SolrCloud and configuration files on ZooKeeper
2015-06-08 22:09 GMT+02:00 Shalin Shekhar Mangar : > Looks like you are using managed-schema (better known as the schemaless > mode). Thank you very much Shalin and Erick! I've never used schemaless before. I am not editing the managed schema, and I uploaded my own custom schema. Thank you very much again! Bye
Word recognised in a search
Hi. I'm making some searches using Apache SOLR 1.4, but I will upgrade to 3.6. When SOLR uses stemming, it is very difficult to know what are the words that are really found (for example, if I search "ups" SOLR find "up" too). I need to know that because I need to highlight founded words in the text, and I need to extract some strings from the source using that words. I hope I managed in explain my problem well :-) Could you help me, please? Thank you very much! Bye.
Understanding SOLR search results
Hi. I get some strange results for one query from SOLR. This is an example query: (titolo:trenti OR sommario:trenti OR occhiello:trenti OR testo:trenti) In the results I have this document: 6.5818048 503af94e0c342 IL PROGETTO.. (no word "tren" in "occhiello" field) C'รจ la concessione edilizia. Gli islamici... (no word "tren" in "sottotitolo" field). La fine del ramadan, pochi giorni fa... (no word "tren" in "testo" field). Moschea in viale. (no word "tren" in "titolo" field). This document does not have the word "tren" in any of the fields titolo, occhiello or testo, but: So, as I can see, this document was selected for the field "id". This is the debug: 6.5818048 = (MATCH) sum of: 6.3718405 = (MATCH) weight(id:503af94e0c342 in 48107), product of: 0.57440555 = queryWeight(id:503af94e0c342), product of: 11.09293 = idf(docFreq=1, maxDocs=48343) 0.05178123 = queryNorm 11.09293 = (MATCH) fieldWeight(id:503af94e0c342 in 48107), product of: 1.0 = tf(termFreq(id:503af94e0c342)=1) 11.09293 = idf(docFreq=1, maxDocs=48343) 1.0 = fieldNorm(field=id, doc=48107) 0.20996419 = (MATCH) product of: 0.83985674 = (MATCH) sum of: 0.83985674 = (MATCH) weight(titolo:trent in 48107), product of: 0.34054396 = queryWeight(titolo:trent), product of: 6.5765905 = idf(docFreq=182, maxDocs=48343) 0.05178123 = queryNorm 2.4662213 = (MATCH) fieldWeight(titolo:trent in 48107), product of: 1.0 = tf(termFreq(titolo:trent)=1) 6.5765905 = idf(docFreq=182, maxDocs=48343) 0.375 = fieldNorm(field=titolo, doc=48107) 0.25 = coord(1/4) The field "id" is an indexed string field: Could you help me to understand this behavior well, please? Thank you very much! Bye, Mattia.
Re: Understanding SOLR search results
> Maybe you have a synonym in the title field? Or maybe some stemming anomaly? The complete title is "Moschea in viale Trento, partono i lavori", so "Trent" should be a substring of the word "Trento". But if I search for "Mos" or "lavo", I don't have this result, so I don't understand why "Trent" is taken as a subword, and "Mos" and "lavo" are not. Do you have any idea?
Re: Understanding SOLR search results
2012/8/27 Mike Schultz : > Can you include the entire text for only the titolo field? The entire text for the titolo field is "Moschea in viale Trento, partono i lavori". I tried to change the type of the titolo field from text to textgen, and now it does not match. I think it is a stemming problem, but I cannot use "KeywordMarkerFilter" for every wrong-stemmed word, because I cannot suppose how much they are.