SolrCloud and configuration files on ZooKeeper

2015-06-08 Thread mattia.martine...@gmail.com
Hi.
I set up a SolrCloud cluster with 3 nodes and configuration files on
ZooKeeper.
I read the document at
https://cwiki.apache.org/confluence/display/solr/Using+ZooKeeper+to+Manage+Configuration+Files
and saw that the configuration files are stored in ZooKeeper, of course.
But now I have to update my custom schema.xml, and I cannot figure out how
to download the current config set to update it.
Could you help me please?
Thank you very much!
Bye


Re: SolrCloud and configuration files on ZooKeeper

2015-06-08 Thread mattia.martine...@gmail.com
2015-06-08 21:42 GMT+02:00 Erick Erickson :

> See "downconfig" here:
> https://cwiki.apache.org/confluence/display/solr/Command+Line+Utilities


Thank you Erick for your answer!
I've already seen that page, and I tried the following command:

root@index1:/opt/zookeeper/bin# ./zkCli.sh -z zk1 -cmd downconfig -confdir /tmp/zkconfig/
(zk1 is the first node of the ZooKeeper cluster.)

But no files are written to /tmp/zkconfig, and I see this output:

root@index1:/opt/zookeeper/bin# ./zkCli.sh -z zk1 -cmd downconfig -confdir /tmp/zkconfig/
Connecting to localhost:2181
2015-06-08 15:44:55,952 [myid:] - INFO  [main:Environment@100] - Client environment:zookeeper.version=3.4.6-1569965, built on 02/20/2014 09:09 GMT
2015-06-08 15:44:55,955 [myid:] - INFO  [main:Environment@100] - Client environment:host.name=index1.dc.ems.network
2015-06-08 15:44:55,955 [myid:] - INFO  [main:Environment@100] - Client environment:java.version=1.8.0_45
2015-06-08 15:44:55,957 [myid:] - INFO  [main:Environment@100] - Client environment:java.vendor=Oracle Corporation
2015-06-08 15:44:55,957 [myid:] - INFO  [main:Environment@100] - Client environment:java.home=/usr/lib/jvm/java-8-oracle/jre
2015-06-08 15:44:55,958 [myid:] - INFO  [main:Environment@100] - Client environment:java.class.path=/opt/zookeeper/bin/../build/classes:/opt/zookeeper/bin/../build/lib/*.jar:/opt/zookeeper/bin/../lib/slf4j-log4j12-1.6.1.jar:/opt/zookeeper/bin/../lib/slf4j-api-1.6.1.jar:/opt/zookeeper/bin/../lib/netty-3.7.0.Final.jar:/opt/zookeeper/bin/../lib/log4j-1.2.16.jar:/opt/zookeeper/bin/../lib/jline-0.9.94.jar:/opt/zookeeper/bin/../zookeeper-3.4.6.jar:/opt/zookeeper/bin/../src/java/lib/*.jar:/opt/zookeeper/bin/../conf:
2015-06-08 15:44:55,958 [myid:] - INFO  [main:Environment@100] - Client environment:java.library.path=/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
2015-06-08 15:44:55,958 [myid:] - INFO  [main:Environment@100] - Client environment:java.io.tmpdir=/tmp
2015-06-08 15:44:55,958 [myid:] - INFO  [main:Environment@100] - Client environment:java.compiler=
2015-06-08 15:44:55,958 [myid:] - INFO  [main:Environment@100] - Client environment:os.name=Linux
2015-06-08 15:44:55,958 [myid:] - INFO  [main:Environment@100] - Client environment:os.arch=amd64
2015-06-08 15:44:55,958 [myid:] - INFO  [main:Environment@100] - Client environment:os.version=2.6.32-39-pve
2015-06-08 15:44:55,958 [myid:] - INFO  [main:Environment@100] - Client environment:user.name=root
2015-06-08 15:44:55,959 [myid:] - INFO  [main:Environment@100] - Client environment:user.home=/root
2015-06-08 15:44:55,959 [myid:] - INFO  [main:Environment@100] - Client environment:user.dir=/opt/zookeeper-3.4.6/bin
2015-06-08 15:44:55,960 [myid:] - INFO  [main:ZooKeeper@438] - Initiating client connection, connectString=localhost:2181 sessionTimeout=3 watcher=org.apache.zookeeper.ZooKeeperMain$MyWatcher@3eb07fd3

> I'm guessing you started your cloud with one of the examples (i.e. -e
> cloud). Under the covers, it did an "upconfig" to put the configs up
> in ZK in the first place; this is the inverse.

I created a new collection with the following Solr command:

./solr create_collection -c test -replicationFactor 3

I also tried to list the configuration in the ZooKeeper CLI:

[zk: 127.0.0.1:2181(CONNECTED) 4] ls /configs/test
[currency.xml, protwords.txt, managed-schema, solrconfig.xml, synonyms.txt,
elevate.xml, stopwords.txt, params.json, lang]

But... Where is the schema.xml file?


Re: SolrCloud and configuration files on ZooKeeper

2015-06-08 Thread mattia.martine...@gmail.com
2015-06-08 22:09 GMT+02:00 Shalin Shekhar Mangar :

> Looks like you are using managed-schema (better known as the schemaless
> mode).


Thank you very much Shalin and Erick!
I had never used schemaless mode before.
Rather than edit the managed schema, I uploaded my own custom schema.
Thank you very much again!
Bye
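
A note on the command shown earlier in this thread: /opt/zookeeper/bin/zkCli.sh
is ZooKeeper's own interactive client, which is a different tool from the
zkcli.sh script that ships with Solr and provides the downconfig and upconfig
commands. That would likely explain why the log shows a connection to
localhost:2181 and no files appear in /tmp/zkconfig. Below is a minimal sketch
of the round trip with Solr's script. The install path /opt/solr, the address
zk1:2181, and the two helper function names are assumptions for illustration;
the config set name "test" comes from the "ls /configs/test" listing above.

```shell
# Assumption: Solr 5.x is unpacked under /opt/solr; adjust to your install.
ZKCLI="${ZKCLI:-/opt/solr/server/scripts/cloud-scripts/zkcli.sh}"
# Assumption: ZooKeeper listens on the default client port on host zk1.
ZKHOST="${ZKHOST:-zk1:2181}"

# Download the config set named $1 from ZooKeeper into local directory $2.
download_config() {
    "$ZKCLI" -zkhost "$ZKHOST" -cmd downconfig -confname "$1" -confdir "$2"
}

# Upload local directory $2 back to ZooKeeper as the config set named $1.
upload_config() {
    "$ZKCLI" -zkhost "$ZKHOST" -cmd upconfig -confname "$1" -confdir "$2"
}
```

For example, "download_config test /tmp/zkconfig" would fetch the files, and
after editing them "upload_config test /tmp/zkconfig" would push them back. The
collection usually has to be reloaded (Collections API, action=RELOAD) before
the new configuration takes effect.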


Word recognised in a search

2012-05-04 Thread mattia.martine...@gmail.com
Hi.

I'm running searches with Apache Solr 1.4, but I will upgrade to 3.6.

When Solr uses stemming, it is very difficult to know which words were
actually matched (for example, if I search for "ups", Solr finds "up"
too).
I need to know this because I have to highlight the matched words in the
text, and I have to extract some strings from the source using those
words.

I hope I managed to explain my problem well :-)

Could you help me, please?

Thank you very much!
Bye.
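
Solr's highlighting component is one way to approach this: with hl=true the
response includes snippets of the stored field text in which the terms that
actually matched (after stemming) are wrapped in marker strings, so the matched
surface forms can be read back from the snippets. The sketch below only builds
such a request URL. The base URL, the field name "text", and the function name
are assumptions for illustration, and the field has to be stored for
highlighting to return anything.

```shell
# Assumption: a single-core Solr at this address; adjust host, port and path.
SOLR_SELECT_URL="${SOLR_SELECT_URL:-http://localhost:8983/solr/select}"

# Print a select URL that enables highlighting on field $2 for query term $1.
# hl.simple.pre / hl.simple.post set the markers placed around each match;
# here they are "[[" and "]]", URL-encoded. The term itself is not encoded,
# so this sketch is only suitable for simple single-word terms.
highlight_url() {
    printf '%s?q=%s:%s&hl=true&hl.fl=%s&hl.simple.pre=%s&hl.simple.post=%s\n' \
        "$SOLR_SELECT_URL" "$2" "$1" "$2" '%5B%5B' '%5D%5D'
}
```

Something like curl "$(highlight_url ups text)" would then return a
highlighting section alongside the normal results, and the words between the
markers are the ones that were really matched.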


Understanding SOLR search results

2012-08-27 Thread mattia.martine...@gmail.com
Hi.
I get some strange results for one query from SOLR.

This is an example query:


(titolo:trenti OR sommario:trenti OR occhiello:trenti OR testo:trenti)


In the results I have this document:



score:       6.5818048
id:          503af94e0c342
occhiello:   IL PROGETTO..  (no word "tren" in "occhiello" field)
sottotitolo: C'è la concessione edilizia. Gli islamici...  (no word "tren" in "sottotitolo" field)
testo:       La fine del ramadan, pochi giorni fa...  (no word "tren" in "testo" field)
titolo:      Moschea in viale.  (no word "tren" in "titolo" field)



This document does not have the word "tren" in any of the fields
titolo, occhiello or testo, but:





So, as far as I can see, this document was selected because of the field
"id". This is the debug output:



6.5818048 = (MATCH) sum of:
  6.3718405 = (MATCH) weight(id:503af94e0c342 in 48107), product of:
    0.57440555 = queryWeight(id:503af94e0c342), product of:
      11.09293 = idf(docFreq=1, maxDocs=48343)
      0.05178123 = queryNorm
    11.09293 = (MATCH) fieldWeight(id:503af94e0c342 in 48107), product of:
      1.0 = tf(termFreq(id:503af94e0c342)=1)
      11.09293 = idf(docFreq=1, maxDocs=48343)
      1.0 = fieldNorm(field=id, doc=48107)
  0.20996419 = (MATCH) product of:
    0.83985674 = (MATCH) sum of:
      0.83985674 = (MATCH) weight(titolo:trent in 48107), product of:
        0.34054396 = queryWeight(titolo:trent), product of:
          6.5765905 = idf(docFreq=182, maxDocs=48343)
          0.05178123 = queryNorm
        2.4662213 = (MATCH) fieldWeight(titolo:trent in 48107), product of:
          1.0 = tf(termFreq(titolo:trent)=1)
          6.5765905 = idf(docFreq=182, maxDocs=48343)
          0.375 = fieldNorm(field=titolo, doc=48107)
    0.25 = coord(1/4)
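
Reading the explain output: the total is the sum of two clauses. The id clause
contributes queryWeight * fieldWeight = 0.57440555 * 11.09293. The titolo:trent
clause contributes 0.34054396 * 2.4662213, scaled by coord(1/4) = 0.25 because
one of four optional clauses matched. The matching term in the output is
"trent", not "tren", which suggests the analyzer reduced the query term
"trenti" to "trent". A small sketch to re-check the arithmetic with awk (the
function name is made up; the numbers are copied from the output above):

```shell
# Recompute the document score from the figures in the explain output.
explain_score() {
    awk 'BEGIN {
        id_clause     = 0.57440555 * 11.09293    # queryWeight * fieldWeight for id
        titolo_clause = 0.34054396 * 2.4662213   # queryWeight * fieldWeight for titolo:trent
        coord         = 1 / 4                    # 1 of 4 optional clauses matched
        printf "%.4f\n", id_clause + titolo_clause * coord
    }'
}
```

Calling explain_score prints the recomputed total rounded to four decimal
places, which can be compared with the reported score of 6.5818048. Almost all
of the score comes from the id clause, which has idf 11.09293 because only one
document contains that id.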



The field "id" is an indexed string field:



Could you help me understand this behavior, please?

Thank you very much!
Bye,
Mattia.


Re: Understanding SOLR search results

2012-08-27 Thread mattia.martine...@gmail.com
> Maybe you have a synonym in the title field? Or maybe some stemming anomaly?

The complete title is "Moschea in viale Trento, partono i lavori", so
"Trent" would be a substring of the word "Trento".
But if I search for "Mos" or "lavo", I don't get this result, so I
don't understand why "Trent" is matched as a subword while "Mos" and
"lavo" are not.

Do you have any idea?


Re: Understanding SOLR search results

2012-08-27 Thread mattia.martine...@gmail.com
2012/8/27 Mike Schultz :
> Can you include the entire text for only the titolo field?

The entire text for the titolo field is "Moschea in viale Trento,
partono i lavori".

I tried to change the type of the titolo field from text to textgen,
and now it does not match.

I think it is a stemming problem, but I cannot use
"KeywordMarkerFilter" for every wrongly stemmed word, because I cannot
predict how many there are.