Re: Cache problem

2016-04-12 Thread Bastien Latard - MDPI AG
Thank you Shawn & Reth! So I have now some questions, again Remind: I have only Solr running on this server (i.e.: java + tomcat). /BTW: I needed to increase previously the java heap size because I went out of memory. Actually, you only see here 2Gb (8Gb previously) for JVM because I automat

Re: SOLR Upgrade 3.x to 4.10

2016-04-12 Thread abhi Abhishek
Thanks Erick and Shawn for the input. it makes more sense to move to SOLR 5.x but we would like to get there in few iterations gradually making incremental changes to have a smooth cut over. our index size is 3TB (10 shards of 300G each), i was looking for a alternate route which would save me fro

Re: Trim trailing whitespaces

2016-04-12 Thread Ahmet Arslan
Hi Srinivas, Using TrimFieldUpdateProcessorFactory http://lucene.apache.org/solr/5_5_0/solr-core/org/apache/solr/update/processor/TrimFieldUpdateProcessorFactory.html Ahmet On Wednesday, April 13, 2016 8:41 AM, Srinivas Kashyap wrote: Hi, When i index the data, the data is coming with trai

Trim trailing whitespaces

2016-04-12 Thread Srinivas Kashyap
Hi, When i index the data, the data is coming with trailing whitespaces. How should i remove them? In schema.xml fieldtype for below fields are "string". Please suggest. "response": { "numFound": 40327, "start": 0, "docs": [ { "TECHSPEC.REQUEST_NO": "HQ22

Re: SOLR Upgrade 3.x to 4.10

2016-04-12 Thread Shawn Heisey
On 4/12/2016 6:10 AM, abhi Abhishek wrote: > I have SOLR 3.6 running currently, i am planning to upgrade this to > SOLR 4.10. Below were the thoughts we could come up with. > > 1. in place upgrade >I would be making the SOLR 4.10 slave of 3.6 and copy the indexes, > and optimize this in

Re: Cache problem

2016-04-12 Thread Shawn Heisey
On 4/12/2016 3:35 AM, Bastien Latard - MDPI AG wrote: > Thank you both, Bill and Reth! > > Here is my current options from my command to launch java: > */usr/bin/java -Xms20480m -Xmx40960m -XX:PermSize=10240m > -XX:MaxPermSize=20480m [...]* > > So should I do *-Xms20480m -Xmx20480m*? > Why? What w

Re: Arguments for and against putting solr.xml into Zookeeper?

2016-04-12 Thread Shawn Heisey
On 4/12/2016 2:20 PM, John Bickerstaff wrote: > I'm wondering if anyone can comment on arguments for and against putting > solr.xml into Zookeeper? > > I assume one argument for doing so is that I would then have all > configuration in one place. > > I also assume that if it doesn't get included as

Re: Arguments for and against putting solr.xml into Zookeeper?

2016-04-12 Thread Alexandre Rafalovitch
The relevant JIRA is SOLR-7735 and its references. Maybe that would be useful as the background. Regards, Alex. Newsletter and resources for Solr beginners and intermediates: http://www.solr-start.com/ On 13 April 2016 at 06:20, John Bickerstaff wrote: > Hello all, > > I'm wondering if

Re: SOLR Upgrade 3.x to 4.10

2016-04-12 Thread Erick Erickson
I would always re-index if possible, it's more certain than upgrading the indexes. It's "not possible" when it takes very long And why go for 4.10 rather than 5.5 (Note, 5.5.1 will be out Real Soon Now). If you can re-index, I'd really think about upgrading to 5.5.1 and going from there. Best

Re: Bad Request

2016-04-12 Thread Erick Erickson
The Solr logs themselves may give you a better error message. Best, Erick On Tue, Apr 12, 2016 at 6:37 AM, Robert Brown wrote: > Hi, > > My collection had issues earlier, 1 shard showed as Down, the other only > replica was Gone. > > Both were actually still up and running, no disk or CPU issues

Re: Indexing date data for facet search

2016-04-12 Thread Erick Erickson
It may not have made it into the schemas, so just try adding this to your schema file: As far as adding the 00Z, I would to be safe. Best, Erick On Tue, Apr 12, 2016 at 6:57 AM, Steven White wrote: > Hi Erick, > > In Solr's schema.xml, I cannot find for "dateRange", not even > on Apache Solr

Re: Which line is solr following in terms of a BI Tool?

2016-04-12 Thread Erick Erickson
The unsatisfactory answer is that the have different characteristics. The analytics contrib does not work in distributed mode. It's not receiving a lot of love at this point. The JSON facets are estimations. Generally very close but are not guaranteed to be 100% accurate. The variance, as I under

Re: Arguments for and against putting solr.xml into Zookeeper?

2016-04-12 Thread Erick Erickson
upconfig is for _configurations_. Each collection can use one of the configurations. Solr.xml is configuration for the entire Solr instance so it doesn't make sense for it to be part of upconfig. There's certainly room for something explicit to upload it separate from configsets though... Best,

Question regarding empty UUID field

2016-04-12 Thread Susmit Shukla
Hi, I have configured solr schema to generate unique id for a collection using UUIDUpdateProcessorFactory I am seeing a peculiar behavior - if the unique 'id' field is explicitly set as empty string in the SolrInputDocument, the document gets indexed. I can see in the solr query console a good uu

Arguments for and against putting solr.xml into Zookeeper?

2016-04-12 Thread John Bickerstaff
Hello all, I'm wondering if anyone can comment on arguments for and against putting solr.xml into Zookeeper? I assume one argument for doing so is that I would then have all configuration in one place. I also assume that if it doesn't get included as part of the upconfig command, there is likely

Re: boost parent fields BlockJoinQuery

2016-04-12 Thread Mikhail Khludnev
Giving the error message you undercopypasted search query and and omit the closing bracket. On Tue, Apr 12, 2016 at 3:30 PM, michael solomon wrote: > Thanks, > when I'm trying: > city:"walla walla"^10 {!parent which="is_parent:true" > score=max}(normal_text:walla) > I get: > > > "msg": "org.apac

Re: [More Like This] Query building

2016-04-12 Thread Scott Stults
Hi Alessandro, It's not uncommon for Solr patches to remain uncommitted for months, even years. In fact some never get merged. Don't let that discourage you! k/r, Scott On Fri, Mar 11, 2016 at 11:49 AM, Alessandro Benedetti < abenede...@apache.org> wrote: > I start to feel that is not that eas

Re: Solr slave is doing full replication (entire index) of index after master restart

2016-04-12 Thread Lior Sapir
So what do you say: Is it a problem in my environment + confs OR That's how the replication is working (if a slave fails to locate the master when polling then next time the master is available it will replicate the entire index even if no document was added to the master and no optimization was

Re: SolrCloud Config file

2016-04-12 Thread Sam Xia
Thanks you Shawn and Erick. It turns out there is a get-pip.py file in the configuration folder (the config file was copied from somewhere), which caused the mis-behave. After get-pip.py is removed, everything worked as expected. Thanks Again. On 4/11/16, 8:40 PM, "Erick Erickson" wrote:

Re: Solrj API for Managed Resources

2016-04-12 Thread iambest
Thanks for your reply, sorry if I wasn't clear. But, I am looking for a solrj client API to make my life easier when dealing with Managed Resources. solrj has a client for schema API (SchemaRequest), but, it doesn't handle Managed Resources. -- View this message in context: http://lucene.472066

Solr cloud newSearcher warmup

2016-04-12 Thread Simone Sabba
Hi, i just configured cache autowarm into solrconfig.xml using newSearcher listener (as described here: https://wiki.apache.org/solr/SolrCaching#newSearcher_and_firstSearcher_Event_Listeners ). It works fi

Re: Solr 6 - AbstractSolrTestCase Error Unable to build KeyStore from file: null

2016-04-12 Thread Joe Lawson
Adding @SolrTestCaseJ4.SuppressSSL to my abstract class extended the AbstractSolrTestCase worked. Thanks! https://github.com/healthonnet/hon-lucene-synonyms/blob/cedb3cbb56b01cd6480c257c04999cdce433f53e/src/test/java/org/apache/solr/search/HonLuceneSynonymTestCase.java#L21-L21 On Mon, Apr 11, 201

Re: Shard ranges seem incorrect

2016-04-12 Thread Shawn Heisey
On 4/12/2016 5:49 AM, Markus Jelsma wrote: > Hi - i've just created a 3 shard 3 replica collection on Solr 6.0.0 and we > noticed something odd, the hashing ranges don't make sense (full state.json > below): > shard1 Range: 8000-d554 > shard2 Range: d555-2aa9 > shard3 Range: 2aaa0

Re: Solr Sharding Strategy

2016-04-12 Thread Shawn Heisey
On 4/11/2016 6:31 AM, Bhaumik Joshi wrote: > We are using solr 5.2.0 and we have Index-heavy (100 index updates per > sec) and Query-heavy (100 queries per sec) scenario. > > *Index stats: *10 million documents and 16 GB index size > > > > Which sharding strategy is best suited in above scenario?

Which line is solr following in terms of a BI Tool?

2016-04-12 Thread Pablo
Hello, I think this topic is important for solr users that are planning to use solr as a BI Tool. Speaking about facets, nowadays there are three majors way of doing (more or less) the same in solr. First, you have the pivot facets, on the other hand you have the Analytics component and finally

Re: Indexing date data for facet search

2016-04-12 Thread Steven White
Hi Erick, In Solr's schema.xml, I cannot find for "dateRange", not even on Apache Solr Reference guide [1]. What am I missing? I'm on Solr 5.2.1. Also, since my date data doesn't have seconds, can I leave ".ssZ" out or must I supply it with "00"? Thanks Steve [1] https://cwiki.apache.org/co

JSON facet raw HLL as result

2016-04-12 Thread sudsport s
is it possible to get raw HLL object as result of json facet instead of getting cardinality? I tried to build custom json facet to return raw value as external Jar but attempt was unsuccessful as JSON facet has some classes with default scope and I get IllegalAccessException (RunTimeException) if

Re: Pivot facets - distributed search - request

2016-04-12 Thread Yonik Seeley
On Tue, Apr 12, 2016 at 8:47 AM, Pablo wrote: > Hi, > Is there any way of requesting limit 10 order by a stat within facet pivot? No. > I know that the "json facet" component can do this and it has a very > comphrehensive api, but it has a problem of consistency (refinement) when > querying acro

Bad Request

2016-04-12 Thread Robert Brown
Hi, My collection had issues earlier, 1 shard showed as Down, the other only replica was Gone. Both were actually still up and running, no disk or CPU issues. This occurred during updates. The server since recovered after a reboot. Upon trying to update the index again, I'm now getting cons

Re: Cache problem

2016-04-12 Thread Reth RM
This has answers about why giving enough memory to OS is important: https://wiki.apache.org/solr/SolrPerformanceProblems#OS_Disk_Cache And as per solr admin dashboard, the os cache (physical memory is almost utilized where as memory allocated to jvm is not used) so its best to lower jvm memory. Why

Pivot facets - distributed search - request

2016-04-12 Thread Pablo
Hi, Is there any way of requesting limit 10 order by a stat within facet pivot? I know that the "json facet" component can do this and it has a very comphrehensive api, but it has a problem of consistency (refinement) when querying across multiple shards. And given that pivot facets supports distr

Re: boost parent fields BlockJoinQuery

2016-04-12 Thread michael solomon
Thanks, when I'm trying: city:"walla walla"^10 {!parent which="is_parent:true" score=max}(normal_text:walla) I get: > "msg": "org.apache.solr.search.SyntaxError: Cannot parse > '(normal_text:walla': Encountered \"\" at line 1, column 18.\nWas > expecting one of:\n ...\n ...\n ...\n

Re: Curious case of DataSource.getConnection()

2016-04-12 Thread Shalin Shekhar Mangar
What is this Solr scheduler class? Is that your own custom code? None of the information or code snippets in your email related to a Solr problem. I guess you are looking to troubleshoot a DB connectivity problem and it would be better to ask this on stackoverflow. On Tue, Apr 12, 2016 at 4:01 PM,

Re: EmbeddedSolr for unit tests in Solr 6

2016-04-12 Thread Shalin Shekhar Mangar
Rohana, as I said earlier, the MiniSolrCloudCluster is specifically made for your use-case i.e. where you want to quickly setup a SolrCloud cluster in your own application for testing. It is available in the solr-test-framework artifact. On Tue, Apr 12, 2016 at 4:31 PM, Rohana Rajapakse < rohana.r

SOLR Upgrade 3.x to 4.10

2016-04-12 Thread abhi Abhishek
Hi All, I have SOLR 3.6 running currently, i am planning to upgrade this to SOLR 4.10. Below were the thoughts we could come up with. 1. in place upgrade I would be making the SOLR 4.10 slave of 3.6 and copy the indexes, and optimize this index. will optimizing the Lucene 3.3 ind

Shard ranges seem incorrect

2016-04-12 Thread Markus Jelsma
Hi - i've just created a 3 shard 3 replica collection on Solr 6.0.0 and we noticed something odd, the hashing ranges don't make sense (full state.json below): shard1 Range: 8000-d554 shard2 Range: d555-2aa9 shard3 Range: 2aaa-7fff We've also noticed ranges not going from

RE: EmbeddedSolr for unit tests in Solr 6

2016-04-12 Thread Rohana Rajapakse
Please note that I am not writing unit tests for testing classes in Solr. I need a temporary Solr index to test classes in my own application that needs a Solr index. I would like to use classes that are available in solr-core and solr-solrj jars. I could do this easily in solr-4.x versions usin

Curious case of DataSource.getConnection()

2016-04-12 Thread Srinivas Kashyap
Hi, In a Solr scheduler class which runs every 'n' interval of seconds, i'm polling a database table to do some custom job. I'm getting the connection to database, through context file as below: try { Context initContext = new InitialContext(); DataSour

Re: boost parent fields BlockJoinQuery

2016-04-12 Thread Mikhail Khludnev
Hello, It's usually parent_field:"bla bla"^10 {!parent which="is_parent:true" score=max}(child_field:bla) or parent_field:"bla bla"^10 +{!parent which="is_parent:true" score=max}(child_field:bla) there should be no spaces in child clause, otherwise extract it to param and refrer via v=$param On

Re: Cache problem

2016-04-12 Thread Bastien Latard - MDPI AG
Thank you both, Bill and Reth! Here is my current options from my command to launch java: */usr/bin/java -Xms20480m -Xmx40960m -XX:PermSize=10240m -XX:MaxPermSize=20480m [...]* So should I do *-Xms20480m -Xmx20480m* ? Why? What would it change? Reminder: the size of my main index is 46Gb...

Re: Limiting regex queries

2016-04-12 Thread Vincenzo D'Amore
Hi Michael, I suggest to wrap the query parser you're using now with a custom one. That's should help to handle the case where the query has a range with a large number. I did something like that with Edismax. https://github.com/freedev/solr-synonyms-query-parser-plugin Take a look at the creat

Re: Solr Sharding Strategy

2016-04-12 Thread Bhaumik Joshi
Ok i will try with pausing the indexing fully and will check the impact. In performance test queries issued sequentially. Thanks & Regards, Bhaumik Joshi From: Toke Eskildsen Sent: Monday, April 11, 2016 11:13 PM To: Bhaumik Joshi Cc: solr-user@lucene.apa