Re: Upgrade SOLR version - facets perfomance regression

2017-01-27 Thread alessandro.benedetti
Which kind of field are you faceting on? Cardinality? Field type? DocValues enabled? Which facet algorithm are you using? Which facet parameters? Cheers -- View this message in context: http://lucene.472066.n3.nabble.com/Upgrade-SOLR-version-facets-perfomance-regression-tp4315027p4317513.html S

Re: Documents issue

2017-01-27 Thread alessandro.benedetti
I may be wrong and don't have time to check the code in detail now, but I would say you need to define the default in the destination field as well. The copy field should take as input the plain content of the field (which is null) and then pass that content to the destination field. Properties

[Benchmark SOLR] JETTY VS TOMCAT

2017-01-27 Thread Gerald Reinhart
Hello, We are migrating our platform from - Solr 5.4.1 hosted by a Tomcat to - Solr 5.4.1 standalone (hosted by Jetty) => Jetty is 15% slower than Tomcat in the same conditions. Here are details about the benchmarks : Context : - Index with 9 000 000 docume

Distributed IDF in inter collections distributed queries

2017-01-27 Thread alessandro.benedetti
Hi all, I was playing a bit with distributed IDF. I debugged and explored the code a lot, and it is a nice feature in a shared environment. I tried to see what the behaviour is when we run a distributed query across collections (...&collection=a,b,c). Distributed IDF should work in this sce

Re: Distributed IDF in inter collections distributed queries

2017-01-27 Thread alessandro.benedetti
I have an update on this: I have identified at least 2 bugs. 1) Real score / debug score is not aligned. When we operate a shard request with purpose '16388' (GET_TOP_IDS, SET_TERM_STATS) we correctly pass the global collection stats and we calculate the real score. When we operate a shard reques
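For readers wondering how a purpose value like '16388' maps to the two names in the message, here is a minimal sketch of a bit-flag decode. The flag values below are an assumption: they were picked so that they add up to the number quoted above (4 + 16384 = 16388), and they should be checked against the ShardRequest constants in your Solr version. `PURPOSE_FLAGS` and `decode_purpose` are hypothetical names, not Solr API.

```python
# Assumed flag values, chosen to be consistent with the quoted purpose 16388.
# Verify against the ShardRequest constants of your Solr version.
PURPOSE_FLAGS = {
    "GET_TOP_IDS": 0x4,
    "SET_TERM_STATS": 0x4000,
}


def decode_purpose(purpose):
    """Return the names of all known flags set in the purpose bitmask."""
    return [name for name, bit in PURPOSE_FLAGS.items() if purpose & bit]


if __name__ == "__main__":
    print(decode_purpose(16388))
```

Only the two flags named in the message are listed, so any other bit in a purpose value is silently ignored by this sketch.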

RE: Distributed IDF in inter collections distributed queries

2017-01-27 Thread Markus Jelsma
On 1), https://issues.apache.org/jira/browse/SOLR-7759 -Original message- > From:alessandro.benedetti > Sent: Friday 27th January 2017 13:22 > To: solr-user@lucene.apache.org > Subject: Re: Distributed IDF in inter collections distributed queries > > I have an update on this, I have

RE: Distributed IDF in inter collections distributed queries

2017-01-27 Thread alessandro.benedetti
Thanks Markus, I commented on the Jira issue with a very naive approach to solve that. It's a shot in the dark, I will double check if it makes sense at all :) Cheers -- View this message in context: http://lucene.472066.n3.nabble.com/Distributed-IDF-in-inter-collections-distributed-queries-tp431

Re: [Benchmark SOLR] JETTY VS TOMCAT

2017-01-27 Thread William Bell
Did you try: Set your acceptor count, SelectChannelConnector.setAcceptors(int) to be a value between 1 and (number_of_cpu_cores - 1). On Fri, Jan 27, 2017 at 3:22 AM, Geral
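As a rough illustration of the range suggested above, the sketch below computes the lower and upper bound for the acceptor count on the current machine. It only does the arithmetic; `acceptor_range` is a hypothetical helper, and the value still has to be set in the Jetty connector configuration by hand.

```python
import os


def acceptor_range(cpu_cores=None):
    """Return (low, high) for the acceptor count: 1 .. cores - 1, never below 1."""
    # os.cpu_count() may return None, so fall back to a single core.
    cores = cpu_cores if cpu_cores is not None else (os.cpu_count() or 1)
    return 1, max(1, cores - 1)


if __name__ == "__main__":
    low, high = acceptor_range()
    print(f"acceptors should be between {low} and {high}")
```

On a single-core machine the range collapses to exactly 1, since the upper bound is clamped.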

Re: [Benchmark SOLR] JETTY VS TOMCAT

2017-01-27 Thread Yago Riveiro
Solr runs its tests with Jetty. I ran into nasty bugs in Solr in the past with Tomcat. My advice is that speed is only one more metric; robustness and reliability matter too. -- /Yago Riveiro On 27 Jan 2017 15:38 +, William Bell , wrote: > Did you try: > > Set your acceptor count, SelectChanne

Re: Documents issue

2017-01-27 Thread Comcast
Why would this behavior change from one day to the next? I ran crawl and index several times with no issues, changed the schematic xml definition of a couple of fields, ran crawl and index and produced the dataset with missing copyfields. Sent from my iPhone > On Jan 27, 2017, at 4:07 AM, ales

Re: Documents issue

2017-01-27 Thread alessandro.benedetti
Hi Khris, can you paste here the diff between the OK status and the KO status? Has only the name of the destination field changed? (a replacement of '.' with '_'?) Do you have any dynamic field defined? Cheers -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-alter-the

Splitting large non-stored field

2017-01-27 Thread deansg
Hi, Many of our documents contain a unique, non-indexed text field that contains html-content which we display to our users (let's call it "html_content"). The reason we store this field in Solr in the first place is because of Solr's highlighting capabilities: the query itself is against the non-h

Re: Documents issue

2017-01-27 Thread Comcast
The only change was a rename to remove periods from the field names. Sent from my iPhone > On Jan 27, 2017, at 11:53 AM, alessandro.benedetti > wrote: > > Hi Khris, > can you paste here the diff between the OK status and the KO status ? > Has only the name of the destination field changed ? ( a replac

Re: Does DIH queues up requests

2017-01-27 Thread William Bell
However you can create multiple DIH configs under a core/collection. You can run them each in parallel and commit at the end. SELECT * FROM existingtable WHERE column >= 1 AND column <= 2000; SELECT * FROM existingtable WHERE column >= 2001 AND column <= 4000; Something like that works for u
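The range split in the message above can be generated rather than typed out. Below is a minimal sketch, assuming a numeric key column with known minimum and maximum values; `partition_queries` is a hypothetical helper, and the table and column names are simply the ones from the message. Each generated query would go into its own DIH config.

```python
def partition_queries(table, column, min_id, max_id, chunk):
    """Build one SELECT per contiguous, non-overlapping id range of size chunk."""
    if chunk <= 0:
        raise ValueError("chunk must be positive")
    queries = []
    start = min_id
    while start <= max_id:
        # The last range is clipped so it never runs past max_id.
        end = min(start + chunk - 1, max_id)
        queries.append(
            f"SELECT * FROM {table} WHERE {column} >= {start} AND {column} <= {end};"
        )
        start = end + 1
    return queries


if __name__ == "__main__":
    for query in partition_queries("existingtable", "column", 1, 4000, 2000):
        print(query)
```

With the arguments shown, this yields the same two queries as in the message. The names are interpolated directly into the SQL text, so this is only suitable for trusted, hand-written values.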

Re: After migrating to SolrCloud

2017-01-27 Thread Chris Hostetter
That error means that some client talking to your server is attempting to use an antiquated HTTP protocol version, which was (evidently) supported by the jetty used in 3.6, but is no longer supported by the jetty used in 6.2. (some details: https://stackoverflow.com/a/32302263/689372 ) If it'

Re: Streaming Expressions result-set fields not in order

2017-01-27 Thread Joel Bernstein
The issue is that fields are held in HashMaps internally so field order is not maintained. The thinking behind this was that field order was not so important as Tuples are mainly accessed by keys. But I think it's worth looking into an approach for maintaining field order. Feel free to create jira
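Until field order is maintained on the server side, one possible client-side workaround is to reorder each tuple by an explicit field list after reading it. This is a minimal sketch, assuming each tuple has already been parsed into a dict; `ordered_fields` is a hypothetical helper and not part of any Solr client.

```python
def ordered_fields(tuple_doc, field_order):
    """Return (name, value) pairs in the requested order, skipping absent fields."""
    return [(name, tuple_doc[name]) for name in field_order if name in tuple_doc]


if __name__ == "__main__":
    # Field names and values here are made up for illustration.
    doc = {"price": 10, "id": "doc1", "name": "widget"}
    print(ordered_fields(doc, ["id", "name", "price"]))
```

Fields that are in the tuple but not in the requested list are dropped, so the list should name every field you want to keep.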