SolrCloud 6 Join Stream and pagination

2016-05-18 Thread Roshan Kamble
Hello, I am using Solr 6 in cloud mode. To search across different collections I am using InnerJoinStream (with qt=export in order to get correct results). Is there any way to get paginated results? Regards, Roshan
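The question above can be illustrated with a minimal sketch. It builds an innerJoin streaming expression over two collections read through the /export handler, then pages through the joined tuples on the client side. The collection names, field names, and the helper names inner_join_expr and page are hypothetical, and client-side slicing is only one possible workaround, not a confirmed Solr feature for paginating joins.

```python
from itertools import islice


def inner_join_expr(left, right, key, fl):
    """Build an innerJoin streaming expression over two collections.

    Both sides are sorted on the join key, which innerJoin requires.
    Collection and field names are placeholders chosen by the caller.
    """
    def search(collection):
        return (f'search({collection}, q="*:*", fl="{fl}", '
                f'sort="{key} asc", qt="/export")')

    return f'innerJoin({search(left)}, {search(right)}, on="{key}")'


def page(tuples, page_number, page_size):
    """Return one page (0-based) from an iterable of joined tuples."""
    start = page_number * page_size
    return list(islice(tuples, start, start + page_size))


# Hypothetical usage: the expression would be sent to the /stream handler,
# and the resulting tuple iterator sliced one page at a time.
expr = inner_join_expr("people", "pets", "personId", "personId,name")
second_page = page(range(10), 1, 3)
```

Slicing on the client still streams every tuple that precedes the requested page, so this only suits result sets small enough to re-read.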

How to stop searches to solr while full data import is going in SOLR

2016-05-18 Thread preeti kumari
Hi, I am using Solr 5.2.1. I have two clusters, Primary A and Primary B. I was pinging the servers to check whether they are up, in order to route searches to the working cluster, A or B. But while I am running a full data import in Primary cluster A, not all the data is there, and pinging servers will not

Re: Sub faceting on string field using json facet runs extremely slow

2016-05-18 Thread Vijay Tiwary
Can somebody confirm whether the JIRA issue SOLR-8096 will also affect JSON facets? I see that sub-faceting using a terms facet on a string field is running 5x slower than on an integer field for the same number of hits and unique terms. On 17-May-2016 3:33 pm, "Vijay Tiwary" wrote: > Below is the request > >

Sorting on child document field.

2016-05-18 Thread Pranaya Behera
Hi, How can I sort the results of a block join parent query using a field from the child documents? Thanks & Regards Pranaya Behera

Re: Solrj 4.7.2 - slowing down over time

2016-05-18 Thread Ahmet Arslan
Hi Roman, You said you were using EmbeddedSolrServer, also you mention Tomcat. I don't think it is healthy to use both. Also I wouldn't use EmbeddedSolrServer at all. It is rarely used and there can be hidden things there. Consider using jetty which is actually tested. Since you commit every min

Re: Faceting and Grouping Performance Degradation in Solr 5

2016-05-18 Thread Joel Bernstein
The enum slowness is interesting. It would appear on the surface to not be related to the FieldCache issue. I don't think the main emphasis of the JSON facet API has been the enum approach. You may find using the JSON facet API and eliminating the use of enum meets your performance needs. With the

Re: Faceting and Grouping Performance Degradation in Solr 5

2016-05-18 Thread Solr User
Joel, Thank you for taking the time to respond to my question. I tried the JSON Facet API for one query that uses facet.method=enum (since this one has a ton of unique values and performed better with enum) but this was way slower than even the slower Solr 5 times. I did not try the new API with

Re: Solrj 4.7.2 - slowing down over time

2016-05-18 Thread Joel Bernstein
One thing to investigate is whether your caches are too large and gradually filling up memory. It does sound like memory is getting tighter over time. A memory profiler would be helpful in figuring out memory issues. Moving to autoCommits would also eliminate any slowness due to overlapping search

Re: Solr M2M authentication on Jetty

2016-05-18 Thread Joel Bernstein
Hi, You may want to investigate the Streaming Expression functions in Solr 6.0. There are some interesting functions that might help with some of the tasks you mentioned: 1) topic: lets users subscribe to a query and returns all new documents for a query. 2) daemon: allows you to set up processes

Conditional atomic update

2016-05-18 Thread chris
(Resending because DMARC-compliant ESPs bounced the previous version) I'm looking for a way to do an atomic update, but if a certain condition exists on the existing document, abort the update. Each document has the fields id, count, and value. The source data ha
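One way to approach the request above is sketched here, under stated assumptions. The client reads the existing document, checks the condition itself, and only then builds an atomic update that carries the document's _version_ so that Solr's optimistic concurrency rejects the write if the document changed in the meantime. The abort condition is hypothetical because the message is truncated, and conditional_update is an illustrative helper, not an existing Solr or SolrJ call. This swaps in client-side checking plus optimistic concurrency rather than a server-side conditional.

```python
import json


def conditional_update(existing, new_value, should_abort):
    """Return the JSON body for an atomic update, or None to abort.

    existing     -- the document as currently stored, including _version_
    new_value    -- the value to set on the 'value' field
    should_abort -- hypothetical predicate over the existing document
    """
    if should_abort(existing):
        return None
    doc = {
        "id": existing["id"],
        "value": {"set": new_value},   # atomic update: replace the field
        "count": {"inc": 1},           # atomic update: increment the field
        # A positive _version_ must match the stored version exactly,
        # otherwise Solr rejects the update with a version conflict.
        "_version_": existing["_version_"],
    }
    return json.dumps([doc])


# Hypothetical stored document and condition.
stored = {"id": "doc1", "count": 5, "value": "a", "_version_": 123}
body = conditional_update(stored, "b", lambda d: d["count"] >= 10)
```

The returned body would be posted to the collection's /update handler. On a version conflict the client can re-read the document and re-evaluate the condition.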

RE: Solr M2M authentication on Jetty

2016-05-18 Thread Gregoric, James
Correction: Item 1 is not an absolute requirement; we can use Apache or Tomcat if that makes things any easier. -Original Message- From: Gregoric, James [mailto:james.grego...@childrens.harvard.edu] Sent: Wednesday, May 18, 2016 1:54 PM To: solr-user@lucene.apache.org Subject: Solr M2M

Re: load schema.xml

2016-05-18 Thread Siddhartha Singh Sandhu
Hi, Add the following to your solrconfig.xml: in place of: because solr 6.0.0 defaults to managed schema. First time answering and not questioning. Community, please correct me if I am wrong. Sid. On Wed, May 18, 2016 at 8:48 AM, forme book wrote: > Hi, > > I'm using solr 6.0.0 and I

Solr M2M authentication on Jetty

2016-05-18 Thread Gregoric, James
Dear Solr Community, We would like to provide an in-house group of users access to our Solr database in a way that meets the following specifications: 1. Use the Jetty web service that Solr 6.0 installs by default. 2. Provide an M2M (machine-to-machine) interface, so a user can setu

Re: Facet heatmaps: cluster coordinates based on average position of docs

2016-05-18 Thread David Smiley
Sorry for such a belated response; I don't monitor this list as much as I used to. My response is within... On Wed, Apr 20, 2016 at 4:28 AM Anton K. wrote: > Thanks for your answer, David, and have a good vacation. > > It seems more detailed heatmap is not a good solution in my case because i >

Hierarchical Support - Solr

2016-05-18 Thread thiaga rajan
Hi Team, We are exploring Solr as a search engine for one of our projects. It is a really great tool in terms of indexing and response time. While exploring, we came up with the questions and understandings below. Kindly confirm them. We are actually trying to implement the search engine for

Re: Highlighting phone numbers

2016-05-18 Thread David Smiley
Perhaps an easy thing to try is to see if the FastVectorHighlighter yields any different results. There are some nuances to the highlighters -- it might. Failing that, this is likely due to your analysis chain, and where exactly the offsets point to, which you can see/debug in Solr's analysis screen.

Achieve replication in shards for SolrCloud cluster

2016-05-18 Thread Deeksha Sharma
I am new to Solr and already have Lucene indexes that I want to serve through SolrCloud. I have SolrCloud set up with an external ZooKeeper and 2 Solr instances, solr1 and solr2, registered with this ZooKeeper. On solr1 I add a symlink to my existing Lucene indexes (and not on solr2). I create the

Re: Faceting and Grouping Performance Degradation in Solr 5

2016-05-18 Thread Joel Bernstein
Yes, SOLR-8096 is the issue here. I don't believe indexing with docValues is going to help too much with this. The enum slowness may not be related, but I'm not positive about that. The major slowdowns are likely due to the removal of the top level FieldCache from general use and the removal of t

Re: Solrj Basic Authentication randomly failing - "request has come without principal"

2016-05-18 Thread shamik
Anyone?

Re: Precision, Recall, ROC in solr

2016-05-18 Thread Ahmet Arslan
Hi Tentri, Evaluation in IR is primarily carried out using the traditional TREC-style evaluation methodology (also referred to as the Cranfield paradigm). This methodology requires a document collection, a set of information needs (called topics or queries), and a set of query relevance judgments (qr
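The set-based measures implied by this methodology can be shown with a short sketch. It computes precision and recall for a single query from a retrieved list and a set of documents judged relevant. The document ids are made up for illustration, and precision_recall is an illustrative helper, not part of Solr or any TREC tool.

```python
def precision_recall(retrieved, relevant):
    """Compute set-based precision and recall for one query.

    retrieved -- ids returned by the search engine
    relevant  -- ids judged relevant for the query
    """
    retrieved_set = set(retrieved)
    relevant_set = set(relevant)
    hits = len(retrieved_set & relevant_set)
    # Guard against empty sets to avoid division by zero.
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return precision, recall


# Hypothetical example: 4 documents retrieved, 3 judged relevant, 2 in common.
p, r = precision_recall(["d1", "d2", "d3", "d4"], {"d1", "d3", "d7"})
```

Here precision is 2/4 and recall is 2/3. Averaging these values over all topics gives collection-level figures.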

Re: Faceting and Grouping Performance Degradation in Solr 5

2016-05-18 Thread Solr User
Does anyone know the answer to this? On Wed, May 4, 2016 at 2:19 PM, Solr User wrote: > I recently was attempting to upgrade from Solr 4.8.1 to Solr 5.4.1 but had > to abort due to average response times degraded from a baseline volume > performance test. The affected queries involved faceting

Solrj 4.7.2 - slowing down over time

2016-05-18 Thread Roman Slavík
Hi all, we're using Solr in our application and have the problem that both searching and indexing slow down over time. Versions: - Java 1.7 Solr 4.7.2 Lucene 4.1 (luceneMatchVersion param in solrconfig.xml) App architecture: - We don't use solr as standalone applicat

Highlighting phone numbers

2016-05-18 Thread marotosg
Hi, I have a solr multivalued field with a list of phone numbers with many different formats. Below field type.

load schema.xml

2016-05-18 Thread forme book
Hi, I'm using Solr 6.0.0 and I have some trouble loading a custom schema.xml. After creating a new collection with bin/solr create -c newcollection -d basic_configs and copying my custom schema.xml under {solr_home}/server/newcollection/conf, I try to load some data with java -Dtype=text/csv -Durl= http://

Precision, Recall, ROC in solr

2016-05-18 Thread Tentri Oktaviani
Hi solr users, My final task in college is making a search engine. I'm using Solr to access and retrieve data from an ontology, which will later be used as a corpus. I'm entirely new to these things (information retrieval, ontology, Python and Solr). There's a step in information retrieval to evaluat

Problem retrieving payloads from a specific term in a boosting function

2016-05-18 Thread Félix Sanjuán
Hi all, I have added a new field to my schema that is of the following type: As a quick example, the value of this field would have values like the following: people: "userid1|1 userid2|56" Basically, a user id and an integ

RE: http request to MiniSolrCloudCluster

2016-05-18 Thread Rohana Rajapakse
Hi, I can now talk to MiniSolrCloudCluster over HTTP using the actual Jetty port that Solr is running on, rather than going through ZooKeeper (using the ZooKeeper port). The Solr port(s) are available in the list of JettySolrRunner instances in the MiniSolrCloudCluster instance. I am not sure if this is the correc

Commit (hard) at shutdown?

2016-05-18 Thread Per Steffensen
Hi Solr 5.1. Someone in production in my organization claims that even though Solrs are shut down gracefully, there can be huge tlogs to replay when starting Solrs again. We are doing heavy indexing right up until Solrs are shut down, and we have set to 1 min. Can anyone confirm (or the oppo

Re: Updating error while add doc to Solrcloud

2016-05-18 Thread Shawn Heisey
On 5/17/2016 12:29 AM, scott.chu wrote: > I build Solrcloud with 2 nodes, 1 shard, 2 replica. I add doc in xml format > using post.jar up to 2.85M+ no. of docs and 10gb index size. When I add more > docs. the solr.log shows: > > -- > 2016-05-17 14:01:09,024

Re: API call for optimising a collection

2016-05-18 Thread Binoy Dalal
OK That's great. Thanks. On Wed, 18 May 2016, 12:52 Shawn Heisey, wrote: > On 5/17/2016 11:04 PM, Binoy Dalal wrote: > > Is there no api call that can optimize an entire collection? I tried > > the collections api page on the confluence wiki but couldn't find > > anything, and a Google search al

Re: API call for optimising a collection

2016-05-18 Thread Shawn Heisey
On 5/17/2016 11:04 PM, Binoy Dalal wrote: > Is there no api call that can optimize an entire collection? I tried > the collections api page on the confluence wiki but couldn't find > anything, and a Google search also yielded no meaningful results. With SolrCloud, if you either send an optimize re
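For reference, an optimize request is normally sent to the update handler of a core or collection. The sketch below only builds such a URL and does not send it. The base URL and collection name are placeholders, and optimize_url is an illustrative helper rather than a SolrJ or Solr API.

```python
from urllib.parse import urlencode


def optimize_url(base_url, collection, max_segments=1, wait_searcher=True):
    """Build an update-handler URL that asks Solr to optimize an index.

    base_url   -- e.g. "http://localhost:8983/solr" (placeholder)
    collection -- collection or core name (placeholder)
    """
    params = {
        "optimize": "true",
        "maxSegments": max_segments,
        "waitSearcher": str(wait_searcher).lower(),
    }
    return f"{base_url.rstrip('/')}/{collection}/update?{urlencode(params)}"


# Hypothetical usage: print the URL, then issue it with any HTTP client.
url = optimize_url("http://localhost:8983/solr", "mycollection")
print(url)
```

An optimize rewrites the whole index, so it is usually run during quiet periods.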

Re: API call for optimising a collection

2016-05-18 Thread Binoy Dalal
John, I haven't noticed such a thing until now, but I'll give it a shot and update this thread. Additionally, in what version of Solr have you noticed this behaviour? I'm currently running 4.10.4 On Wed, 18 May 2016, 11:26 John Bickerstaff, wrote: > Having run the optimize from the admin UI on one

Re: Creating a collection with 1 shard gives a weird range

2016-05-18 Thread John Smith
On 17/05/16 11:56, Tom Evans wrote: > On Tue, May 17, 2016 at 9:40 AM, John Smith wrote: >> I'm trying to create a collection starting with only one shard >> (numShards=1) using a compositeID router. The purpose is to start small >> and begin splitting shards when the index grows larger. The shard