Re: reindex a lucene index with solr

2017-12-12 Thread Amin Raeiszadeh
Thanks for your guidance, Erick. Until now I have used Solr v4.10.3 in a cluster of 40 searchers (Solr instances), each with about 40 cores. Now I want to migrate to SolrCloud v6.5.1. I designed my own custom policy for document routing (default mode of routing) between shards, and I have about 1000 shards at star

Re: Specific cores/collections to specific nodes

2017-12-12 Thread Leo Prince
Thanks Erick. createNodeSet works perfectly. A nice way of allocating collections among nodes. When I went through the docs, I noticed the host tag and thought I would make use of it; however, createNodeSet works just fine for me, and createNodeSet.shuffle too. Just wanted to know one more thing. Is there

Re: Solr Aggregation queries are way slower than Elastic Search

2017-12-12 Thread RAUNAK AGRAWAL
Thanks Yonik and Joel. I will try with JSON Facet API and update the results here. On Tue, Dec 12, 2017 at 10:56 PM, Yonik Seeley wrote: > On Tue, Dec 12, 2017 at 9:17 AM, RAUNAK AGRAWAL > wrote: > > Hi Yonik, > > > > So if the query is fine then I guess even using JSON Facet API will not > > h

Re: How to perform delta-import on SolrCloud mode through a scheduler?

2017-12-12 Thread Shawn Heisey
On 12/8/2017 2:40 AM, Sabeer Hussain wrote: > I am using Solr 7.1 version and deployed it in standalone mode. I have > created a scheduler in my application itself to perform delta-import > operation based on a pre-configured frequency. I have used the following > lines of code (in java) to invoke

Re: Protect a collection to be deleted

2017-12-12 Thread Anshum Gupta
You might want to explore Rule based authorization in Solr and stop non-admin users from deleting collections etc. Here’s the link to the documentation: https://lucene.apache.org/solr/guide/6_6/rule-based-authorization-plugin.html

Re: Protect a collection to be deleted

2017-12-12 Thread Yago Riveiro
Thanks, Shawn, for taking the question to Jira. Indeed, I want to continue inserting data into the collection. I find that deleting a collection by mistake through the API is too easy and prone to human error. Regards, -- Yago Riveiro On 12 Dec 2017 19:05 +, Shawn Heisey , wrote: > On 12/12/20

Re: Protect a collection to be deleted

2017-12-12 Thread Shawn Heisey
On 12/12/2017 10:27 AM, Yago Riveiro wrote: > Is it possible in Solr to protect a collection from being deleted through a > property? I doubt that this is possible at the moment. The suggestion from Markus to change permissions on the index files would prevent the actual index from being deleted, but I s

RE: Protect a collection to be deleted

2017-12-12 Thread Yago Riveiro
I don’t know if it’s possible, but if we could mark the collection as protected, we could prevent the DELETE command from removing the collection. Maybe set the flag when the CREATE command is executed? This is an interesting feature to avoid human errors, and relatively easy to implement. Regards -- Yago Ri

RE: Protect a collection to be deleted

2017-12-12 Thread Markus Jelsma
Hello, Well, you could remove the write permission for all segment files. Or, make a custom UpdateProcessor that intercepts *:* operations and stops a delete in its tracks. This is what we did, to protect the search against me. Keep in mind that a negative query can also delete everything, so you

Protect a collection to be deleted

2017-12-12 Thread Yago Riveiro
Hi, Is it possible in Solr to protect a collection from being deleted through a property? Best regards, Yago

Re: Solr Aggregation queries are way slower than Elastic Search

2017-12-12 Thread Yonik Seeley
On Tue, Dec 12, 2017 at 9:17 AM, RAUNAK AGRAWAL wrote: > Hi Yonik, > > So if the query is fine then I guess even using JSON Facet API will not > help me here. As Joel mentioned, it's completely different code than the old stats API. This is a very simple use-case, so if we're slower than ES for s

Re: Recovery Issue - Solr 6.6.1 and HDFS

2017-12-12 Thread Erick Erickson
Right, look at autoAddReplicas, which is designed to do this automagically (but I confess I don't have much experience with it). What that doesn't handle is capacity; if you need to increase the QPS, you still need to add replicas. Depends on your needs of course. Best, Erick On Mon, Dec 11, 201

Re: Specific cores/collections to specific nodes

2017-12-12 Thread Erick Erickson
Wait, rule-based placement isn't what I was talking about at all. You don't need to knock yourself out understanding the rule based replica placement, which is being replaced by "Policies" in 7x anyway. You can just use createNodeSet in the collection CREATE command. It's a list of nodes (as in th
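A minimal sketch of the kind of CREATE request described above, assuming a Solr base URL of http://localhost:8983/solr and made-up node names in the usual host:port_solr form. The helper name build_create_url and all example values are hypothetical; the code only builds the Collections API URL and does not send it.

```python
from urllib.parse import urlencode


def build_create_url(base_url, name, num_shards, replication_factor, nodes, shuffle=True):
    """Build a Collections API CREATE URL that restricts placement to `nodes`.

    `nodes` is a list of node names such as "10.0.0.1:8983_solr"
    (example values, not taken from the thread).
    """
    params = {
        "action": "CREATE",
        "name": name,
        "numShards": num_shards,
        "replicationFactor": replication_factor,
        # Comma-separated list of the nodes the collection may be created on.
        "createNodeSet": ",".join(nodes),
        # With shuffle disabled, Solr assigns replicas in the order listed.
        "createNodeSet.shuffle": "true" if shuffle else "false",
    }
    return f"{base_url}/admin/collections?{urlencode(params)}"


if __name__ == "__main__":
    print(build_create_url(
        "http://localhost:8983/solr",
        "my_collection",
        2,
        1,
        ["10.0.0.1:8983_solr", "10.0.0.2:8983_solr"],
        shuffle=False,
    ))
```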

Re: Implicit routing changes to Composite while re-deploy changes and make Upconfig command

2017-12-12 Thread Erick Erickson
bq: It will changes the router value implicit to compositeId What evidence do you have of that? This better not happen. The routing scheme is part of the collection creation command and is stored in ZooKeeper in the collection's znode, it has nothing to do with the configset so updating the config

Re: Prevent Document to get partially indexed if document is not available

2017-12-12 Thread Mikhail Khludnev
https://lucene.apache.org/solr/guide/6_6/updating-parts-of-documents.html#UpdatingPartsofDocuments-OptimisticConcurrency If the content in the _version_ field is less than '0' (i.e., '-1'), then the document must *not* exist. In this case, no version matching occurs, but if the document exists, the
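For the question in this thread (update only when the document already exists), the same optimistic concurrency rules also define _version_ = 1 as "the document must exist". A hedged sketch of building such an atomic update payload follows; it assumes the unique key field is called id, and the helper name build_update_if_exists is made up for illustration. The payload would be posted to the collection's /update handler, and Solr is expected to reject it with a version conflict (HTTP 409) when the document is missing.

```python
import json


def build_update_if_exists(doc_id, field_updates):
    """Build an atomic-update payload that should only apply to an existing doc.

    Assumes the unique key field is named "id" (an assumption, adjust to
    the schema). `field_updates` maps field names to their new values.
    """
    # _version_ = 1 asks Solr to require that the document already exists.
    doc = {"id": doc_id, "_version_": 1}
    for field, value in field_updates.items():
        # "set" replaces the stored value of the field.
        doc[field] = {"set": value}
    return json.dumps([doc])


if __name__ == "__main__":
    print(build_update_if_exists("doc-1", {"price": 10}))
```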

Re: How extractingrequest handler works?

2017-12-12 Thread Erick Erickson
Please follow the instructions here: http://lucene.apache.org/solr/community.html#mailing-lists-irc. You must use the _exact_ same e-mail as you used to subscribe. If the initial try doesn't work and following the suggestions at the "problems" link doesn't work for you, let us know. But note you n

Re: reindex a lucene index with solr

2017-12-12 Thread Erick Erickson
What you haven't told us is whether you need to shard or not. But there's no difference between the index built for stand-alone Solr and SolrCloud. So just create your SolrCloud instance, probably single shard, 1 replica with the appropriate configset. Then shut that down and copy the index from y

Re: Solr Aggregation queries are way slower than Elastic Search

2017-12-12 Thread Joel Bernstein
The JSON facet API is a different aggregation implementation. So, it will definitely have different performance characteristics. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Dec 12, 2017 at 9:17 AM, RAUNAK AGRAWAL wrote: > Hi Yonik, > > So if the query is fine then I guess even using JS

Re: Solr Aggregation queries are way slower than Elastic Search

2017-12-12 Thread RAUNAK AGRAWAL
Hi Yonik, So if the query is fine then I guess even using JSON Facet API will not help me here. Can you suggest some other idea or further tuning that would help reduce the latency? On Tue, Dec 12, 2017 at 7:27 PM, Yonik Seeley wrote: > OK great, so it's definitely not the main query

Re: Solr Aggregation queries are way slower than Elastic Search

2017-12-12 Thread Yonik Seeley
OK great, so it's definitely not the main query (which is just a single term query in this example!) > Also I have looked into the JSON Facet API. If I have to use facets, I will > have to then define 3600 facets in a single query and I guess that would be > also slow. You can ask for any number
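As an illustration of requesting several aggregations in one JSON Facet call, here is a small sketch that builds the request parameters. The field names (metric_1, metric_2), the query, and the helper name build_stats_request are placeholders, not taken from the thread, and whether this scales well to thousands of metrics is not established here.

```python
import json


def build_stats_request(query, metrics):
    """Build select parameters with one JSON Facet aggregation per metric.

    `metrics` is a list of (function, field) pairs, for example
    ("sum", "metric_1"). Field names are placeholders.
    """
    # Each top-level key in json.facet names one aggregation result.
    facet = {f"{func}_{field}": f"{func}({field})" for func, field in metrics}
    return {
        "q": query,
        "rows": 0,  # only the aggregations are wanted, not the documents
        "json.facet": json.dumps(facet),
    }


if __name__ == "__main__":
    print(build_stats_request("id:1", [("sum", "metric_1"), ("max", "metric_2")]))
```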

Prevent Document to get partially indexed if document is not available

2017-12-12 Thread kumar gaurav
Hi, I need to partially update documents in Solr only if they are already present there. The problem is that Solr creates a new document if the document does not exist. Can someone please help? Thanks and regards, Kumar Gaurav

Re: Howto search for § character

2017-12-12 Thread Bernd Schmidt
Thanks for all the information. That helps me understand the issue so far. Cheers, Bernd Kind regards Bernd Schmidt SOFTWARE DEVELOPMENT b.schm...@eggheads.de From: Shawn Heisey To: Sent: 07.12.2017 19:00 Subject: Re: Howto search for § character On 12/7/201

RE: Pass field value through function for filtering

2017-12-12 Thread Markus Jelsma
Forget about it, I just remembered PostFilters! Thanks! Markus -Original message- > From:Markus Jelsma > Sent: Tuesday 12th December 2017 12:54 > To: Solr-user > Subject: Pass field value through function for filtering > > Hello, > > I have a function and a lot of documents, i want to

Pass field value through function for filtering

2017-12-12 Thread Markus Jelsma
Hello, I have a function and a lot of documents. I want to select all documents that give a certain value when I pass a document's field through the function. I just want to filter by function. How? I am thinking of implementing Collector. Get the docId, make a field look-up and discard if it

Solr - Achieve Delta-Import with the FileListEntityProcessor for PDF Files

2017-12-12 Thread Karan Saini
Solr version: 6.6.1. I am using Solr to index PDF files and it is working as expected. Now I have a requirement to perform a delta-import on the PDF files. I am not able to find an example of implementing delta-import with FileListEntityProcessor. Please suggest.

solr slave do not delete old index files

2017-12-12 Thread Muke
The problem I am having is that the old index files are not being deleted on the slave. After each replication, I can see the old files still hanging around. This causes the data directory size to increase by the index size every replication until the disk fills up. master: -rw-r- 1 henew

Re: Solr Aggregation queries are way slower than Elastic Search

2017-12-12 Thread RAUNAK AGRAWAL
Hi Yonik, As you asked, here is the code snippet and the actual Solr query. Please have a look. I have included only 104 metrics, but like this we can go up to 3600 rollups. Also I have looked into the JSON Facet API. If I have to use facets, I will have to then define 3600 facets in a single query

reindex a lucene index with solr

2017-12-12 Thread Amin Raeiszadeh
I have a Lucene index in which some fields of the docs are indexed with custom incremental gaps, and all fields are stored too (not only indexed). I need to import these docs into SolrCloud. Is there any way to automatically rebuild these docs for importing into Solr with custom gaps by something like converte

Re: Solr :: How to trigger the DIH from SolrNet API with C# code

2017-12-12 Thread Karan_Saini
I finally did it myself and used the *C# class HttpClient* to make the web request to the Solr API URL as follows. string solrTargetDIHUrl = "http://localhost:8983/solr/dih/dataimport?command=delta-import"; Makes Async call to the Web API URL & trigger the data import handler ac
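The same idea can be sketched outside C#. The following is a Python illustration, not the poster's code: it builds the DIH URL shown above (core name dih, handler path /dataimport) and offers an optional helper that issues the GET request with the standard library. The function names build_dih_url and trigger_import are hypothetical.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def build_dih_url(base_url, core, command="delta-import"):
    """Build the DataImportHandler URL for the given core and command."""
    return f"{base_url}/{core}/dataimport?{urlencode({'command': command})}"


def trigger_import(url, timeout=10):
    """Send the request and return the HTTP status code.

    Needs a reachable Solr instance; not called in the example below.
    """
    with urlopen(url, timeout=timeout) as response:
        return response.status


if __name__ == "__main__":
    print(build_dih_url("http://localhost:8983/solr", "dih"))
```

A scheduler would call trigger_import(build_dih_url(...)) at its configured interval and check the returned status.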

Performance issues with 'unique' function in json facets over a high cardinality field

2017-12-12 Thread alexpusch
Hi, I have a surprising performance issue with the 'unique' function in a JSON facet. My setup holds a large number of docs (~1B). Despite this large number, I only facet on the small result set of a query, only a few docs. The query itself returns as fast as expected, but when I try to do a unique cou

Re: How extractingrequest handler works?

2017-12-12 Thread Gareth Harper
Could someone unsubscribe me from this group please? Thank you, Gareth > On 8 Dec 2017, at 17:53, Sreenivas.T wrote: > > Thanks Erick. > > Im using ManifoldCF to connect to Fileshare and index the content to Solr. > So I was thinking to customize Solr's updateProcessor. However, It looks > like

Implicit routing changes to Composite while re-deploy changes and make Upconfig command

2017-12-12 Thread Nimesh Aghera
Hello, I have implemented implicit routing with the configuration below. I manually created one default collection, 'AMS_Config', which contains the configuration files (schema, solrconfig, etc.). Using 'AMS_Config' I have created 2 collections, model and workset, with the command below, which created 2 sha

Re: Specific cores/collections to specific nodes

2017-12-12 Thread Leo Prince
Thanks Erick. I went through the doc https://lucidworks.com/2015/05/12/rule-based-replica-assignment-solrcloud/ and it helped me understand how to use the "host" tag. However, can I use two values for the same key? I can do host:!1.1.1.1 but can I do something like host:!1.1.1.1,!2.2.2.2 or host:!1.1.1.1,host:!2.2.2.