Re: Multi threading indexing

2018-05-14 Thread Mikhail Khludnev
A few years ago I provided server side concurrency "booster" https://issues.apache.org/jira/browse/SOLR-3585. But now, I'd rather suppose it's client-side (or ETL) duty. On Mon, May 14, 2018 at 6:39 AM, Raymond Xie wrote: > Hello, > > I have a huge amount of data (TB level) to be indexed, I am w

Re: How to restart solr in docker?

2018-05-14 Thread reznov9185
This is what I needed to do for updating the solrconfig files from local to docker: `sudo docker cp docker/solr/production/conf/solrconfig.xml solr:/opt/solr/server/solr/production/conf/solrconfig.xml` `sudo docker restart solr` For some reason this is not syncing automatically, so I had to cp the

Techniques for Retrieving Hits

2018-05-14 Thread Terry Steichen
In order to allow users to retrieve the documents that match a query, I make use of the embedded Jetty container to provide file server functionality.  To make this happen, I provide a symbolic link between the actual document archive, and the Jetty file server.  This seems somewhat of a kludge, an

Re: Async exceptions during distributed update

2018-05-14 Thread Jay Potharaju
Adding some more context to my last email Solr:6.6.3 2 nodes : 3 shards each No replication . Can someone answer the following questions 1) any ideas on why the following errors keep happening. AFAIK streaming solr clients error is because of timeouts when connecting to other nodes. Async e

Re: Techniques for Retrieving Hits

2018-05-14 Thread Shawn Heisey
On 5/14/2018 6:46 AM, Terry Steichen wrote: In order to allow users to retrieve the documents that match a query, I make use of the embedded Jetty container to provide file server functionality.  To make this happen, I provide a symbolic link between the actual document archive, and the Jetty fil

Commit too slow?

2018-05-14 Thread LOPEZ-CORTES Mariano-ext
Hi After having injecting 200 documents in our Solr server, the commit operation at the end of the process (using ConcurrentUpdateSolrClient) take 10 minutes. It's too slow? Our auto-commit policy is the following: 15000

Re: Commit too slow?

2018-05-14 Thread Shawn Heisey
On 5/14/2018 11:29 AM, LOPEZ-CORTES Mariano-ext wrote: > After having injecting 200 documents in our Solr server, the commit > operation at the end of the process (using ConcurrentUpdateSolrClient) take > 10 minutes. It's too slow? There is a wiki page discussing slow commits: https://wiki.

Re: Techniques for Retrieving Hits

2018-05-14 Thread Terry Steichen
Shawn, As noted in my embedded comments below, I don't really see the problem you apparently do.  Maybe I'm missing something important (which certainly wouldn't  be the first - or last -  time that happened). I posted this note because I've not seen list comments pertaining to the job of actual

Re: Techniques for Retrieving Hits

2018-05-14 Thread Shawn Heisey
On 5/14/2018 3:13 PM, Terry Steichen wrote: > I posted this note because I've not seen list comments pertaining to the > job of actually locating and retrieving hitlist documents. How documents are retrieved will be highly dependent on your setup.  Here's how things usually go: If the original da

[ANNOUNCE] Apache Solr 7.3.1 released

2018-05-14 Thread Cao Mạnh Đạt
15 May 2018, Apache Solr™ 7.3.1 available The Lucene PMC is pleased to announce the release of Apache Solr 7.3.1 Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted sea

Re: question about updates to shard leaders only

2018-05-14 Thread Bernd Fehling
OK, I have the CloudSolrClient with SolrJ now running but it seams a bit slower compared to ConcurrentUpdateSolrClient. This was not expected. The logs show that CloudSolrClient send the docs only to the leaders. So the only advantage of CloudSolrClient is that it is "Cloud aware"? With Concurre