Re: Uneven shard heap usage

2014-06-01 Thread Joe Gresock
These are some good ideas. The "huge document" idea could add up, since I think the shard1 index is a little larger (32.5GB on disk instead of 31.9GB), so it is possible there's one or 2 really big ones that are getting loaded into memory there. Btw, I did find an article on the Solr document rou

Re: Full Indexing fails on Solr-Probable connection issue.HELP!

2014-06-01 Thread Shawn Heisey
On 5/31/2014 1:54 PM, Aniket Bhoi wrote: > Caused by: com.microsoft.sqlserver.jdbc.SQLServerException: The result >>> set is closed. I still think this is an indication of the source of the problem. Something closed the connection to your SQL server before Solr was done with it. That could have b

HASH range calculation

2014-06-01 Thread SolrUser1543
I have a SOLR cloud with 5 solr instances . Indexing of new documents is always performed against instance#1 . Then according to hash calculation the document is being indexing on one of instances . How could i know on which one it will be ? How could I know how SOLR is calculating the hash to

Re: HASH range calculation

2014-06-01 Thread Anshum Gupta
Hi, This blogpost would help you understand the routing and hash calculation : http://searchhub.org/2014/01/06/10590/ Also, http://searchhub.org/2013/06/13/solr-cloud-document-routing/ will come in handy. Let me know if you have any more questions. On Sun, Jun 1, 2014 at 11:58 AM, SolrUser154

Can Atomic Updates help me to re-indexing w/o crawling external content?

2014-06-01 Thread Gili Nachum
Hello. I'm just starting out with my Solr deployment and believe there's a good chance I'll want to change how my fields are indexing in the near future, I wouldn't want to crawl the original content store again just to re-index. I was hoping that Atomic Updates (which keeps all fields as stored)

Re: HASH range calculation

2014-06-01 Thread Shawn Heisey
On 6/1/2014 12:58 PM, SolrUser1543 wrote: > I have a SOLR cloud with 5 solr instances . > > Indexing of new documents is always performed against instance#1 . > > Then according to hash calculation the document is being indexing on one of > instances . > > How could i know on which one it will b

strategies for managing Solr indexing failures and retries

2014-06-01 Thread Philip Durbin
I'm curious about strategies for managing Solr indexing failures. If there's an indexing problem, I'd like my application to retry the indexing operation. I noticed that Chef uses RabbitMQ for (apparently) all Solr indexing operations. http://docs.opscode.com/server_components.html has this to say

Re: Compression vs FieldCache for doc ids retrieval

2014-06-01 Thread jim ferenczi
@William Firstly because I was sure that the ticket (or an equivalent) was already opened but I just could not find it. Thanks @Manuel. Secondly because I wanted to start the discussion, I have the feeling that the compression of the documents, activated by default, can be a killer for some applica

Anybody knows of a project that indexes SVN repos into Solr?

2014-06-01 Thread Alexandre Rafalovitch
Hello, Anybody knows of a recent projects that index SVN repos for Solr search? With or without UI. I know of similar efforts for other VCS, but the only thing I found for SVN is from 2010 and looking quiet. Regards, Alex. P.s. This could also be a cool show-off project for somebody. Plenty o