Re: Obtaining SOLR index size on disk

2009-07-18 Thread Erik Hatcher
On Jul 17, 2009, at 8:45 PM, J G wrote:
> Is it possible to obtain the SOLR index size on disk through the SOLR API? I've read through the docs and mailing list questions but can't seem to find the answer.
No, but it'd be a great addition to the /admin/system handler, which returns lots of o
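Since the reply says the API did not expose this at the time, one possible workaround is to measure the index directory on the filesystem directly. The following is a minimal sketch, not something proposed in the thread: `index_size_bytes` is a hypothetical helper name, and the caller is assumed to know where the index directory lives on the local disk.

```python
import os


def index_size_bytes(index_dir):
    """Sum the sizes of all regular files under index_dir (hypothetical helper)."""
    total = 0
    for root, _dirs, files in os.walk(index_dir):
        for name in files:
            path = os.path.join(root, name)
            # Do not follow symlinks, so nothing outside the tree is counted.
            if os.path.islink(path):
                continue
            try:
                total += os.path.getsize(path)
            except OSError:
                # A file can disappear while the index is being rewritten;
                # ignore it rather than fail the whole measurement.
                pass
    return total
```

Calling `index_size_bytes(path)` with the path of an index directory returns the total in bytes. The figure is only a snapshot, since files may be added or removed while the index is being updated.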

Truncated XML responses from CoreAdminHandler

2009-07-18 Thread James Brady
The Solr application I'm working on has many concurrently active cores - on the order of thousands at a time. The management application depends on being able to query Solr for the current set of live cores, a requirement I've been satisfying using the STATUS core admin handler method. However, once
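To make the STATUS-based approach concrete, here is a hedged sketch of how a client might extract the live core names from a STATUS response body and notice when the XML has been cut short. It is not code from the thread: `live_core_names` is a hypothetical helper, and the response layout (a `<lst name="status">` element holding one `<lst>` per core) is an assumption about the CoreAdminHandler XML output.

```python
import xml.etree.ElementTree as ET


def live_core_names(status_xml):
    """Return core names from a CoreAdmin STATUS response body (hypothetical helper).

    Raises ValueError when the body is not well-formed XML, which is how a
    truncated response would normally show up on the client side.
    """
    try:
        root = ET.fromstring(status_xml)
    except ET.ParseError as exc:
        raise ValueError("STATUS response is not well-formed XML: %s" % exc)
    # Assumed layout: <response><lst name="status"><lst name="CORE">...</lst></lst></response>
    status = root.find("./lst[@name='status']")
    if status is None:
        return []
    return [core.get("name") for core in status.findall("lst")]
```

Parsing strictly and treating a parse failure as an error, rather than scraping names out of a partial body, keeps the management application from acting on an incomplete list of cores.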

Re: Truncated XML responses from CoreAdminHandler

2009-07-18 Thread Otis Gospodnetic
James,
Not enough memory and garbage collection? Connecting to Solr via JConsole should show it.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: James Brady
> To: solr-user@lucene.apache.org
> Sent: Saturday, July 18, 2009 5:02:42 PM
>

Re: Wikipedia or reuters like index for testing facets?

2009-07-18 Thread Grant Ingersoll
It's only really effective if the number of tokens in the Sink is expected to be significantly less than (my various tests showed around < 50%, but YMMV) so it isn't likely useful for most copy field situations. For Solr to utilize it, the schema would have to allow for giving ids to the var

Re: Wikipedia or reuters like index for testing facets?

2009-07-18 Thread Alexandre Rafalovitch
I have something that maybe could be made into one: http://uncorpora.org/ It contains resolutions of the United Nations General Assembly in the 6 official languages, aligned at the paragraph level, in an XML (Translation Memory eXchange) format. The 6 languages are: English, French, Spanish, Arabic, Chinese, Ru