Liz,

I've built terabyte-scale (1-2 TB) test Lucene indexes, but have not
reached the petabyte level, so I am not sure. Certainly there is
overhead in using HTTP and XML marshaling/unmarshaling, which may
or may not be a critical factor for you.

Could you give more information about your application: the nature of
your data loading (e.g. many PB at once, GB per hour/day/week
accumulating to PB, or MB per second/minute/hour eventually
accumulating to PB), your searching (e.g. the number of fields
indexed, the query complexity, whether you are using facets), and the
number of queries per second you expect?

Lucene has a limit on the number of documents in a single index that
might impact your application. Document counts are returned as a
32-bit signed int, so the maximum is 2,147,483,647:
http://lucene.apache.org/java/3_0_2/api/core/org/apache/lucene/index/IndexWriter.html#numDocs%28%29
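
To make that limit concrete, here is a rough sketch (not anything from
Lucene or Solr itself) of how many separate indexes you would need at
a minimum for a given document count. The class name, method name and
the 10 billion figure are made up for illustration, and the ceiling
assumes only the int limit mentioned above; the practical per-index
limit may well be lower.

```java
public class ShardEstimate {
    // Assumed ceiling: document counts in a single index are Java ints,
    // so one index cannot hold more than Integer.MAX_VALUE documents.
    static final long MAX_DOCS_PER_INDEX = Integer.MAX_VALUE; // 2,147,483,647

    // Smallest number of separate indexes needed to hold totalDocs documents.
    static long shardsNeeded(long totalDocs) {
        if (totalDocs < 0) {
            throw new IllegalArgumentException("totalDocs must be >= 0");
        }
        long full = totalDocs / MAX_DOCS_PER_INDEX;
        long remainder = totalDocs % MAX_DOCS_PER_INDEX;
        // One extra index for any leftover documents.
        return remainder == 0 ? full : full + 1;
    }

    public static void main(String[] args) {
        long totalDocs = 10_000_000_000L; // hypothetical corpus size
        System.out.println("Indexes needed: " + shardsNeeded(totalDocs));
    }
}
```

This only gives a lower bound from the document count; index size on
disk and query load would normally force you to split much earlier.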

-glen

On 24 August 2010 12:29, Liz Sommers <lizswo...@gmail.com> wrote:
> I was worried that it wouldn't scale.  We are going to be indexing petabytes
> of data.  Does the httpserver solution scale?
>
> Thanks
>
> Liz Sommers
> lizswo...@gmail.com
>
> On Tue, Aug 24, 2010 at 12:23 PM, Thomas Joiner
> <thomas.b.joi...@gmail.com> wrote:
>
>> Is there any reason you aren't using http://wiki.apache.org/solr/Solrj to
>> interact with Solr?
>>
>> On Tue, Aug 24, 2010 at 11:12 AM, Liz Sommers <lizswo...@gmail.com> wrote:
>>
>> > I am very new to the solr/lucene world.  I am using solr 1.4.0 and cannot
>> > move to 1.4.1.
>> >
>> > I have to index about 50 fields for each document, these fields are already
>> > in key/value pairs by the time I get to my index methods.  I was able to
>> > index them with lucene without any problem, but found that I could not then
>> > read the indexes with solr/admin.  So, I decided to use Solr for my
>> > indexing.
>> >
>> > The error I am currently getting is
>> > java.lang.RuntimeException: Can't find resource 'synonyms.txt' in classpath
>> > or 'solr/conf'/'
>> >
>> > This exception is being thrown by SolrResourceLoader.openResource(line
>> > 260).
>> > which is called by IndexSchema<init> (line 102)
>> >
>> > My code that leads up to this follows:
>> >
>> > <code>
>> > String path = "c:/swdev/apache-solr-1.4.0/IDW";
>> > SolrConfig cfg = new SolrConfig(path + "/solr/conf/solrconfig.xml");
>> > schema = new IndexSchema(cfg,path + "/solr/conf/schema.xml",null);
>> >
>> > </code>
>> >
>> > This also fails if I use
>> > schema = new IndexSchema(cfg,"schema.xml",null);
>> >
>> >
>> > Any help would be greatly appreciated.
>> >
>> > Thank you
>> >
>> > Liz Sommers
>> > lizswo...@gmail.com
>> >
>>
>


