You need to translate your source data size into a number of documents and
an average document size. Document size depends on the number of fields,
the type of data in each field, and the size of the data in each field.
Consider numeric and date fields, raw string fields, and keyword text
fields separately.

Solr and Lucene do not simply index a bulk blob of bytes; they index
semi-structured data, in the form of documents and fields.

In some cases the index can be smaller than the source data, and in others
it can be larger.
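
As a rough illustration of that translation, here is a minimal Python
sketch. The 400 GB figure comes from the question; the average document
size and the index-to-source ratios are made-up placeholders, not
measurements or Solr guidance, and the function names are hypothetical.
Replace the numbers with values sampled from your own data.

```python
# Rough sizing sketch. Every number except the 400 GB source size is a
# placeholder assumption; sample your own data to get real values.

GB = 1024 ** 3


def estimate_doc_count(source_bytes, avg_doc_bytes):
    """Translate total source size into an approximate document count."""
    return source_bytes // avg_doc_bytes


def index_size_range(source_bytes, low_ratio, high_ratio):
    """Bracket the index size using assumed index-to-source ratios."""
    return source_bytes * low_ratio, source_bytes * high_ratio


source_bytes = 400 * GB      # from the question: about 400 GB of text
avg_doc_bytes = 4 * 1024     # assumption: 4 KiB per document
docs = estimate_doc_count(source_bytes, avg_doc_bytes)

# Assumed ratios: the index may be smaller or larger than the source.
low, high = index_size_range(source_bytes, 0.5, 1.5)

print(f"approx. {docs:,} documents")
print(f"index between {low / GB:.0f} GB and {high / GB:.0f} GB")
```

Indexing a small, representative sample and measuring the resulting index
on disk is the only way to replace the assumed ratios with real ones.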


-- Jack Krupansky

On Wed, May 27, 2015 at 12:33 PM, Sznajder ForMailingList <
bs4mailingl...@gmail.com> wrote:

> Hi ,
>
> Could you give me some hints wrt HW requirements for Solr if I need to
> index about 400 Gigas of text?
>
> Thanks
>
> Benjamin
>
