I believe it's 10,000 tokens, not characters, but that's a quibble.

Yes, you need to change maxFieldLength to be greater than
any doc you expect to index. It can be made huge, I don't
think there's a penalty for making this number, say, 100,000,000
and indexing documents with only 10 tokens.....

Best
Erick

On Fri, Nov 7, 2008 at 9:48 AM, Dan A. Dickey <[EMAIL PROTECTED]> wrote:

> I just came across the maxFieldLength setting for the mainIndex
> in solrconfig.xml and have a question or two about it.
> The default value is 10000.
>
> I'm extracting text from pdf documents and
> storing them into a text field.  Is the length of this text field limited
> to 10000 characters?  Many pdf documents are megabytes in size.
> Do this mean that only the first 10000 characters are getting indexed?
>
> Is there a good way to index the whole document, or do I just simply
> need to increase the size of maxFieldLength?  What performance
> ramifications would something like this have?
>
> Please advise... thanks!
>        -Dan
>
> --
> Dan
> A. Dickey
> [EMAIL PROTECTED]
>
> SAVVIS, Inc.
> Built to Respond
>

Reply via email to