I believe it's 10,000 tokens, not characters, but that's a quibble. Yes, you need to change maxFieldLength to be greater than any doc you expect to index. It can be made huge, I don't think there's a penalty for making this number, say, 100,000,000 and indexing documents with only 10 tokens.....
Best Erick On Fri, Nov 7, 2008 at 9:48 AM, Dan A. Dickey <[EMAIL PROTECTED]> wrote: > I just came across the maxFieldLength setting for the mainIndex > in solrconfig.xml and have a question or two about it. > The default value is 10000. > > I'm extracting text from pdf documents and > storing them into a text field. Is the length of this text field limited > to 10000 characters? Many pdf documents are megabytes in size. > Do this mean that only the first 10000 characters are getting indexed? > > Is there a good way to index the whole document, or do I just simply > need to increase the size of maxFieldLength? What performance > ramifications would something like this have? > > Please advise... thanks! > -Dan > > -- > Dan > A. Dickey > [EMAIL PROTECTED] > > SAVVIS, Inc. > Built to Respond >