Hi

I am using Nutch to crawl a site, and post it in Solr 3.6.1. The page is
very large. 

When I query the index, using the Solr Admin query page, it only finds the
result if it is in the top X% of the page, probably about 30%.

The page is about 79Kb, and consists of 19,067 words.

Is there a setting somewhere that sets the maxFieldSize? Or maxTokenSize?

I set the field content to be displayed on the result page, and it displays
all the data correctly, where I can see all the tokens I get no results
from.

I can't split the page up, as it is auto-generated from a database.

Any help gratefully received.

Thanks Mark



-- 
 The Wellcome Trust Sanger Institute is operated by Genome Research 
 Limited, a charity registered in England with number 1021457 and a 
 company registered in England with number 2742969, whose registered 
 office is 215 Euston Road, London, NW1 2BE. 

Reply via email to