> > Our queries do not sort by any field. However, we do make use of
> > FunctionQueries and a typical query is something like:
> >
> > users_query AND (+linear_function_query +recip_function_query
> > +language:english^0 -flags:spam^0)
>
> Function queries often build fieldCaches--on how many fields
> do you use function queries, and how big is the set of unique
> values for those fields?

2 fields:
- date string with hundreds of unique values
- an integer field with < 250 unique values

To tweak our scoring, a custom hit collector in SolrIndexSearcher
creates 1 fieldCache and 3 ValueSources from 3 fields:
- an integer field with many unique values (order 10^4)
- another integer field with many unique values (order 10^4)
- an integer field with hundreds of unique values

I thought a function query used ValueSource, so perhaps usage is
similar in both cases. Would a ValueSource load all values into
memory, or just unique ones?

> Is user_query a string of keywords, or is it an arbitrary
> query in lucene syntax?

It's whatever the user types into a search box (supports arbitrary
lucene). Some queries are intentionally harsh, like 'george OR bush'
or 'the OR at'. The latter matches virtually every document in the
index.

Thanks again,
-Graham