English, Eben [eengl...@bpl.org] wrote:
> Is there any kind of general rule-of-thumb character limit in regards to 
> faceting on very long strings?

Not really. There are limits, but they are quite high. Due to a bad analyzer we 
had an index with ~1M unique facet values that ranged from 100-3000 characters 
and besides a very messy GUI, we did not notice anything problematic.

> I have a string field that I want to facet on (contains geographic data 
> structured as a GeoJSON Feature),
> where the length is typically around 220 characters. Is this too long to 
> facet on, performance-wise?

I doubt you will be able to measure the difference between that and 10 
character strings. There is a question of string comparison upon index open and 
the result must be serialized, but for most of the facet operations, the length 
matters little. If your strings were 100 times longer, you might have seen a 
performance impact.

> [...] possibly about 200,000 unique values for this GeoJSON string field.

I see no problems with this. 200K values of 220 characters is only large if you 
insist on returning everyone of them as the facet result.

- Toke Eskildsen

Reply via email to