Re: faceted browsing

Trey Hyde Tue, 04 Apr 2006 10:24:34 -0700

Chris Hostetter wrote:


: My (our) query plugin uses specialized SolrCaches in lieu of the meta
: data records.   For each new searcher installed, each field's possible
: values will be determined and stored in a cache (off the top of my head,

Are you determining the field values based on all indexed values for those
fields, or do you have application specific logic in the plugin that knows
certain fields (like "price") should be ranges, while other fields should
be discrete?

Yes. We filter on facets in 2 major ways. For unranged attributes we simply specify the normalized value that should appear in the field (duh). For ranged attributes we specify the field, an operator and the normalized value we are comparing against. Here is an example of a passed parameter.


&atr_A00053=K02147U00054||>4194304

That tells the system to give me only computers with more than 4MB of RAM (wasn't that obvious?). In this case the K...U... number isn't actually used (translated, it means "4MB"), only the field (A00053) and the normalized field value (4194304 ... that nonsense value that currently means 4MB, 4096KB, etc).
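
A minimal sketch of how a parameter in that shape could be taken apart, written in Python for brevity rather than the Java the plugin lives in. The grammar, the operators other than ">", and the names RangedFilter, parse_ranged_filter and matches are assumptions made for illustration, not the plugin's actual code.

```python
import operator
import re
from typing import NamedTuple


class RangedFilter(NamedTuple):
    field: str       # attribute field, e.g. "A00053"
    label_code: str  # the K...U... code; carried along, not used for filtering
    op: str          # comparison operator
    value: int       # normalized value to compare against


# Assumed grammar: atr_<field>=<label code>||<operator><integer>
_PARAM = re.compile(
    r"^atr_(?P<field>\w+)=(?P<code>[^|]*)\|\|(?P<op>>=|<=|>|<)(?P<value>\d+)$"
)

# Only ">" appears in the example; the other operators are assumed.
_OPS = {">": operator.gt, "<": operator.lt, ">=": operator.ge, "<=": operator.le}


def parse_ranged_filter(param: str) -> RangedFilter:
    """Parse one ranged attribute parameter, with or without a leading '&'."""
    m = _PARAM.match(param.lstrip("&"))
    if m is None:
        raise ValueError(f"not a ranged attribute filter: {param!r}")
    return RangedFilter(m["field"], m["code"], m["op"], int(m["value"]))


def matches(f: RangedFilter, normalized_value: int) -> bool:
    """Return True if a document's normalized field value passes the filter."""
    return _OPS[f.op](normalized_value, f.value)


if __name__ == "__main__":
    f = parse_ranged_filter("&atr_A00053=K02147U00054||>4194304")
    print(f)
    print(matches(f, 8388608))
```

Keeping the label code in the parsed result mirrors the description above: it travels with the parameter but plays no part in the comparison.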

This system only exists to maintain compatibility with systems previously used to manage our AltaVista based search engine. It's not pretty but it works well given our current functionality requirements. It also doesn't do bounded searches like 4MB to 8MB.

that's the reason why I used special metadata docs -- actually that's only
part of the reason, i needed the facets to be data driven to allow our
site staff to manage them, and i needed to support vastly different facets
based on category (hence: one metadata doc per category).

Right, it's all about customer requirements. As above, the data gets pulled from a live DB by the web front end to produce the query strings as options to the user, and the logic is embedded in the query string. What I'd really like to see is an XML query language so I can toss all the hackish URL query arguments and really move much of the query plugin logic out into the query itself instead of in the Java code.

I do intend to revamp our faceting engine in our next major release to customers. We'll introduce dynamic attribute bucketing. Rather than produce a list of counts of all values for an attribute and have "at least" or "at most" options, users will be given ranged lists based on the actual distribution of the facets. I haven't really worked out the details since I haven't actually begun the design, but I'm probably going to see if I can just look at it like it's on a bell curve and start picking evenly sized buckets. Monitors <= 15" (10), 15 -> 17 (10), 17 -> 21 (10), 21 -> 25 (10), > 25 (10). Now obviously I can't force it into a nice distribution like that but I'll figure out something. In any case, the bucket ranges will need to be based on the actual distribution (easy to maintain, hard to implement) in the current result set and not some pre-manufactured bucket categories (easy to implement, hard to maintain) as those get obsoleted fairly quickly.
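
One way the "evenly sized buckets" idea could be approached is plain equal-frequency (quantile) bucketing over the values in the current result set. The sketch below is in Python and is only an illustration under that assumption; the design has not been worked out, and the function name equal_frequency_buckets and its return shape are made up here.

```python
from collections import Counter


def equal_frequency_buckets(values, n_buckets):
    """Group values into at most n_buckets ranges with roughly equal counts.

    Returns a list of (low, high, count) tuples in ascending order.
    Identical values are never split across two buckets, so counts are
    only approximately even when the data is skewed, and fewer than
    n_buckets ranges may come back.
    """
    if n_buckets < 1:
        raise ValueError("n_buckets must be at least 1")

    distinct = sorted(Counter(values).items())  # (value, count), ascending
    total = sum(count for _, count in distinct)

    buckets = []
    low = None   # lower bound of the bucket being filled
    size = 0     # number of values in the bucket being filled
    seen = 0     # number of values consumed so far
    for i, (value, count) in enumerate(distinct):
        if low is None:
            low = value
        size += count
        seen += count
        # Close the bucket once the running total reaches the next
        # boundary, i.e. seen >= total * (k / n_buckets), in integer math.
        reached = seen * n_buckets >= total * (len(buckets) + 1)
        if reached or i == len(distinct) - 1:
            buckets.append((low, value, size))
            low, size = None, 0
    return buckets


if __name__ == "__main__":
    monitor_sizes = [14, 15, 15, 17, 17, 19, 20, 21, 24, 30]
    for low, high, count in equal_frequency_buckets(monitor_sizes, 2):
        print(f"{low} -> {high} ({count})")
```

Because the boundaries come from the values actually present, the ranges shift as the result set changes, which is the "easy to maintain" property described above; the cost is that heavily repeated values can leave the buckets uneven.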
