I'm experienced with Lucene, less so than SOLR. I am looking at two systems built on top of SOLR for a library discovery service: blacklight and vufind.

I checked the raw lucene index using Luke and noticed that both of these indexes have single character terms in the index, such as "d" or "f". I asked about this on the vufind list, and was told I didn't understand SOLR and why it would need these.

So I'm now asking: why would SOLR want single character terms? "a" is usually a stopword. I know the Library MARC data from which the index is derived has a lot of these characters because they denote subfields in the data. But why would we want them to be searchable?

Naomi Dushay
[EMAIL PROTECTED]



Reply via email to