The Internet Archive is getting close to going live with Solr.
I have two remaining classes of problems.

1) across the entire index, enumerate all the unique values for a given field.
2) we use unrestricted dynamicField additions from documents.  (that is our users are free to add any named field they like to their document's data (which is metadata for their item)).  we want to list all the unique field names in the index.

Eg:
<doc>
  ...
 <mediatype>audio</mediatype>
</doc>
<doc>
  ...
  <mediatype>movies</mediatype>
  <collection>prelinger</collection>
</doc>

1) would yield a list of audio and movies if the field passed in was mediatype
2) would yield a list of  mediatype and collection


>From our prior implementation of a java + lucene search engine, we already
ran in to queries that our SE could not handle.  So we nightly build a cache
structure to handle those other queries.  We *could* solve 1) and 2) in
this nightly cache, but ideally we'd like to use Solr if possible.

thanks!
--tracey


--
       --Tracey Jaquith - http://www.archive.org/~tracey --

Reply via email to