On Jun 5, 2009, at 12:13 AM, Jörg Agatz wrote:

Hallo, Solr users...

I have a Problem!

I Have a lot of files, fome of the Files are exist in more than one version.
often they are only little changes in the files...

Now i musst find a way to get only the last of each file.
The normal Results are maby 500 Documents, but from each document are exist 2 or 3 revisions, nuw i hop to find with your help a way to get, only the
300 - 200 newes Document!


Do you ever have a requirement to get at the old revisions? Because if you don't need the old revisions _ever_ then just make sure, during indexing, that they file has the same Unique Id and then Solr will replace it.

Otherwise, you can look at the field collapsing work going on in JIRA (SOLR-236) and also looking at using a Function Query that boosts by relevancy. The latter approach will just help make sure the newer results appear on top, but won't exclude the older revisions.

HTH,
Grant

--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using Solr/Lucene:
http://www.lucidimagination.com/search

Reply via email to