On Jun 5, 2009, at 12:13 AM, Jörg Agatz wrote:
Hallo, Solr users...
I have a Problem!
I Have a lot of files, fome of the Files are exist in more than one
version.
often they are only little changes in the files...
Now i musst find a way to get only the last of each file.
The normal Results are maby 500 Documents, but from each document
are exist
2 or 3 revisions, nuw i hop to find with your help a way to get,
only the
300 - 200 newes Document!
Do you ever have a requirement to get at the old revisions? Because
if you don't need the old revisions _ever_ then just make sure, during
indexing, that they file has the same Unique Id and then Solr will
replace it.
Otherwise, you can look at the field collapsing work going on in JIRA
(SOLR-236) and also looking at using a Function Query that boosts by
relevancy. The latter approach will just help make sure the newer
results appear on top, but won't exclude the older revisions.
HTH,
Grant
--------------------------
Grant Ingersoll
http://www.lucidimagination.com/
Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)
using Solr/Lucene:
http://www.lucidimagination.com/search