custom reranking

2009-02-26 Thread CIF Search
We have a distributed index consisting of several shards. There could be some documents repeated across shards. We want to remove the duplicate records from the documents returned from the shards, and re-order the results by grouping them on the basis of a clustering algorithm and reranking the doc
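The deduplication step described above could be sketched roughly as follows. This is only an illustration, not Solr's own merge logic: it assumes each shard returns a list of hits as dictionaries with a unique `id` field and a `score` field (both field names are assumptions), and `merge_shard_results` is a hypothetical helper. The clustering-based reranking is left out, since the thread does not specify the algorithm.

```python
from typing import Dict, Iterable, List


def merge_shard_results(shard_results: Iterable[List[Dict]], key: str = "id") -> List[Dict]:
    """Merge per-shard hit lists, keeping one copy of each document.

    Assumption: every hit is a dict carrying a unique key (default "id")
    and a numeric "score". When the same document appears on several
    shards, the highest-scoring copy is kept.
    """
    best: Dict[str, Dict] = {}
    for hits in shard_results:
        for hit in hits:
            doc_id = hit[key]
            if doc_id not in best or hit["score"] > best[doc_id]["score"]:
                best[doc_id] = hit
    # Order the deduplicated hits by score, best first. A custom
    # reranking step would replace or follow this sort.
    return sorted(best.values(), key=lambda h: h["score"], reverse=True)


shard_a = [{"id": "1", "score": 0.9}, {"id": "2", "score": 0.5}]
shard_b = [{"id": "2", "score": 0.7}, {"id": "3", "score": 0.6}]
merged = merge_shard_results([shard_a, shard_b])
```

Keeping the highest-scoring copy is one possible policy; scores from different shards are not necessarily comparable, so a real deployment might prefer a fixed shard priority instead.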

Re: custom reranking

2009-02-26 Thread CIF Search
, 2009, at 6:04 AM, CIF Search wrote: > > We have a distributed index consisting of several shards. There could be >> some documents repeated across shards. We want to remove the duplicate >> records from the documents returned from the shards, and re-order the >> results by g

input XSLT

2009-03-09 Thread CIF Search
Just as you have an XSLT response writer to convert the Solr XML response to make it compatible with any application, is there an XSLT module on the input side that will convert XML documents to Solr format before posting them to the Solr indexer? I have gone through DataImportHandler, but it works in dat
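To make the question concrete, here is a minimal client-side sketch of the kind of conversion being asked about. It is not XSLT and not a Solr feature: it is a plain Python stand-in that rewrites an arbitrary XML document into Solr's `<add><doc><field name="...">` update format before posting. The function name `to_solr_add`, the `record` element name, and the sample input are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def to_solr_add(source_xml: str, record_tag: str = "record") -> str:
    """Convert application XML into Solr's XML update message.

    Assumption: each <record> element (hypothetical name) holds one
    document, and each of its child elements maps to one Solr field
    named after the child's tag.
    """
    root = ET.fromstring(source_xml)
    add = ET.Element("add")
    for record in root.iter(record_tag):
        doc = ET.SubElement(add, "doc")
        for child in record:
            field = ET.SubElement(doc, "field", name=child.tag)
            field.text = (child.text or "").strip()
    return ET.tostring(add, encoding="unicode")


sample = "<records><record><id>42</id><title>Hello</title></record></records>"
solr_xml = to_solr_add(sample)
```

The resulting string could then be posted to the Solr update handler by whatever HTTP client the application already uses.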

Re: input XSLT

2009-03-12 Thread CIF Search
will also facilitate plugging in solr seamlessly with all kinds of applications. Regards, CI On Wed, Mar 11, 2009 at 11:52 PM, Noble Paul നോബിള്‍ नोब्ळ् < noble.p...@gmail.com> wrote: > On Tue, Mar 10, 2009 at 12:17 PM, CIF Search wrote: > > Just as you have an xslt response w

Re: input XSLT

2009-03-12 Thread CIF Search
man...@gmail.com> wrote: > On Fri, Mar 13, 2009 at 11:36 AM, CIF Search wrote: > > > There is a fundamental problem with using 'pull' approach using DIH. > > Normally people want a delta imports which are done using a timestamp > > field. > > Now it may not

response time

2009-04-06 Thread CIF Search
Hi, I have around 10 Solr servers running indexes of around 80-85 GB each, with 16,000,000 docs each. When I use distributed search for querying, I am not getting a satisfactory response time. My response time is around 4-5 seconds. Any suggestions to improve the response time for queries (to bring it

Re: custom reranking

2009-04-07 Thread CIF Search
reorder the results returned by Solr accordingly and not depend purely on relevance as we do today. Regards, CI On Fri, Feb 27, 2009 at 5:21 PM, Grant Ingersoll wrote: > > On Feb 26, 2009, at 11:16 PM, CIF Search wrote: > > I believe the query component will generate the query in such

Re: response time

2009-04-07 Thread CIF Search
yes, non cached. If I repeat a query the response is fast since the results are cached. 2009/4/7 Noble Paul നോബിള്‍ नोब्ळ् > are these the numbers for non-cached requests? > > On Tue, Apr 7, 2009 at 11:46 AM, CIF Search wrote: > > Hi, > > > > I have around 10 solr

truncating indexed docs

2009-04-15 Thread CIF Search
Is it possible to truncate large documents once they are indexed? Can this be done without re-indexing? Regards, CI