Using remote Nutch Server to crawl, then merging results into local index

2010-12-23 Thread Dietrich
ndex Any suggestions would be highly appreciated. Dietrich Schmidt http://www.linkedin.com/in/dietrichschmidt

Boost newer documents only if date is different from timestamp

2011-04-28 Thread Dietrich
I am trying to boost newer documents in Solr queries. The ms function http://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_boost_the_score_of_newer_documents seems to be the right way to go, but I need to add an additional condition: I am using the last-Modified-Date from crawled web pages as the

Specifying backup location in solrconfig.xml

2011-05-17 Thread Dietrich
ow can I specify the location for the backup in solrconfig.xml Dietrich

Re: Specifying backup location in solrconfig.xml

2011-05-17 Thread Dietrich
slave can be another core in the same Solr instance. > > > On 5/17/2011 2:20 PM, Dietrich wrote: >> I am using Solr Replication to create a snapshot for backup purposes >> after each optimize: >> >>     >>         optimize >>     > name="c

How to index multiple sites with option of combining results in search

2008-03-25 Thread Dietrich
I am planning to index 275+ different sites with Solr, each of which might have anywhere up to 200 000 documents. When performing searches, I need to be able to search against any combination of sites. Does anybody have suggestions what the best practice for a scenario like that would be, consideri

Re: How to index multiple sites with option of combining results in search

2008-03-25 Thread Dietrich
x27;t want to (or need to) use a crawler. I am using a crawler-base system now, and it does not offer the flexibility I need when it comes to custom schemes and faceting. > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > - Origin

Re: How to index multiple sites with option of combining results in search

2008-03-26 Thread Dietrich
es for that? -ds On Tue, Mar 25, 2008 at 11:25 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Dietrich, > > I pointed to SOLR-303 because 275 * 200,000 looks like a too big of a number > for a single machine to handle. > > > Otis > -- > Sematext -

Re: How to index multiple sites with option of combining results in search

2008-03-26 Thread Dietrich
s On Wed, Mar 26, 2008 at 2:05 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Dietrich, > > I don't think there are established practices in the open (yet). You could > design your application with a site(s)->shard mapping and then, knowing which > sites are invo

Download of old solr releases

2012-07-25 Thread Nicolas Dietrich
Hi there, it looks like the old releases have been thrown out of the download servers, for example http://apache.mirrors.tds.net/lucene/solr/1.4.1/apache-solr-1.4.1.tgz Is this on purpose or a mistake, or have I overseen something? Thanks for clarification. Cheers, Nicolas

Re: Download of old solr releases

2012-07-25 Thread Nicolas Dietrich
On 07/25/2012 07:17 PM, Chris Hostetter wrote: > > : it looks like the old releases have been thrown out of the download > : servers, for example > > This is standard practice for apache projects so that the mirror network > doesn't have to store gigs and gigs of ancient files that most people

Re: User search in Facebook like

2009-05-31 Thread Dietrich Featherston
try searching for matches where the name starts with whatever the user has entered so far with a wildcard ?q=vinc* Are you always going to be searching for names? If so you could see if the user has entered two terms and suffix each with a wildcard to get potentially more relevant searches. For