solr index reusable with nutch?

2006-12-13 Thread Thorsten Scherler
Hi all, is it possible to directly use the solr index in nutch? My client is creating a portal search based on nutch. In this portal there is as well my project and ATM I prefer to go with solr instead of nutch since it its much better for my use case. Now the question is whether the portal sear

Re: solr index reusable with nutch?

2006-12-13 Thread Otis Gospodnetic
Hi, Solr should be able to search any Lucene index, not just those created by Solr itself, as long as you configure it properly via schema.xml. Thus, you should be able to use Solr to search an index created by Nutch. Haven't tried it. It would be nice if you could contribute the configurati

'New' "Date Math" parsing code in Solr

2006-12-13 Thread Chris Hostetter
(I have this nasty habit of commiting cool things to Solr that should be announced on solr-user, and then deciding I'll wait untill they are in a nightly snapshot before I send an email about them -- and then forgetting that I never sent the mail). A while back I added some functionality to the D

Re: automatic index time field?

2006-12-13 Thread Chris Hostetter
: Is there a way to automatically set a field when a document is indexed? : Specifically, I'd like to have a date field updated to the current time when : a document is indexed. Your message reminded me that i never announced the new "Date Match" parsing code, which does let you say something lik

Re: Strange Sorting results on a Text Field

2006-12-13 Thread Tracey Jaquith
Despite considerations of stemming and such for "text" type fields, is it the case that if we have a single value "text" type field, will sorting work, though? --tracey On 9/11/06, Tom Weber <[EMAIL PROTECTED]> wrote: Thanks also for the "multiValued" explanation, this is useful for my curr

Re: automatic index time field?

2006-12-13 Thread ryan mckinley
thanks for the advice. I implemented option #2, followed the directions on: http://wiki.apache.org/solr/HowToContribute and made: http://issues.apache.org/jira/browse/SOLR-82 The only change I might make is to have the schema store if it has fields with default values so that DocumentBuilder.g

Case sensitivity on hostnames and email addresses

2006-12-13 Thread Wade Leftwich
I've run into some unexpected case sensitivity on searches, at least unexpected by me. If you index a text field containing this sentence: A sentence containing CamelCase words by [EMAIL PROTECTED] is found at StudlyCaps.org The document will be found by searching for "camelcase" but not for "[E

Re: Case sensitivity on hostnames and email addresses

2006-12-13 Thread Otis Gospodnetic
When indexing (and searching), make sure you are using an Analyzer that lower-cases (or upper-cases) tokens. These are from Lucene, so Solr has them, too: ./src/java/org/apache/lucene/analysis/LowerCaseTokenizer.java ./src/java/org/apache/lucene/analysis/LowerCaseFilter.java Otis - Origi

Re: Case sensitivity on hostnames and email addresses

2006-12-13 Thread Walter Underwood
Also, avoid stemming URLs. I used a stemmer that turned my "best.com" URL into "good.com". The Lucene StandardAnalyzer works pretty hard to avoid that. --wunder On 12/13/06 9:33 PM, "Otis Gospodnetic" <[EMAIL PROTECTED]> wrote: > When indexing (and searching), make sure you are using an Analyzer

Re: Case sensitivity on hostnames and email addresses

2006-12-13 Thread Yonik Seeley
On 12/13/06, Wade Leftwich <[EMAIL PROTECTED]> wrote: I've run into some unexpected case sensitivity on searches, at least unexpected by me. If you index a text field containing this sentence: A sentence containing CamelCase words by [EMAIL PROTECTED] is found at StudlyCaps.org The document wi

Re: Case sensitivity on hostnames and email addresses

2006-12-13 Thread Yonik Seeley
Oh, and yet another way to get around it (with it's own trade offs) is to use something like fieldtype textTight in the example schema.xml, which catenates all word parts in both the index analyzer and query analyzer. This would index as "upanddownmysitecom" and allow the following queries to mat

Re: Strange Sorting results on a Text Field

2006-12-13 Thread Chris Hostetter
: Despite considerations of stemming and such for "text" : type fields, is it the case that : if we have a single value "text" type field, : will sorting work, though? correct ... KeywordTokenizer with Filters of your choice should produce a sortable string of whatever form you desire. -Hoss

Re: solr index reusable with nutch?

2006-12-13 Thread Thorsten Scherler
On Wed, 2006-12-13 at 07:45 -0800, Otis Gospodnetic wrote: > Hi, > > Solr should be able to search any Lucene index, ok, good to know. :) So can I guess that the same is true for nutch? Meaning the index solr is creating could be used by a nutch searcher. > not just those created by Solr itse