Re: faceted result

2007-03-16 Thread Erik Hatcher
You've got your field set to be analyzed, and it's using a stemmer. Chances are you don't intend to analyze the fields you're faceting on (and if you are doing that intentionally, performance caveats apply). Check that the field type is "string" and re-index. Erik On Mar 16, 2007
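To make the point above concrete, here is a minimal sketch of why facet values look trimmed on an analyzed field: facet counts are taken over the indexed terms, so a stemmed field reports stems. The `toy_stem` function and the sample values are hypothetical and only mimic the effect; this is not the stemmer Solr uses.

```python
from collections import Counter


def toy_stem(word):
    """Toy suffix stripper for illustration only; NOT the stemmer Solr ships."""
    for suffix in ("ing", "es", "e", "s"):
        # Strip the first matching suffix, but keep at least three characters.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


# Hypothetical field values from three documents.
values = ["atmosphere", "atmosphere", "oceans"]

# An analyzed (stemmed) field is faceted on its indexed terms, i.e. the stems.
analyzed_facets = Counter(toy_stem(v) for v in values)

# An untokenized "string" field keeps each value verbatim.
string_facets = Counter(values)

print(analyzed_facets)
print(string_facets)
```

With a "string" field the facet labels match the original values, which is usually what a facet display needs.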

Returning xx number of each group in a single query?

2007-03-16 Thread Brian Lucas
Is there a way to fetch 5 records with group_id:1, 5 records with group_id:2, 5 records with group_id:3, and so forth in a single query? The facet features don't seem to give me what I need -- same with rows. Any ideas on how to do something like this?

Re: match any field value? field:*?

2007-03-16 Thread Ryan McKinley
thank you thank you that does it.

Re: match any field value? field:*?

2007-03-16 Thread Erik Hatcher
On Mar 16, 2007, at 6:17 PM, Ryan McKinley wrote: Is it possible to search for all documents with a field? Yup... I have documents with a bunch of fields and a field for 'error' (string field with the actual error). Is it possible to search for documents that have an error? Considering the
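The reply is cut off in this listing, so the following is not necessarily what was suggested in the thread. One form commonly used in Solr for "any document that has a value in this field" is an open-ended range query, `field:[* TO *]`. A minimal sketch, assuming a local instance at `http://localhost:8983/solr/select` (a placeholder URL) and the `error` field from the question:

```python
from urllib.parse import urlencode


def has_value_query(field):
    """Return an open-ended range query matching documents with any value in `field`."""
    return f"{field}:[* TO *]"


q = has_value_query("error")

# The host, port, and path are assumptions; adjust them to your installation.
url = "http://localhost:8983/solr/select?" + urlencode({"q": q, "rows": "10"})

print(q)
print(url)
```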

faceted result

2007-03-16 Thread Giri
Hi, I am using Solr's faceted response, and the result seems to be OK on the numbers it is returning, but the words are trimmed in the response, for example, instead of "atmosphere" the response is returning as "atmospher", similarly the last letter (or last few letters) of most of the words are t

Re: Commit after how many updates?

2007-03-16 Thread Otis Gospodnetic
+1 to what Mike said. I am running some Lucene benchmarks as we type and this is exactly what I just saw. On a beefy box with 32GB RAM I'm searching 63GB worth of Lucene indices. I gave the JVM 20GB (-Xmx20g) at first and saw a bit of disk IO. Then I lowered that max heap to 10GB and the disk

match any field value? field:*?

2007-03-16 Thread Ryan McKinley
Is it possible to search for all documents with a field? I have documents with a bunch of fields and a field for 'error' (string field with the actual error). Is it possible to search for documents that have an error? Considering the new *:* syntax, I would guess something like "error:*" Do I

Re: cache sizes

2007-03-16 Thread Erik Hatcher
On Mar 16, 2007, at 2:21 PM, Andrew Nagy wrote: Is there a science to choosing cache sizes? I have about 500,000 records and am seeing a lot of evictions, about 50% of lookups. What factors can I look at to determine what my cache sizes should be? Roughly you could start with getting a

Re: Commit after how many updates?

2007-03-16 Thread Mike Klaas
On 3/16/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: : I thought so, but hoped there would be some experiences with heap space : settings for Solr. But I guess I have to try for myself. there's lots of experience, but it's hard to translate to generic rules ... there's so many variables invol

Re: Performance penalty for Multivalued field?

2007-03-16 Thread Chris Hostetter
: 3) when faceting, a multiValued field always uses the TermEnum : algorithm rather than the FieldCache algorithm. Damn. Good catch J.J. ... i totally forgot about that and it certainly is a "performance penalty" if you use facets. : depending on the data, this can have a dramatic effect on

Re: Performance penalty for Multivalued field?

2007-03-16 Thread J.J. Larrea
Perhaps not relevant in this case, but for the record there is one more SOLR behavior affected by multiValued: 3) when faceting, a multiValued field always uses the TermEnum algorithm rather than the FieldCache algorithm. depending on the data, this can have a dramatic effect on faceting perf

cache sizes

2007-03-16 Thread Andrew Nagy
Is there a science to choosing cache sizes? I have about 500,000 records and am seeing a lot of evictions, about 50% of lookups. What factors can I look at to determine what my cache sizes should be? Here are my cache statistics: filterCache class: org.apache.solr.search.LRUCache ver

Re: Bug ? unique id

2007-03-16 Thread Chris Hostetter
: but can someone please answer my question :'( : is it illegal to put filters on the unique id? : or is it a bug that we get duplicate ids? : or is this a known issue (since everybody is using copyfields?) there's nothing illegal about using an Analyzer on your uniqueKey, but you have to ensure

Re: how to balance index and search

2007-03-16 Thread Chris Hostetter
: Can people from cnet tell how to use solr in CNET.COM ? I really don't understand your question, here's some links to CNET.com that use Solr... http://www.cnet.com/4244-5_1-0.html?query=ipod http://search.news.com/search?q=apple http://reviews.cnet.com/4566-3121-0.html -Hoss

Re: how to balance index and search

2007-03-16 Thread Chris Hostetter
if your indexing while searching is causing problems, one way to reduce the impact is to index on a master instance and then use the replication scripts to sync it up with a slave instance (where all of your searches happen) if you are specifically seeing high CPU when indexing HTML, that's probabl

Re: Multiple Fields syntax

2007-03-16 Thread Chris Hostetter
: will this query: : q=title:photoshop-adobe+OR+description:photoshop;score+desc&version=2.2&start=0&rows=170&indent=on&fl=*,score : : return documents that have photoshop but NOT adobe in the title and/or : photoshop in the description? "-" and "+" are only treated special if they are at the st
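As a sketch of how the intent in the question ("photoshop but NOT adobe in the title, and/or photoshop in the description") could be written, the example below puts the "-" at the start of its own clause and lets `urlencode` handle the escaping. In a URL a bare "+" decodes to a space, so a literal "+" operator has to be sent as `%2B`. The exact query string is an illustration, not a quote from the reply, which is truncated here.

```python
from urllib.parse import urlencode

# "+" and "-" act as required/prohibited operators when they start a clause.
q = "(+title:photoshop -title:adobe) OR description:photoshop"

# urlencode turns spaces into "+" and a literal "+" into "%2B".
query_string = urlencode({"q": q, "fl": "*,score", "rows": "170"})

print(query_string)
```

Hand-building the query string is where "+" most often goes wrong, since an unescaped "+" reaches the query parser as a space.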

Re: Restrict Servlet Access

2007-03-16 Thread Chris Hostetter
: on a given port, say :8983. I have my firewall (iptables) set up so : that the outside world cannot connect to :8983. However, my httpd : server, running on port 80, can connect to solr because they are : running on the same box. Therefore all access to solr is mediated : through whatever applica

Re: Commit after how many updates?

2007-03-16 Thread Chris Hostetter
: I thought so, but hoped there would be some experiences with heap space : settings for Solr. But I guess I have to try for myself. there's lots of experience, but it's hard to translate to generic rules ... there's so many variables involved that it's hard to even recognize what the equation is

Re: Performance penalty for Multivalued field?

2007-03-16 Thread Chris Hostetter
: I would be shocked if you noticed any performance difference between a : single-valued field and a multivalued field with one entry. there shouldn't be any difference at all in search performance, or index size ... marking a field multiValued should only have two effects: 1) on document add/

Re: DisMax Question.

2007-03-16 Thread Chris Hostetter
: but no results for: : qt=dismax&q= blabla : ^1.5&version=2.2&start=0&rows=1&indent=on&fl=*,score&debugQuery=true&sort=length_seconds+desc : : doesn't DisMax support term boosting? the dismax syntax is specifically very limited to match the common assumptions end users have about search input ..

Re: Reloading solr schema file

2007-03-16 Thread Chris Hostetter
: As a newbie Java programmer I'm not sure what happens if I do the : following: : : SolrCore core = new SolrCore(null,null); : : Will it replace the current core? What happens to requests that are running? I think the best answer to that question is "undefined behavior" ... that's not a situatio

RE: fetch only facets

2007-03-16 Thread Gunther, Andrew
Would you set rows=0. -Andrew -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Friday, March 16, 2007 12:24 PM To: solr-user@lucene.apache.org Subject: fetch only facets Is there a way to not return any docs and only facets? I tried setting the fl equal to bl
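A minimal sketch of the rows=0 suggestion: the request still runs the query and computes facet counts, but returns no documents. `rows`, `facet`, and `facet.field` are standard Solr request parameters; the base URL and the `subject` field name are placeholders assumed for illustration.

```python
from urllib.parse import urlencode


def facet_only_url(base_url, query, facet_field):
    """Build a Solr select URL that asks for facet counts but zero documents."""
    params = [
        ("q", query),
        ("rows", "0"),                 # return no documents
        ("facet", "true"),             # turn faceting on
        ("facet.field", facet_field),  # field to count values for
    ]
    return base_url + "?" + urlencode(params)


# Host, path, and field name are assumptions; substitute your own.
url = facet_only_url("http://localhost:8983/solr/select", "*:*", "subject")
print(url)
```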

fetch only facets

2007-03-16 Thread Andrew Nagy
Is there a way to not return any docs and only facets? I tried setting the fl equal to blank, but then I get everything back. Thanks Andrew

Re: Bug ? unique id

2007-03-16 Thread Erik Hatcher
On Mar 16, 2007, at 10:54 AM, [EMAIL PROTECTED] wrote: because we want to be able to search our unique ids :) and we would like to use the Latin character filter and the Lowercase filter so our searches don't have to be case sensitive and stuff. that seems reasonable, however you'd have to be

Re: Bug ? unique id

2007-03-16 Thread Maarten . De . Vilder
yes, that is exactly what we are doing now ... copyfield with the filters ... we figured that much :) but we are talking about a couple of million records, so the less data we copy the better ... but can someone please answer my question :'( is it illegal to put filters on the unique id ? or is

RE: Bug ? unique id

2007-03-16 Thread Gunther, Andrew
Why not use CopyField and put an analyzer on that field? -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Friday, March 16, 2007 10:54 AM To: solr-user@lucene.apache.org Subject: Re: Bug ? unique id because we want to be able to search our unique id's :) and we

Re: Bug ? unique id

2007-03-16 Thread Paul Borgermans
Hi Maarten Why not copy your unique id into another field with the required filters and use that for search? Regards Paul On 3/16/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote: because we want to be able to search our unique id's :) and we would like to use the Latin character filter and th

Re: Bug ? unique id

2007-03-16 Thread Maarten . De . Vilder
because we want to be able to search our unique ids :) and we would like to use the Latin character filter and the Lowercase filter so our searches don't have to be case sensitive and stuff. thanks for the quick response! grts,m Erik Hatcher <[EMAIL PROTECTED]> 16/03/2007 12:09 Please respo

Re: Bug ? unique id

2007-03-16 Thread Erik Hatcher
Why in the world would you want to analyze your unique id? Erik On Mar 16, 2007, at 6:07 AM, [EMAIL PROTECTED] wrote: Hello, we have been using Solr for a month now and we are running into a lot of trouble. one of the issues is a problem with the unique id field. can this field

Re: About field-specific analyzer

2007-03-16 Thread Erik Hatcher
On Mar 16, 2007, at 5:17 AM, shjiang wrote: I don't understand how Solr makes field-specific analysis possible. In the source code, they didn't use the PerFieldAnalyzerWrapper class. Can anyone tell me something about that? It's configured through schema.xml. Solr has a fairly sophisticat

Bug ? unique id

2007-03-16 Thread Maarten . De . Vilder
Hello, we have been using Solr for a month now and we are running into a lot of trouble. one of the issues is a problem with the unique id field. can this field have analyzers, filters and tokenizers on it? because when we use filters or tokenizers on our unique id field, we get duplicate i

About field-specific analyzer

2007-03-16 Thread shjiang
I don't understand how Solr makes field-specific analysis possible. In the source code, they didn't use the PerFieldAnalyzerWrapper class. Can anyone tell me something about that?