You've got your field set to be analyzed, and its using a stemmer.
Chances are you don't intend to analyze the fields you're faceting on
(and if you are doing that intentionally, performance caveats
apply). Check that the field type is "string" and re-index.
Erik
On Mar 16, 2007
Is there a way to fetch 5 records with group_id:1, 5 records with group_id:2,
5 records with group_id:3, and so forth in a single query?
The facet features don't seem to give me what I need -- same with rows. Any
ideas on how to do something like this?
--
View this message in context:
http:/
thank you thank you
that does it.
On Mar 16, 2007, at 6:17 PM, Ryan McKinley wrote:
Is it possible to search for all documents with a field?
Yup...
I have documents with a bunch of fields and a field for 'error'
(string field with the actual error). Is it possible do search for
documents that have an error? Considering the
Hi,
I am using Solr's faceted response, and the result seems to be OK on the
numbers it is returning, but the words are trimmed in the response, for
example, instead of "atmosphere" the response is returning as "atmospher",
similarly the last letter (or last few letters) of most of the words are
t
+1 to what Mike said. I am running some Lucene benchmarks as we type and this
is exactly what I just saw.
On a beefy box with 32GB RAM I'm searching 63GB worth of Lucene indices. I
gave the JVM 20GB (-Xmx20g) at first and saw a bit of disk IO. Then I lowered
that max heap to 10GB and the disk
Is it possible to search for all documents with a field?
I have documents with a bunch of fields and a field for 'error'
(string field with the actual error). Is it possible do search for
documents that have an error? Considering the new *:* syntax, i would
guess something like "error:*"
Do i
On Mar 16, 2007, at 2:21 PM, Andrew Nagy wrote:
Is their a science to choosing a cache sizes? I have about 500,000
records and am seeing a lot of evictions, about 50% of lookups.
What factors can i look at to determine what my cache sizes should be?
Roughly you could start with getting a
On 3/16/07, Chris Hostetter <[EMAIL PROTECTED]> wrote:
: I thought so, but hoped there would be some experiences with heap space
: settings for Solr. But I guess I have to try for myself.
there's lots of experience, but it's hard to translate to generic rules
... there's so many variables invol
: 3) when faceting, a multiValued field always uses the TermEnum
: algorithm rather than the FieldCache algorithm.
Damn. Good catch J.J. ... i totally forgot about that and it certainly is
a "performance penalty" if you use facets.
: depending on the data, this can have a dramatic effect on
Perhaps not relevant in this case, but for the record there is one more SOLR
behavior affected by multiValued:
3) when faceting, a multiValued field always uses the TermEnum algorithm
rather than the FieldCache algorithm.
depending on the data, this can have a dramatic effect on faceting perf
Is their a science to choosing a cache sizes? I have about 500,000
records and am seeing a lot of evictions, about 50% of lookups. What
factors can i look at to determine what my cache sizes should be?
Here are my cache statistics:
filterCache
class: org.apache.solr.search.LRUCache
ver
: but can someone please answer my question :'(
: is it illegal to put filters on the unique id ?
: or is it a bug that we get duplicate id's?
: or is this a know issue (since everybody is using copyfields?)
there's nothing illegal about using an Analyzer on your uniqueKey, but you
have to ensure
: Can people from cnet tell how to use solr in CNET.COM ?
I really don't understand your question, here's some links to CNET.com
that use Solr...
http://www.cnet.com/4244-5_1-0.html?query=ipod
http://search.news.com/search?q=apple
http://reviews.cnet.com/4566-3121-0.html
-Hoss
if your indexing while searching is causing problems, one way to reduce
the impact is to index on a master instance and then use the replication
scripts to sync it up with a slave instance (where all of your searches
happen)
if you are specificly seeing high CPU when indexing HTML, that's probabl
: will this query:
:
q=title:photoshop-adobe+OR+description:photoshop;score+desc&version=2.2&start=0&rows=170&indent=on&fl=*,score
:
: return documents that have photoshop but NOT adobe in the title and/or
: photoshop in the description?
"-" and "+" are only treated special if they are at the st
: on a given port, say :8983. I have my firewall (iptables) set up so
: that the outside world cannot connect to :8983. However, my httpd
: server, running on port 80, can connect to solr because they are
: running on the same box. Therefore all access to solr is mediated
: through whatever applica
: I thought so, but hoped there would be some experiences with heap space
: settings for Solr. But I guess I have to try for myself.
there's lots of experience, but it's hard to translate to generic rules
... there's so many variables involved that it's hard to even recognize
what the equation is
: I would be shocked if you noticed any performance difference between a
: single-valued field and a multivalued field with one entry.
there shouldnt' be any difference at all in search performance, or index
size ... marking a field multiValued should only have two effects:
1) on document add/
: but no results for:
: qt=dismax&q= blabla
:
^1.5&version=2.2&start=0&rows=1&indent=on&fl=*,score&debugQuery=true&sort=length_seconds+desc
:
: doesn't DisMax support term boosting?
the dismax syntax is specifically very limited to match the common
assumptions end users have about search input ..
: As a newbiie Java programmer I'm not sure what happens if I do the
: following:
:
: SolrCore core = new SolrCore(null,null);
:
: Will it replace the current core? What happens to requests that are running?
I think the best answer to that question is "undefined behavior" ...
that's not a situatio
Would you set rows=0.
-Andrew
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Friday, March 16, 2007 12:24 PM
To: solr-user@lucene.apache.org
Subject: fetch only facets
Is their a way to not return any docs and only facets? I tried setting
the fl equal to bl
Is their a way to not return any docs and only facets? I tried setting
the fl equal to blank, but then i get everything back.
Thanks
Andrew
On Mar 16, 2007, at 10:54 AM, [EMAIL PROTECTED] wrote:
because we want to be able to search our unique id's :)
and we would like to use the Latin character filter and the Lowercase
filter so our searches dont have to be case sensitive and stuff.
that seems reasonable, however you'd have to be
yes, that is exactly what we are doing now ... copyfield with the filters
... we figured that much :)
but we are talking about a couple of million records, so the less data we
copy the better ...
but can someone please answer my question :'(
is it illegal to put filters on the unique id ?
or is
Why not use CopyField and put an analyzer on that field?
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Friday, March 16, 2007 10:54 AM
To: solr-user@lucene.apache.org
Subject: Re: Bug ? unique id
because we want to be able to search our unique id's :)
and we
Hi Maarten
Why not copy your unique id into another field with the required filters and
use that for search?
Regards
Paul
On 3/16/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
because we want to be able to search our unique id's :)
and we would like to use the Latin character filter and th
because we want to be able to search our unique id's :)
and we would like to use the Latin character filter and the Lowercase
filter so our searches dont have to be case sensitive and stuff.
thanks for the quick response!
grts,m
Erik Hatcher <[EMAIL PROTECTED]>
16/03/2007 12:09
Please respo
Why in the world would you want to analyze your unique id?
Erik
On Mar 16, 2007, at 6:07 AM, [EMAIL PROTECTED] wrote:
Hello,
we have been using Solr for a month now and we are running into a
lot of
trouble .
one of the issues is a problem with the unique id field.
can this field
On Mar 16, 2007, at 5:17 AM, shjiang wrote:
I don't understand how solr make field-specific analysis possible .In
the source code ,they didn't use the PerFieldAnalyzerWrapper class.Can
any one tell me something about that?
It's configured through schema.xml.Solr has a fairly
sophisticat
Hello,
we have been using Solr for a month now and we are running into a lot of
trouble .
one of the issues is a problem with the unique id field.
can this field have analyzer, filters and tokenizers on it ??
because when we use filters or tokenizers on our unique id field, we get
duplicate i
I don't understand how solr make field-specific analysis possible .In
the source code ,they didn't use the PerFieldAnalyzerWrapper class.Can
any one tell me something about that?
32 matches
Mail list logo