Slow response

2007-09-06 Thread Aaron Hammond
I am pretty new to Solr and this is my first post to this list so please
forgive me if I make any glaring errors. 

 

Here's my problem. When I do a search using the Solr admin interface for
a term that I know does not exist in my index the QTime is about 1ms.
However, if I add facets to the search the response takes more than 20
seconds (and sometimes longer) to return. Here is the slow URL - 

 

/select?qf=AUTHOR_t+SUBJECT_t+TITLE_t&wt=xml&f.AUTHOR_facet.facet.sort=t
rue&f.FORMAT_t.facet.limit=25&start=0&facet=true&facet.mincount=1&q=frak
&f.FORMAT_t.facet.mincount=1&f.ITYPE_facet.facet.mincount=1&f.SUBJECT_fa
cet.facet.limit=25&facet.field=AUTHOR_facet&facet.field=FORMAT_t&facet.f
ield=LANGUAGE_t&facet.field=PUBDATE_t&facet.field=SUBJECT_facet&facet.fi
eld=AGENCY_facet&facet.field=ITYPE_facet&f.AGENCY_facet.facet.sort=true&
f.AGENCY_facet.facet.limit=-1&rows=10&f.ITYPE_facet.facet.limit=-1&f.ITY
PE_facet.facet.sort=true&f.AUTHOR_facet.facet.limit=25&f.LANGUAGE_t.face
t.sort=true&f.PUBDATE_t.facet.limit=-1&f.AGENCY_facet.facet.mincount=1&f
.AUTHOR_facet.facet.mincount=1&fl=*&fl=score&qt=dismax&version=2.2&f.SUB
JECT_facet.facet.sort=true&f.SUBJECT_facet.facet.mincount=1&f.PUBDATE_t.
facet.sort=false&f.FORMAT_t.facet.sort=true&f.LANGUAGE_t.facet.limit=25&
f.LANGUAGE_t.facet.mincount=1&f.PUBDATE_t.facet.mincount=1

 

I am pretty sure I can't be the first to ask this question but I can't
seem to find anything online with the answer. Thanks for your help.

 

Aaron



RE: Slow response

2007-09-06 Thread Aaron Hammond
Thank-you for your response, this does shed some light on the subject.
Our basic question was why were we seeing slower responses the smaller
our result set got. 

Currently we are searching about 1.2 million documents with the source
document about 2KB, but we do duplicate some of the data. I bumped up my
filterCache to 5 million and the 2nd search I did for an non-indexed
term came back in 2.1 seconds so that is much improved. I am a little
concerned about having this value so high but this is our problem and we
will play with it. 

I do have a few follow-up questions. First, in regards to the
filterCache once a single search has been done and facets requested, as
long as new facets aren't requested and the size is large enough then
the filters will remain in the cache, correct?

Also, you mention that faceting is more a "function of the number of the
number of terms in the field". The 2 fields causing our problems are
Authors and Subjects. If we divided up the data that made these facets
into more specific fields (Primary author, secondary author, etc.) would
this perform better? So the number of facet fields would increase but
the unique terms for a given facet should be less.

Thanks again for all your help.

Aaron


-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Yonik
Seeley
Sent: Thursday, September 06, 2007 4:17 PM
To: solr-user@lucene.apache.org
Subject: Re: Slow response

On 9/6/07, Aaron Hammond <[EMAIL PROTECTED]> wrote:
> I am pretty new to Solr and this is my first post to this list so
please
> forgive me if I make any glaring errors.
>
> Here's my problem. When I do a search using the Solr admin interface
for
> a term that I know does not exist in my index the QTime is about 1ms.
> However, if I add facets to the search the response takes more than 20
> seconds (and sometimes longer) to return. Here is the slow URL -

Faceting on multi-value fields is more a function of the number of
terms in the field (and their distribution) rather than the number of
hits for a query.  That said, perhaps faceting should be able to bail
out if there are no hits.

Is your question more about why faceting takes so long in general, or
why it takes so long if there are no results?  If you haven't, try
optimizing your index for facet faceting in general.  How many docs do
you have in your index?

As a side note, the way multi-valued faceting currently works, it's
actually normally faster if the query returns a large number of hits.

-Yonik


RE: Query for German "Special Characters " (i.e., ä, ö, ß)

2007-09-13 Thread Aaron Hammond
Are you using Tomcat with Solr? If so you need to add the URIEncoding attribute 
to your Connector. See this url -

http://tomcat.apache.org/tomcat-6.0-doc/config/http.html

I hope this helps. If you are using Jetty then . :) 

Aaron

-Original Message-
From: Marc Bechler [mailto:[EMAIL PROTECTED] 
Sent: Thursday, September 13, 2007 3:14 PM
To: solr-user@lucene.apache.org
Subject: Query for German "Special Characters" (i.e., ä, ö, ß)

Hi SOLR kings,

I'm just playing around with queries, but I was not able to query for 
any special characters like the German "Umlaute" (i.e., ä, ö, ü). Maybe 
others might have the same effects and already found a solution ;-)

Here is my example: I have one field called "sometext" of type "text" 
(the one delivered with the SOLR example). I indexed a few words similar to




Works fine, and searching for "really" shows the result and fünny will 
be displayed correctly. However, the query for "fünny" using the 
/solr/admin page is resolved (correctly) to the URL ...q=f%C3%BCnny... 
but does not find the document.

And now the question: Any ideas? ;-)

Cheers,

  marc