Re: Multiple open SegmentReaders?

2008-05-01 Thread Koji Sekiguchi
I can reproduce with solr/example setup. What I did: 1. $ svn co http://svn.apache.org/repos/asf/lucene/solr/trunk TEMP 2. $ cd TEMP 3. $ ant clean example 4. $ cd example 5. $ java -jar start.jar (to post commit) 6. $ cd $SOLR_HOME/example/exampledocs 7. $ ./post.sh then see admin>statistics.

Re: Bizarre DisMax behavior: "q" parameter not working but "q.alt" is, and multi-valued fields not matching at all

2008-05-01 Thread Chris Hostetter
: I would: : - enter the problematic query on the Solr Admin Analysis page and make sure all is sane there : - use &debugQuery=true and see how your query is rewritten Also look at the explanation output -- even if no docs are being returned you can use explainOther to see *why* something isn't

Re: MultiCore and Distributed Search

2008-05-01 Thread Otis Gospodnetic
It's in (you can see from SOLR-303) and it works. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Lance Norskog <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Cc: "Norskog, Lance" <[EMAIL PROTECTED]> > Sent: Friday, May 2, 2008 2:43:3

MultiCore and Distributed Search

2008-05-01 Thread Lance Norskog
Is Distributed Search () in the main line yet? Is it considered useable? And, how closely does it match the Wiki entry? https://issues.apache.org/jira/browse/SOLR-303 http://wiki.apache.org/solr/DistributedSearch

top documented in faceted query?

2008-05-01 Thread peter360
The faceted query returns a list of values with associated doc count. Is it possible to also get the top document id associated with each value? Basically I want a "representative" for each facet group. What is the best way to achieve this? To be more precise, suppose the query "q=x&facet=true

Re: Master / slave setup with multicore

2008-05-01 Thread James Brady
Hi Ryan, thanks for that! I have one outstanding question: when I take a snapshot on the master, snappull and snapinstall on the slave, the new index is not being used: restarting the slave server does pick up the changes, however. Has anyone else had this problem with recent development bu

Re: dismax query handler ignoring qf entirely!

2008-05-01 Thread Otis Gospodnetic
Unless I'm not understanding what you are saying, then no, this is not expected behaviour - DisMax doesn't rely on one copying the actual field data to a "text" field. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Ezra Epstein <[EMAIL PROT

Re: Bizarre DisMax behavior: "q" parameter not working but "q.alt" is, and multi-valued fields not matching at all

2008-05-01 Thread Otis Gospodnetic
I don't see an immediate problem. I would: - enter the problematic query on the Solr Admin Analysis page and make sure all is sane there - use &debugQuery=true and see how your query is rewritten Otis-- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From

dismax query handler ignoring qf entirely!

2008-05-01 Thread Ezra Epstein
It appears as though the DisMax query handler is ignoring our qf settings and only searching the "text" field as defined in the element of the schema.xml file. Thus if a field exists and is indexed it is not being searched unless its contents were copied to the "text" field. Is that corrected/ex

Bizarre DisMax behavior: "q" parameter not working but "q.alt" is, and multi-valued fields not matching at all

2008-05-01 Thread Ezra Epstein
Config: 1. The relevant part of the solrconfig.xml: explicit 0.01 primaryCategory^2 cat^0.5 primaryCategory^2 cat^0.5 id,contentID 100 *:* 2. The relevant part of the schema.xml 3. Some

Re: token concat filter?

2008-05-01 Thread Geoffrey Young
Otis Gospodnetic wrote: Geoff, Whether synonyms are applied at index time or query time is controlled via schema.xml - it depends on where you put the synonym factory, whether in the index-time or query-time section of a fieldType. Synonyms are read once on start, I believe. It might be good

Re: token concat filter?

2008-05-01 Thread Otis Gospodnetic
Geoff, Whether synonyms are applied at index time or query time is controlled via schema.xml - it depends on where you put the synonym factory, whether in the index-time or query-time section of a fieldType. Synonyms are read once on start, I believe. It might be good to have them read at ind

Re: token concat filter?

2008-05-01 Thread Geoffrey Young
Walter Underwood wrote: I doubt it would be that many. I recommend tracking the searches and the clicks, and working on queries with low clickthrough. the trouble is I'm in a dynamic biz - last weeks popular clicks are very different from this weeks, so by the time I analyze last weeks popul

Re: token concat filter?

2008-05-01 Thread Walter Underwood
I doubt it would be that many. I recommend tracking the searches and the clicks, and working on queries with low clickthrough. Here are a few of mine from that sort of analysis: ghost dog => ghost dog, ghostdog ghost hunters => ghost hunters, ghosthunters ghost rider => ghost rider, ghostrider gh

Re: token concat filter?

2008-05-01 Thread Geoffrey Young
Walter Underwood wrote: I've been doing it with synonyms and I have several hundred of them. I'm dealing mostly with proper names, so I expect more like 80k of them for our data :) Concatenating bi-word groups is pretty useful for English. We have a habit of gluing words together. "datab

Re: token concat filter?

2008-05-01 Thread Walter Underwood
I've been doing it with synonyms and I have several hundred of them. Concatenating bi-word groups is pretty useful for English. We have a habit of gluing words together. "database" used to be two words. Dictionaries still think it should be "web server". wunder On 5/1/08 10:47 AM, "Geoffrey Youn

Re: token concat filter?

2008-05-01 Thread Geoffrey Young
Yonik Seeley wrote: If there are only a few such cases, it might be better to use synonyms to correct them. unfortunately, there are too many to handle this way. Off the top of my head there's no concatenating token filter, but it wouldn't be hard to make one. hmm, ok. I'm not a java guy

Re: access control list

2008-05-01 Thread Kevin Osborn
I thought of that method. The problem I was thinking of is that if a new customer is added, that could potentially cause an update of about 2,000,000 records or so. Fortunately, this does not happen everyday. It also make indexing a little difficult because I now have to check permissions on eac

ERROR:unknown field, but what document was it?

2008-05-01 Thread solr
Hi, The way I understand it, having multiple documents in a single xml file and posting that to solr is faster then posting a single xml for each document (even with a single commit after all separate xml files). It makes sence, and some simple tests indicates that this is true. However,

solr on ubuntu 8.04

2008-05-01 Thread Jack Bates
I am trying to evaluate Solr for an open source records management project to which I contribute: http://code.google.com/p/qubit-toolkit/ I installed the Ubuntu solr-tomcat5.5 package: http://packages.ubuntu.com/hardy/solr-tomcat5.5 - and pointed my browser at: http://localhost:8180/solr/admin (T

Re: token concat filter?

2008-05-01 Thread Yonik Seeley
If there are only a few such cases, it might be better to use synonyms to correct them. Off the top of my head there's no concatenating token filter, but it wouldn't be hard to make one. -Yonik On Thu, May 1, 2008 at 8:44 AM, Geoffrey Young <[EMAIL PROTECTED]> wrote: > hi :) > > I'm looking for

token concat filter?

2008-05-01 Thread Geoffrey Young
hi :) I'm looking for a filter that will compress all tokens into a single token. the WordDelimiterFilterFactory does it for tokens it finds itself, but not ones passed to it. basically, I'm trying to match Radiohead in the index with radio head in the query. if it were spelled Radio