stopwords not working in multicore setup

2011-03-24 Thread Christopher Bottaro
Hello, I'm running a Solr server with 5 cores. Three are for English content and two are for German content. The default stopwords setup works fine for the English cores, but the German stopwords aren't working. The German stopwords file is stopwords-de.txt and resides in the same directory as

Re: [ANNOUNCEMENT] solr-packager 1.0.2 released!

2011-03-24 Thread Otis Gospodnetic
Hi Simone, This is handy! Any chance you'll be adding a version with Jetty 7.* ? Thanks, Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: Simone Tripodi > To: solr-user@lucene.apache.or

Re: solr on the cloud

2011-03-24 Thread Otis Gospodnetic
Hi, > I have tried running the sharded solr with zoo keeper on a single machine. > The SOLR code is from current trunk. It runs nicely. Can you please point me > to a page, where I can check the status of the solr on the cloud development > and available features, apart from http://wiki.apach

Re: Multiple Cores with Solr Cell for indexing documents

2011-03-24 Thread Markus Jelsma
I believe it's example/solr/lib where it looks for shared libs in multicore. But each core can have its own lib dir, usually in core/lib. This is referenced in solrconfig.xml; see the example config for the lib directive. > Well, there lies the problem--it's not JUST the Tika jar. If it's no

Re: Fuzzy query using dismax query parser

2011-03-24 Thread cyang2010
OK, i will have to wait till solr 3 release then.

Re: how to run boost query for non-dismax query parser

2011-03-24 Thread cyang2010
iorixxx, thanks for your reply. Another slightly off-topic question: I looked over all the subclasses of QParserPlugin. It seems like most of them provide complementary parsing to the default lucene/solr parser, except prefixParser. What is the intended usage of that one? The default luc

Multiple Cores with Solr Cell for indexing documents

2011-03-24 Thread Brandon Waterloo
Well, there lies the problem--it's not JUST the Tika jar. If it's not one thing, it's another, and I'm not even sure which directory Solr actually looks in. In my Solr.xml file I have it use a shared library folder for every core. Since each core will be holding very homologous data, there's

Re: Fuzzy query using dismax query parser

2011-03-24 Thread Ahmet Arslan
> I wonder how to conduct fuzzy query using dismax query > parser?  I am able to > do prefix query with local params and > prefixQueryParser.  But how to handle > fuzzy query?  > > I like the behavior of dismax except it does not support > the prefix query > and fuzzy query. You may interested i

RE: Multiple Cores with Solr Cell for indexing documents

2011-03-24 Thread Brandon Waterloo
Well, there lies the problem--it's not JUST the Tika jar. If it's not one thing, it's another, and I'm not even sure which directory Solr actually looks in. In my Solr.xml file I have it use a shared library folder for every core. Since each core will be holding very homologous data, there's

Newbie wants to index XML content.

2011-03-24 Thread Marcelo Iturbe
Hello, I've been reading up on how to index XML content but have a few questions. How is data in element attributes handled or defined? How are nested elements handled? In the following XML structure, I want to index the content of what is between the tags. In one XML document, there can be up t

Re: invert terms in search with exact match

2011-03-24 Thread Ahmet Arslan
Then you need to write some custom code for that. Lucene in Action Book (second edition, section 6.3.4) has an example for Translating PhraseQuery to SpanNearQuery. Just use false for the third parameter in SpanNearQuery's ctor. You can plug https://issues.apache.org/jira/browse/SOLR-1604 to

Fuzzy query using dismax query parser

2011-03-24 Thread cyang2010
Hi, I wonder how to conduct fuzzy query using dismax query parser? I am able to do prefix query with local params and prefixQueryParser. But how to handle fuzzy query? I like the behavior of dismax except it does not support the prefix query and fuzzy query. Thanks. cy

Re: how to run boost query for non-dismax query parser

2011-03-24 Thread Ahmet Arslan
> Thanks for your reply.  yeah, an additional query with > the boost value will > work. > > However, I just wonder where you get the information that > BoostQParserPlugin > only handles function query? > > I looked up the javadoc, and still can't get that.  > This is the javadoc. > > > Create a

Re: Solr throwing exception when evicting from filterCache

2011-03-24 Thread Yonik Seeley
On Thu, Mar 24, 2011 at 1:54 PM, Matt Mitchell wrote: > I have a recent build of solr (4.0.0.2011.02.25.13.06.24). I am seeing this > error when making a request (with fq's), right at the point where the > eviction count goes from 0 up: Yep, this was a bug that has since been fixed. -Yonik http:

Re: Solr throwing exception when evicting from filterCache

2011-03-24 Thread Matt Mitchell
Here's the full stack trace: [Ljava.lang.Object; cannot be cast to [Lorg.apache.solr.common.util.ConcurrentLRUCache$CacheEntry; java.lang.ClassCastException: [Ljava.lang.Object; cannot be cast to [Lorg.apache.solr.common.util.ConcurrentLRUCache$CacheEntry; at org.apache.solr.common.util.Concurrent

Re: how to run boost query for non-dismax query parser

2011-03-24 Thread cyang2010
Hi iorixxx, Thanks for your reply. yeah, an additional query with the boost value will work. However, I just wonder where you get the information that BoostQParserPlugin only handles function query? I looked up the javadoc, and still can't get that. This is the javadoc. Create a boosted quer

Re: Detecting an empty index during start-up

2011-03-24 Thread Chris Hostetter
: I am not familiar with Solr internals, so the approach I wanted to take was : to basically check the numDocs property of the index during start-up and set : a READABLE state in the ZooKeeper node if it's greater than 0. I also : planned to create a commit hook for replication and updating which :
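
The check described in the quoted plan (mark the node READABLE only when the index holds documents) might be sketched as below. This is a minimal illustration, not the poster's implementation: it assumes the document count is read from a parsed JSON response to a `q=*:*&rows=0` select request, and the `NOT_READABLE` state name and both function names are hypothetical. Writing the state to ZooKeeper is left out.

```python
def num_docs_from_response(response):
    """Read the hit count from a parsed select response for q=*:*&rows=0.

    For a match-all query, numFound equals the number of live documents.
    """
    return response["response"]["numFound"]


def node_state(num_docs):
    """Return the state to publish; NOT_READABLE is a hypothetical name."""
    return "READABLE" if num_docs > 0 else "NOT_READABLE"


# Example responses, shaped like Solr's JSON output (values are made up).
empty = {"responseHeader": {"status": 0}, "response": {"numFound": 0, "start": 0, "docs": []}}
filled = {"responseHeader": {"status": 0}, "response": {"numFound": 1250, "start": 0, "docs": []}}

empty_state = node_state(num_docs_from_response(empty))
filled_state = node_state(num_docs_from_response(filled))
```

Keeping the decision in a pure function means it can be called both at start-up and from a post-commit hook without duplicating the logic.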

Solr throwing exception when evicting from filterCache

2011-03-24 Thread Matt Mitchell
I have a recent build of solr (4.0.0.2011.02.25.13.06.24). I am seeing this error when making a request (with fq's), right at the point where the eviction count goes from 0 up: SEVERE: java.lang.ClassCastException: [Ljava.lang.Object; cannot be cast to [Lorg.apache.solr.common.util.ConcurrentLRUCa

Wanted: a directory of quick-and-(not too)dirty analyzers for multi-language RDF.

2011-03-24 Thread fr . jurain
Hello Solrists, As it says in the subject line, I'm looking for a Java component that, given an ISO 639-1 code or some equivalent, would return a Lucene Analyzer ready to gobble documents in the corresponding language. Solr looks like it has to contain one, only I've not been able to locate it s

Re: invert terms in search with exact match

2011-03-24 Thread Gastone Penzo
Yes, sorry, I made a mistake: title(my AND love AND darling), all three words have to match. The problem is still that I don't want results with other words. 2011/3/24 Dario Rigolin > On Thursday, March 24, 2011 03:52:31 pm Gastone Penzo wrote: > > > > > title1: my love darling > > title2: my darlin

Re: invert terms in search with exact match

2011-03-24 Thread Dario Rigolin
On Thursday, March 24, 2011 03:52:31 pm Gastone Penzo wrote: > > title1: my love darling > title2: my darling love > title3: darling my love > title4: love my darling Sorry but simply search for: title:( my OR love OR darling) If you have default operator OR you don't need to put OR on the q

Re: Multiple Cores with Solr Cell for indexing documents

2011-03-24 Thread Markus Jelsma
Sounds like the Tika jar is not on the class path. Add it to a directory where Solr's looking for libs. On Thursday 24 March 2011 16:24:17 Brandon Waterloo wrote: > Hello everyone, > > I've been trying for several hours now to set up Solr with multiple cores > with Solr Cell working on each core

Multiple Cores with Solr Cell for indexing documents

2011-03-24 Thread Brandon Waterloo
Hello everyone, I've been trying for several hours now to set up Solr with multiple cores with Solr Cell working on each core. The only items being indexed are PDF, DOC, and TXT files (with the possibility of expanding this list, but for now, just assume the only things in the index should be d

Re: invert terms in search with exact match

2011-03-24 Thread Jonathan Rochkind
You can use query slop as others have said to find documents with "my" and "love" right next to each other, in any order. And I think query slop can probably work for three or more words too to do that. But it won't find files with ONLY those words in it. For instance "my love"~2 will still ma

Re: dismax parser, parens, what do they do exactly

2011-03-24 Thread Jonathan Rochkind
Thanks Hoss, this is very helpful, okay, dismax is not intended to do anything with parens for semantics, they're just like any other char, handled by analyzers. I think you're right I cut and paste the wrong query before. Just for the record, on 1.4.1: qf=text pf= q=book (dog +(cat -frog))

Re: invert terms in search with exact match

2011-03-24 Thread Gastone Penzo
No, because I don't know the words I want to ignore, and I don't want to use dismax; I have to use the standard handler. The problem is very simple: I want to receive only documents whose title field contains ONLY the words I search for, in any order. If I search "my love darling", I want Solr to return these
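
One way to picture the requirement is a client-side post-filter. This is only a sketch under stated assumptions, not something the thread settled on and not a Solr feature. The query string follows the `title:(... AND ...)` form used elsewhere in this thread. The `tokenize` helper is a hypothetical stand-in for whatever analyzer the title field really uses, and the sample titles are the ones quoted in the thread.

```python
import re


def tokenize(text):
    """Lowercase and split on non-word characters (stand-in for the field analyzer)."""
    return [t for t in re.split(r"\W+", text.lower()) if t]


def build_query(field, phrase):
    """Require every searched word in the field, in any order."""
    return f"{field}:({' AND '.join(tokenize(phrase))})"


def only_these_words(title, phrase):
    """True when the title holds exactly the searched words, order ignored."""
    return sorted(tokenize(title)) == sorted(tokenize(phrase))


titles = [
    "my love darling",
    "my darling love",
    "darling my love",
    "love my darling",
    "my love is my dog",
    "my love",
]
query = build_query("title", "my love darling")
matches = [t for t in titles if only_these_words(t, "my love darling")]
```

The AND query narrows the candidates on the server; the filter then drops titles that carry extra words. Doing the second step inside Solr would need custom code, as other replies in this thread suggest.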

Re: invert terms in search with exact match

2011-03-24 Thread Bill Bell
Yes create qt with dismax and qf on field that has query stopwords for the words you want to ignore. Bill Bell Sent from mobile On Mar 24, 2011, at 7:58 AM, Gastone Penzo wrote: > Hi, > is it possible with standard query search (not dismax) to have > exact matches that allow any terms order?

Re: invert terms in search with exact match

2011-03-24 Thread Gastone Penzo
Hi Tommaso, thank you for the answer, but the problem with your solution is that Solr also returns docs with other words, for example: my love is the world. I want to exclude the other words. It must give me only docs with my love or love my, nothing else. Thank you 2011/3/24 Tommaso Teofili > Hi

Detecting an empty index during start-up

2011-03-24 Thread David McLaughlin
Hi, In our Solr deployment we have a cluster of replicated Solr cores, with the small change that we have dynamic master look-up using ZooKeeper. The problem I am trying to solve is to make sure that when a new Solr core joins the cluster it isn't made available to any search services until it has

Question about http://wiki.apache.org/solr/Deduplication

2011-03-24 Thread eks dev
Hi, Use case I am trying to figure out is about preserving IDs without re-indexing on duplicate, rather adding this new ID under list of document id "aliases". Example: Input collection: "id":1, "text":"dummy text 1", "signature":"A" "id":2, "text":"dummy text 1", "signature":"A" I add the first
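
The behaviour being asked for can be illustrated outside Solr with a small in-memory sketch. This is an assumption-laden illustration of the use case only: the `add_document` function and the `aliases` field layout are hypothetical, and nothing here reflects how Solr's deduplication update processor actually works.

```python
def add_document(index, doc):
    """Store doc under its signature, or record its id as an alias of the
    document already stored for that signature (no re-indexing)."""
    existing = index.get(doc["signature"])
    if existing is None:
        index[doc["signature"]] = {**doc, "aliases": []}
    elif doc["id"] != existing["id"] and doc["id"] not in existing["aliases"]:
        existing["aliases"].append(doc["id"])
    return index


# The two input documents from the example above share signature "A".
index = {}
add_document(index, {"id": 1, "text": "dummy text 1", "signature": "A"})
add_document(index, {"id": 2, "text": "dummy text 1", "signature": "A"})
```

After both calls, the index holds a single entry for signature "A": the document with id 1, carrying id 2 in its alias list.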

Re: invert terms in search with exact match

2011-03-24 Thread Tommaso Teofili
Hi Gastone, I think you should use proximity search as described here in Lucene query syntax page [1]. So searching for "my love"~2 should work for your use case. Cheers, Tommaso [1] : http://lucene.apache.org/java/2_9_3/queryparsersyntax.html#ProximitySearches 2011/3/24 Gastone Penzo > Hi, >

Re: how to run boost query for non-dismax query parser

2011-03-24 Thread Ahmet Arslan
> I need to code some boosting logic when some field equal to > some value.   I > was able to get it work if using dismax query parser.  > However, since the > solr query will need to handle prefix or fuzzy query, > therefore, dismax > query parser is not really my choice.  > > Therefore, i want

Re: invert terms in search with exact match

2011-03-24 Thread Ahmet Arslan
--- On Thu, 3/24/11, Gastone Penzo wrote: > From: Gastone Penzo > Subject: invert terms in search with exact match > To: solr-user@lucene.apache.org > Date: Thursday, March 24, 2011, 3:58 PM > Hi, > is it possible with standard query search (not dismax) to > have > exact matches that allow any

invert terms in search with exact match

2011-03-24 Thread Gastone Penzo
Hi, is it possible with standard query search (not dismax) to have exact matches that allow any term order? For example: if I search "my love", I would like Solr to give me docs with - my love - love my. It's easy: q=title:(my AND love). The problem is that it also returns docs with "my love is my dog"

Re: Why boost query not working?

2011-03-24 Thread Ahmet Arslan
--- On Thu, 3/24/11, cyang2010 wrote: > This solr query failed: > 1. get every title regardless of what the title_name is > 2. within the result, boost the one which genre id = > 56.  (bq=genres:56^100) > > http://localhost:8983/solr/titles/select?indent=on&version=2.2&start=0&rows=10&fl=*%2Cscore

Re: boosting with standard search handler

2011-03-24 Thread Gastone Penzo
Thank you Tommaso, your solution works. I read there's another method, using the _val_ parameter. Thanks, Gastone 2011/3/24 Tommaso Teofili > Hi Gastone, > I used to do that in standard search handler using the following > parameters: > q={!boost b=query($qq,0.7)} text:something title:other > qq=da

Re: boosting with standard search handler

2011-03-24 Thread Tommaso Teofili
Hi Gastone, I used to do that in standard search handler using the following parameters: q={!boost b=query($qq,0.7)} text:something title:other qq=date:[NOW-60DAY TO NOW]^5 OR date:[NOW-15DAY TO NOW]^8 which enables custom recency-based boosting. My 2 cents, Tommaso 2011/3/24 Gastone Penzo > Hi
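
For anyone wiring these parameters into a client, a minimal sketch of building the request query string follows. The `q` and `qq` values are the ones quoted in this message. The `boosted_params` helper name is hypothetical, and the sketch only encodes parameters; it sends nothing to a server.

```python
from urllib.parse import parse_qs, urlencode


def boosted_params(main_query, boost_query, default=0.7):
    """Wrap main_query in the {!boost} local-params form from this message.

    $qq dereferences the separate qq parameter; `default` is the value
    query() yields for documents that do not match qq.
    """
    return {
        "q": f"{{!boost b=query($qq,{default})}} {main_query}",
        "qq": boost_query,
    }


params = boosted_params(
    "text:something title:other",
    "date:[NOW-60DAY TO NOW]^5 OR date:[NOW-15DAY TO NOW]^8",
)
query_string = urlencode(params)      # percent-encoded, ready to append after "?"
round_trip = parse_qs(query_string)   # decoding returns the original values
```

Passing the boost clause as a separate `qq` parameter keeps the brackets and carets out of the local-params block, so only URL encoding has to be handled.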

boosting with standard search handler

2011-03-24 Thread Gastone Penzo
Hi, is it possible to boost fields, like the bf parameter of dismax, in the standard request handler? With or without functions? Thanks -- Gastone Penzo *www.solr-italia.it* *The first italian blog about Apache Solr*

Re: Problem with field collapsing of patched Solr 1.4

2011-03-24 Thread Kai Schlamp-2
Afroz Ahmad wrote: > > Have you enabled the collapse component in solrconfig.xml? > > class="org.apache.solr.handler.component.CollapseComponent" > /> > No, it seems that I missed that completely. Thank you, Afroz. It works fine now. Kai

Re: which German stemmer to use?

2011-03-24 Thread Paul Libbrecht
In our ActiveMath project, we have had positive feedback in Lucene with the SnowBallAnalyzer(Version.LUCENE_29,"German") which is probably one of the two below. I note that you may want to be careful to use one field with exact matching (e.g. whitespace analyzer and lowercase filter) and one f