Re: ranged query on multivalued field doesn't seem to work

2009-01-30 Thread zqzuk
Hi, I am still struggling with this... but I wonder whether it is because some data has the maximum integer value in the fields "start_year" and "end_year", like "2.14748365E9", which Solr does not recognise as "sfloat" because of the letter "E"? In terms of doing ranged queries on multi

Re: WebLogic 10 Compatibility Issue - StackOverflowError

2009-01-30 Thread Ilan Rabinovitch
I created a wiki page shortly after posting to the list: http://wiki.apache.org/solr/SolrWeblogic From what we could tell, Solr itself was fully functional; it was only the admin tools that were failing. Regards, Ilan Rabinovitch --- SCALE 7x: 2009 Southern California Linux Expo Los Angeles,

Re: How to handle database replication delay when using DataImportHandler?

2009-01-30 Thread Shalin Shekhar Mangar
On Fri, Jan 30, 2009 at 12:27 AM, Gregg Donovan wrote: > Noble, > > Thanks for the suggestion. The unfortunate thing is that we really don't > know ahead of time what sort of replication delay we're going to encounter > -- it could be one millisecond or it could be one hour. So, we end up > needi

Re: got background_merge_hit_exception during optimization

2009-01-30 Thread Yonik Seeley
What system and JVM was this using? Also, could you get the stack trace directly from the Solr logs and post it? -Yonik On Thu, Jan 29, 2009 at 4:06 PM, Qingdi wrote: > > We got the following background_merge_hit_exception during optimization: > exception: > )background_merge_hit_exception__4zsg

MultiValue DynamicFields?

2009-01-30 Thread Bruno Aranda
Hi, is it possible to create a dynamic field that is multi-valued? Cheers, Bruno

Re: MultiValue DynamicFields?

2009-01-30 Thread Alexander Ramos Jardim
Yes. It's totally acceptable. 2009/1/30 Bruno Aranda > Hi, it is possible to create a dynamic field that is multi valued? > > Cheers, > > Bruno > -- Alexander Ramos Jardim

Re: Optimizing & Improving results based on user feedback

2009-01-30 Thread Ryan McKinley
It may not be as fine-grained as you want, but also check the QueryElevationComponent. This takes a preconfigured list of what the top results should be for a given query and makes those documents the top results. Presumably, you could use click logs to determine what the top result shou

Re: query with stemming, prefix and fuzzy?

2009-01-30 Thread Gert Brinkmann
Thanks, Mark, for your answer. Mark Miller wrote: > Truncation queries and stemming are difficult partners. You likely have > to accept compromise. You can try using multiple fields like you are, I already have multiple fields, one per language, to be able to use different stemmers. Wouldn't bec

Re: Optimizing & Improving results based on user feedback

2009-01-30 Thread Sean Timm
Matthew Runo wrote: Which papers did you see that actually talked about using clicks? I don't see those, beyond "Addressing Malicious Noise in Clickthrough Data" by Filip Radlinski and also his "Query Chains: Learning to Rank from Implicit Feedback" - but neither is really on topic. Here are t

Re: Optimizing & Improving results based on user feedback

2009-01-30 Thread Matthew Runo
I've thought about patching the QueryElevationComponent to apply boosts rather than a specific sort. Then the file might look like.. query> And I could write a script that looks at click data once a day to fill out this file. Thanks for your time! Matthew Runo Software Engineer, Zappos.c

Re: Rsyncd start and stop for multiple instances

2009-01-30 Thread sunnyfr
Hi, how can I hack the existing script to support multiple rsync modules? rsyncd.conf file: uid = root gid = root use chroot = no list = no pid file = /data/solr/book/logs/rsyncd.pid log file = /data/solr/book/logs/rsyncd.log [solr] path = /data/solr/book/data comment = Solr Ho

User tag design for read-only index

2009-01-30 Thread Ryan McKinley
I am building a system that indexes a bunch of data and then will let users manually put the data in lists. I have seen http://wiki.apache.org/solr/UserTagDesign The behavior I would like is identical to 'tagging' each document with the list-id/user/order and then using standard faceting to sho

RE: Re: WebLogic 10 Compatibility Issue - StackOverflowError

2009-01-30 Thread Feak, Todd
Are the issues you ran into due to non-standard code in Solr, or is there some WebLogic inconsistency? -Todd Feak -Original Message- From: news [mailto:n...@ger.gmane.org] On Behalf Of Ilan Rabinovitch Sent: Friday, January 30, 2009 1:11 AM To: solr-user@lucene.apache.org Subject: Re: WebLogi

Re: Optimizing & Improving results based on user feedback

2009-01-30 Thread Ryan McKinley
yes, applying a boost would be a good addition. patches are always welcome ;) On Jan 30, 2009, at 10:56 AM, Matthew Runo wrote: I've thought about patching the QueryElevationComponent to apply boosts rather than a specific sort. Then the file might look like.. And I could write a scr

1.3 <-> 1.4 patch for onError handling

2009-01-30 Thread Jon Baer
Hi, I've just had a bump in the night where some feeds have disappeared. I'm wondering, since I'm running the base 1.3 copy, would patching it with https://issues.apache.org/jira/browse/SOLR-842 break anything? Has anyone done this yet? Thanks. - Jon

Re: got background_merge_hit_exception during optimization

2009-01-30 Thread Qingdi
We are on solr 1.3, and we use the default jetty server, which is included in the solr 1.3 download package. The java version is: java version "1.5.0_12" Java(TM) 2 Runtime Environment, Standard Edition (build 1.5.0_12-b04) Java HotSpot(TM) 64-Bit Server VM (build 1.5.0_12-b04, mixed mode) I ch

Re: query with stemming, prefix and fuzzy?

2009-01-30 Thread Mark Miller
Gert Brinkmann wrote: Thanks, Mark, for your answer, Mark Miller wrote: Truncation queries and stemming are difficult partners. You likely have to accept compromise. You can try using multiple fields like you are, I already have multiple fields, one per language, to be able to use dif

Re: query with stemming, prefix and fuzzy?

2009-01-30 Thread Gert Brinkmann
Mark Miller wrote: > Yeah, sounds small. Its odd you would see such slow performance. It > depends though. You may still have a *lot* of unique terms in there. Is there a way to retrieve the list of terms in the index? Gert

Re: query with stemming, prefix and fuzzy?

2009-01-30 Thread Mark Miller
Gert Brinkmann wrote: Mark Miller wrote: Yeah, sounds small. Its odd you would see such slow performance. It depends though. You may still have a *lot* of unique terms in there. Is there a way to retrieve the list of terms in the index? Gert Try hitting /solr/admin/luke and see wh

solr boosting

2009-01-30 Thread Marc Sturlese
Hey there, I am trying to tune the boost of the results obtained using DisMaxQueryParser. As I understand Lucene's boost, if you search for "John Le Carre" it will give a better score to results that contain just the searched string than to results that have, for example, 50 words and the search i

exceeded limit of maxWarmingSearchers

2009-01-30 Thread Jon Drukman
I am getting hit by a storm of these once a day or so: SEVERE: org.apache.solr.common.SolrException: Error opening new searcher. exceeded limit of maxWarmingSearchers=16, try again later. I keep bumping up maxWarmingSearchers. It's at 32 now. Is there any way to figure out what the "right"

Re: exceeded limit of maxWarmingSearchers

2009-01-30 Thread Yonik Seeley
I'd advise setting it to a very low limit (like 2) and committing less often. Once you get too many overlapping searchers, things will slow to a crawl and that will just cause more to pile up. The root cause is simply too many commits in conjunction with warming too long. If you are using a dev
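The advice about committing less often can be sketched as follows. This is only an illustration, assuming the client sends Solr's XML update messages: documents are grouped into `<add>` batches and a single `<commit/>` is issued at the end, so one new searcher is opened per run rather than one per document. The function name `build_update_messages` and the `batch_size` parameter are hypothetical, and posting the messages to the update handler is left out.

```python
from xml.sax.saxutils import escape, quoteattr


def build_update_messages(docs, batch_size=1000):
    """Build XML update messages: one <add> per batch, then one <commit/>.

    docs is a list of dicts mapping field names to values. The helper name
    and batch_size are illustrative assumptions, not part of Solr.
    """
    messages = []
    for start in range(0, len(docs), batch_size):
        batch = docs[start:start + batch_size]
        doc_xml = []
        for doc in batch:
            fields = "".join(
                "<field name=%s>%s</field>" % (quoteattr(name), escape(str(value)))
                for name, value in doc.items()
            )
            doc_xml.append("<doc>%s</doc>" % fields)
        messages.append("<add>%s</add>" % "".join(doc_xml))
    # A single commit for the whole run instead of one per document or batch.
    messages.append("<commit/>")
    return messages


if __name__ == "__main__":
    for message in build_update_messages([{"id": i} for i in range(5)], batch_size=2):
        print(message)
```

Whether one commit per run is frequent enough depends on how fresh the index needs to be; the point is only that the number of commits, not the number of documents, drives the number of warming searchers.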

Re: query with stemming, prefix and fuzzy?

2009-01-30 Thread Gert Brinkmann
Mark Miller wrote: > Try hitting /solr/admin/luke and see what it says. Oh, interesting. I think I have to check the stopword list. Is there a way to filter single characters like the "h"? text_de_de ITS-- ITS-- 2340 57971 1454 1016 1008 980 927 924 895 843 730 730

Re: solr as the data store

2009-01-30 Thread Ian Connor
The other option was actually couchdb. It was very nice but the benefits were not compelling compared to the pure simplicity of just having solr. With replication now so simple to set up, it really does seem to solve all the problems we are looking for in a redundant distributed storage s

Re: query with stemming, prefix and fuzzy?

2009-01-30 Thread Mark Miller
Gert Brinkmann wrote: 57971 It's a lot for a small index. The fuzzy query will enumerate all of those terms and calculate an edit distance. It's not an insane amount of work, but it jibes with the slowness you see. Doing that 60,000 times for a query is not that fast. Unfortunately, withou
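The cost described above can be illustrated with a small sketch: a plain Levenshtein edit distance computed against every term in a list. This is not Lucene's implementation; the function names, the sample terms, and the absolute `max_distance` threshold are assumptions made for illustration only.

```python
def edit_distance(a, b):
    """Levenshtein distance between a and b, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def fuzzy_matches(query, terms, max_distance=2):
    """Scan every term and keep those within max_distance edits of query."""
    return [term for term in terms if edit_distance(query, term) <= max_distance]


if __name__ == "__main__":
    # Hypothetical terms; a real index would be scanned once per fuzzy query.
    print(fuzzy_matches("haus", ["haus", "maus", "hause", "baum", "h"], max_distance=1))
```

Each comparison costs time proportional to the product of the two term lengths, and the scan repeats for every unique term in the field, which is why the number of unique terms matters more than the number of documents.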

Re: query with stemming, prefix and fuzzy?

2009-01-30 Thread Shalin Shekhar Mangar
On Fri, Jan 30, 2009 at 11:37 PM, Mark Miller wrote: > >> >>> you can try indexing the full term at the same position as the stemmed >>> term, >>> >>> >> >> what does this mean "at the same position" and how could I do this? >> >> > Write a custom filter. Normally, for every term, its position is

problems on solr search patterns and sorting rules

2009-01-30 Thread fei dong
Hi, I am working on an audio search built on the Solr engine. I want to implement lyric search and sort by relevance. Here is my confusion. My schema.xml is like this: text ... http://localhost:8983/solr/select/?q=lyric:(tear the house down)&fl=*,scor

Separate error logs

2009-01-30 Thread James Brady
Hi all, what's the best way for me to split Solr/Lucene error messages off to a separate log? Thanks James

Re: exceeded limit of maxWarmingSearchers

2009-01-30 Thread Jon Drukman
Yonik Seeley wrote: I'd advise setting it to a very low limit (like 2) and committing less often. Once you get too many overlapping searchers, things will slow to a crawl and that will just cause more to pile up. The root cause is simply too many commits in conjunction with warming too long. I

Re: solr as the data store

2009-01-30 Thread Paul Libbrecht
We've been using a Lucene index as the main data-store for ActiveMath, the indexing process of which takes the XML fragments apart and stores them in an organized way, including storage of the relationships both ways. The difference between SQL and Lucene in this case? Pure java was the m

Re: Separate error logs

2009-01-30 Thread Ryan McKinley
check: http://wiki.apache.org/solr/SolrLogging You can configure whichever logger flavor you use to write errors to a separate log On Jan 30, 2009, at 4:36 PM, James Brady wrote: Hi all,What's the best way for me to split Solr/Lucene error message off to a separate log? Thanks James

Re: exceeded limit of maxWarmingSearchers

2009-01-30 Thread Otis Gospodnetic
That should be fine (but apparently isn't), unless you have a very slow machine or your caches are large and configured to copy a lot of data on commit. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Jon Drukman > To

Re: Separate error logs

2009-01-30 Thread James Brady
Oh... I should really have found that myself :/ Thank you! 2009/1/30 Ryan McKinley > check: > http://wiki.apache.org/solr/SolrLogging > > You configure whatever flavor logger to write error to a separate log > > > > On Jan 30, 2009, at 4:36 PM, James Brady wrote: > > Hi all,What's the best way

Re: problems on solr search patterns and sorting rules

2009-01-30 Thread Koji Sekiguchi
fei dong wrote: Hi buddy, I work on an audio search based on solr engine. I want to realize lyric search and sort by relevance. Here is my confusion. . My schema.xml is like this: text ... http://localhost:8983/solr/select/?q=lyric:(tear the hou

RE: Performance "dead-zone" due to garbage collection

2009-01-30 Thread wojtekpia
I profiled our application, and GC is definitely the problem. The IBM JVM didn't change much. I'm currently looking into ways of reducing my memory footprint.

Solr on Sun Java Real-Time System

2009-01-30 Thread wojtekpia
Has anyone tried Solr on the Sun Java Real-Time JVM (http://java.sun.com/javase/technologies/realtime/index.jsp)? I've read that it includes better control over the garbage collector. Thanks. Wojtek

Range search question

2009-01-30 Thread Jim Adams
I have a string field in my schema that actually holds numeric data. If I try a range search: fieldInQuestion:[ 100 TO 150 ] I fetch back a lot of data that is NOT in this range, such as 11, etc. Any idea why this happens? Is it because this is a string? Thanks.
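The behaviour in the question is what plain string comparison produces, and a minimal sketch shows why "11" lands inside the range 100 to 150. The sample values and helper names are hypothetical. The zero-padding at the end is a general technique for making strings sort in numeric order, offered here as an assumption rather than something proposed in the thread.

```python
def string_range(values, low, high):
    """Inclusive range filter using lexicographic (string) comparison."""
    return [v for v in values if low <= v <= high]


def numeric_range(values, low, high):
    """Inclusive range filter that compares the values as integers."""
    return [v for v in values if low <= int(v) <= high]


values = ["11", "100", "125", "1400", "150", "20", "99"]

# Strings compare character by character, so "11" sorts between "100" and "150".
print(string_range(values, "100", "150"))   # ['11', '100', '125', '1400', '150']
print(numeric_range(values, 100, 150))      # ['100', '125', '150']

# Zero-padding to a fixed width makes string order agree with numeric order
# for non-negative integers (the width of 6 is an arbitrary choice).
padded = [v.zfill(6) for v in values]
print(string_range(padded, "000100", "000150"))  # ['000100', '000125', '000150']
```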

Re: Range search question

2009-01-30 Thread Koji Sekiguchi
Jim Adams wrote: I have a string field in my schema that actually numeric data. If I try a range search: fieldInQuestion:[ 100 TO 150 ] I fetch back a lot of data that is NOT in this range, such as 11, etc. Any idea why this happens? Is it because this is a string? Thanks. Yep, try si

Re: Range search question

2009-01-30 Thread Jim Adams
True, which is what I'll probably do, but is there any way to do this using 'string'? Actually I have even seen this with date fields, which seems very odd (more data being returned than I expected). On Fri, Jan 30, 2009 at 7:04 PM, Koji Sekiguchi wrote: > Jim Adams wrote: > >> I have a string