Errors when registering MBeans

2010-01-15 Thread Daniel Persson
Hi. I'm running Solr on an IBM WAS 7 server. I get a lot of errors in the log, and from what I can read out of them, it's a problem when adding MBeans. I have tried to deploy it without generating MBeans, but with no luck. Does anyone know a good fix for this? The service works fine, but a lot of extra junk in

Problem with text field in Solr

2010-01-15 Thread deepak agrawal
Hi, I am using Solr, and my BODY field is of type text. The problem shows up when I search BODY for a word like aviation: when I search BODY:avia*, aviation is returned; when I search BODY:aviat*, aviation is returned; when I search BODY:aviati*, aviation is not returned; when I am

Re: Problem with text field in Solr

2010-01-15 Thread Sven Maurmann
Hi, at first glance at your configuration, it appears that you have run into the following: you are using a wildcard query to query a stemmed term (aviation becomes aviat) in the index. Now if you provide a wildcard query with the trailing asterisk as the only wildcard, this wildcard query is rewrit
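The effect described in this reply can be sketched in a few lines of Python. This is only a toy model, not Solr's code: it assumes the index holds just the stemmed form aviat (as stated above) and that a trailing-wildcard query is compared against indexed terms as a plain prefix, with no stemming applied to the query side. The name prefix_matches is hypothetical.

```python
# Toy model: the index stores the stemmed term, not the original word.
INDEXED_TERMS = ["aviat"]  # "aviation" after stemming, per the message above


def prefix_matches(query, indexed_terms=INDEXED_TERMS):
    """Return the indexed terms that match a trailing-wildcard query like 'avia*'.

    Assumption of this sketch: the prefix is used exactly as typed
    (it is not stemmed) and is tested against the stemmed index terms.
    """
    prefix = query.rstrip("*")
    return [term for term in indexed_terms if term.startswith(prefix)]


if __name__ == "__main__":
    for q in ("avia*", "aviat*", "aviati*"):
        print(q, "->", prefix_matches(q))
```

Under these assumptions, avia* and aviat* both find the stemmed term, while aviati* is longer than anything in the index and finds nothing, which mirrors the behaviour reported in the original question.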

recent query execution cache in Solr

2010-01-15 Thread Smith G
Hello, I have some project-specific config files, which are being used to boost some important fields. These files are read by Solr for the first query and then stored in an Object, and used for the following queries. As those files are read for the first request, I need to rest

Term Dictionary + scoring

2010-01-15 Thread MitchK
Hello, I have searched the wiki and the mailing-lists, but I can't find any postings for the following training-use cases. First: I want to create a Term Dictionary, which I can return to my client. The client should be able to manipulate this response in any way he wants - so I really need a

Re: Problem with text field in Solr

2010-01-15 Thread MitchK
What does analysis.jsp show you when you query the words? Stemming of the input could be the cause of the problem. What happens if you search for "aviation" without wildcards? -- View this message in context: http://old.nabble.com/Problem-with-text-field-in-Solr-tp27175346p27175827.html Sen

Re: [1.3] help with update timeout issue?

2010-01-15 Thread MitchK
If, and only if, you need to fix your problem as fast as you can, I would think about virtualization. You need to replicate your Solr and its index files. The idea is quite simple: while one Solr server does its optimization, the other one is available for searching documents without any downtime.

Re: [1.3] help with update timeout issue?

2010-01-15 Thread Andre Parodi
Add these to your JAVA_OPTS when you start your JVM: -verbose:gc -XX:+PrintGCTimeStamps -XX:+PrintGCDetails -Xloggc:/your/path/verbose-gc.log Then tail the verbose GC log to see whether the timing of your pause corresponds with a full GC. On 15/01/10 03:59, Jerome L Quinn wrote: Is this related to GC

Re: recent query execution cache in Solr

2010-01-15 Thread Otis Gospodnetic
Hi, Yes, it's the cache. Not the document/query/filter cache, though, but the HTTP cache. Yes, you can disable it in solrconfig.xml. You can also modify the URL slightly (e.g. add &foo=1 to the end) to trick your browser into thinking you are looking at a different URL/page. Otis -- Sematext -- http://sem
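The URL trick mentioned in this reply could look roughly like the following Python sketch. It assumes the extra parameter (foo, as in the message) is not used by the request handler; the helper name add_cache_buster and the localhost URL in the example are hypothetical.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def add_cache_buster(url, name="foo", value="1"):
    """Return url with an extra query parameter appended.

    The new parameter makes the URL differ from the cached one, so an
    HTTP cache (for example the browser's) treats it as a new page.
    """
    parts = urlsplit(url)
    # Keep the existing parameters in order and append the new one last.
    query = parse_qsl(parts.query, keep_blank_values=True)
    query.append((name, value))
    return urlunsplit(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    # Hypothetical Solr URL, for illustration only.
    print(add_cache_buster("http://localhost:8983/solr/select?q=test"))
```

Using a changing value (a counter or timestamp) instead of the fixed "1" would give a fresh URL on every request; that is a design choice beyond what the message suggests.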

Re: [1.3] help with update timeout issue?

2010-01-15 Thread MitchK
The current topic "Need deployment strategy" may give you another answer quite similar to mine. It sounds much cleaner. -- View this message in context: http://old.nabble.com/-1.3--help-with-update-timeout-issue--tp27171798p27179780.html Sent from the Solr - User mailing list archive at Nab

Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
Hello All, I'm trying to find the best way to set up a fieldType that strips punctuation. I think the right way to do this is using a CharacterFilter of some type, but I can't seem to find any examples of how to set this up in a schema.xml file. Can anyone point me in the right direction? Right

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Otis Gospodnetic
Hi David, (this is the David I know, right?) The simplest may be to use one of the existing TokenFilters that removes punctuation. StandardFilterFactory should do it. Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message > From: David Seltzer > To: so

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Ahmet Arslan
> I'm trying to find the best way to set up a fieldType that > strips punctuation. Use solr.StandardTokenizerFactory that strips punctuations. Or if you do not care about alphanumeric or numeric queries use solr.LowerCaseTokenizerFactory that uses LetterTokenizer. I think the right way to do

RE: Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
I'm hesitant to change Tokenizers at the moment because what we have is working so nicely - or so I thought. What I'm looking for is case-insensitive search for words and numbers without any of the stemming features turned on. The new requirement is that we take punctuation out of the mix. Right

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Erick Erickson
If you haven't seen it, this page is invaluable for this kind of question: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.LetterTokenizerFactory LetterTokenizerFactory may well be your frien

RE: Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
Hi Erik, Thanks for your thoughtful reply! > It's actually quite rare for simple tokenizers like these to be satisfactory > unless it's a field you can guarantee is indexed/searched in a very > controlled manner, say part numbers or words from a list. In your > example above, none of the three va

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Robert Muir
Hello, RTL languages like Arabic are encoded "logically" in left-to-right order, too. Only your display swaps the order. On Fri, Jan 15, 2010 at 2:07 PM, David Seltzer wrote: > Hi Erik, > > Thanks for your thoughtful reply! > >> It's actually quite rare for simple tokenizers like these to be > sa

Multi-word Terms

2010-01-15 Thread shamrockstores
What is the best way to essentially get a term frequency vector for multi-word terms? Thanks in advance. -- View this message in context: http://old.nabble.com/Multi-word-Terms-tp27182199p27182199.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Multi-word Terms

2010-01-15 Thread Ahmet Arslan
> What is the best way to essentially get a term frequency > vector for > multi-word terms? To use solr.ShingleFilterFactory and TermVectorComponent. http://wiki.apache.org/solr/TermVectorComponent http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ShingleFilterFactory

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Erick Erickson
Ah, ok, your approach makes sense. Mostly I was trying to ensure that you weren't flying blind. Perhaps you would find some joy with PatternReplaceCharFilterFactory, replacing every non-alphanumeric character with the empty string? HTH Erick On Fri, Jan 15, 2010 at 2:07 PM, David Seltzer wrote: > Hi Erik, > > Tha
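The kind of replacement suggested in this reply can be tried out with an ordinary regular expression before putting anything into a schema. The sketch below uses Python's re module, not Solr, and rests on two assumptions that are not in the message: whitespace is kept so that words do not run together, and only ASCII letters and digits count as alphanumeric, which would also strip non-Latin letters. The name strip_punctuation is hypothetical.

```python
import re

# Assumption: keep ASCII letters, digits and whitespace; drop everything else.
NON_ALNUM = re.compile(r"[^A-Za-z0-9\s]")


def strip_punctuation(text):
    """Replace every character matched by NON_ALNUM with the empty string."""
    return NON_ALNUM.sub("", text)


if __name__ == "__main__":
    print(strip_punctuation("Hello, world! It's 2010."))
```

For multilingual text a Unicode-aware character class would be needed instead of the ASCII ranges used here.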

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Robert Muir
Also, if you are really concerned about different languages but can use Solr 1.5, then take a look at Unicode Collation. You can simply add after your tokenizer and ignore case, accents, and punctuation in a reasonable way for all languages. http://wiki.apache.org/solr/UnicodeCollation On Fri, Jan

RE: Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
Does anyone out there know how to use PatternReplaceCharFilterFactory? The closest thing to an example I see is in the default schema.xml: I'm not sure how the PatternReplaceCharFilterFactory differs from the PatternReplaceFilterFactory. Can anyone give me an example of how to strip all commas

Index Courruption after replication by new Solr 1.4 Replication

2010-01-15 Thread Osborn Chan
Hi all, I recently migrated from Solr 1.2 with NFS mounting to the new Solr 1.4 replication feature with multicore support. The following exceptions appear in catalina.log from time to time, and there are some EOF exceptions, which make me believe the slave index files are corrupted after replication from

Re: Index Courruption after replication by new Solr 1.4 Replication

2010-01-15 Thread Otis Gospodnetic
This is not a direct answer to your question, but can you avoid NFS? My first guess would be that NFS somehow causes this problem. If you check the ML archives for: NFS lock , you will see what I mean. Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message

RE: Index Courruption after replication by new Solr 1.4 Replication

2010-01-15 Thread Osborn Chan
Hi Otis, Thanks. There is no NFS anymore, and all index files are local. We migrated to the new Solr 1.4 replication in order to avoid all the NFS stale exceptions. Thanks, Osborn -Original Message- From: Otis Gospodnetic [mailto:otis_gospodne...@yahoo.com] Sent: Friday, January 15, 2

Re: Term Dictionary + scoring

2010-01-15 Thread Grant Ingersoll
On Jan 15, 2010, at 8:19 AM, MitchK wrote: > > Hello, > > I have searched the wiki and the mailing-lists, but I can't find any > postings for the following training-use cases. > > First: > I want to create a Term Dictionary, which I can response to my client. The > client should be able to man

only use sorting when there's no "q" is "*:*"?

2010-01-15 Thread Andy
Is it possible to set up Solr such that when there's no query (client would send in "*:*" for "q"), Solr would sort results (basically all the documents) by date or some other criterion. And when there is a query other than "*:*", I'd want the results to be ranked by score. Is that something t

Re: [1.3] help with update timeout issue?

2010-01-15 Thread Jerome L Quinn
Otis Gospodnetic wrote on 01/14/2010 10:07:15 PM: > See those "waitFlush=true,waitSearcher=true" ? Do things improve if > you make them false? (not sure how with autocommit without looking > at the config and not sure if this makes a difference when > autocommit triggers commits) Looking at Dir

Re: only use sorting when there's no "q" is "*:*"?

2010-01-15 Thread Otis Gospodnetic
I don't think there is anything like that built right into Solr, but it looks like something that should be easy enough to add to your search app talking to Solr. Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message > From: Andy > To: solr-user@lucene
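As a rough illustration of handling this in the search application rather than in Solr, the client could choose the sort parameter before sending the request. In this Python sketch the function name build_params and the field name date are hypothetical; it relies on Solr ranking by score when no sort parameter is sent.

```python
MATCH_ALL = "*:*"


def build_params(user_query, date_field="date"):
    """Build Solr request parameters for the given user query.

    An empty query falls back to the match-all query and is sorted by
    date_field (a hypothetical field name). Any other query carries no
    sort parameter, so results come back ranked by score.
    """
    q = (user_query or "").strip() or MATCH_ALL
    params = {"q": q}
    if q == MATCH_ALL:
        params["sort"] = f"{date_field} desc"
    return params


if __name__ == "__main__":
    print(build_params(""))
    print(build_params("aviation"))
```

The resulting dictionary can be passed to whatever HTTP client the application already uses to talk to Solr.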

Re: only use sorting when there's no "q" is "*:*"?

2010-01-15 Thread Andy
I see. Thank you. Out of curiosity, when q is *:*, how does Solr decide on the ordering of the results? --- On Fri, 1/15/10, Otis Gospodnetic wrote: > From: Otis Gospodnetic > Subject: Re: only use sorting when there's no "q" is "*:*"? > To: solr-user@lucene.apache.org > Date: Friday, January

Design Question - Dynamic Field Names (*)

2010-01-15 Thread Kumaravel Kandasami
I need your suggestions on how best to design for the following requirement. - We have two indexes. Index 1: "name_index", Fields: "id" - indexed, not stored; "field_name" - indexed, stored. Index 2: "trans_index", Fields (Dynamic Schema): "id" - indexed, not stored; "*" - indexed, stored. (Dynamic field

Re: only use sorting when there's no "q" is "*:*"?

2010-01-15 Thread Otis Gospodnetic
It uses the doc insertion order by default. Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message > From: Andy > To: solr-user@lucene.apache.org > Sent: Fri, January 15, 2010 4:52:44 PM > Subject: Re: only use sorting when there's no "q" is "*:*"? > > I

Re: Errors when registering MBeans

2010-01-15 Thread Daniel Persson
Hi again Solr users. I forgot to describe my environment and version. I'm running Solr 1.4.0 without any changes to the war file, deployed directly to an IBM WAS 7 in a Windows environment. I have checked the code and the line that breaks is public SolrInfoMBean put(String key, SolrInfoMBean infoBean)

OverlappingFileLockException when using startup

2010-01-15 Thread Joe Kessel
I have an instance of Solr that won't start since I added the replication startup. I am using Solr 1.4 and only see this with my index that contains 200k documents with a total size of 400MB. When I remove the replicate-after-startup setting, the instance starts without error. We found that we n

Re: OverlappingFileLockException when using startup

2010-01-15 Thread Yonik Seeley
Interesting... this should be impossible. Unless there is a bug in Lucene's NativeFSLock (and it doesn't look like it), the only way I see that this could happen is if there were multiple instances of that class loaded in different classloaders. Are you using any kind of plugins? Could you open a

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Koji Sekiguchi
David, PatternReplaceCharFilterFactory accepts pattern and replacement args. Please read the PatternReplaceCharFilter javadoc to see a few samples: http://lucene.apache.org/solr/api/org/apache/solr/analysis/PatternReplaceCharFilter.html Koji -- http://www.rondhuit.com/en/ David Seltzer wrote: Do

RE: OverlappingFileLockException when using startup

2010-01-15 Thread Joe Kessel
I am using the example solrconfig.xml with only a few changes. Mainly the replication section for the master has been changed. I am not using any plugins that I am aware of. Here is my replication section: startup optimize If this is valid, then I will open

Re: [1.3] help with update timeout issue?

2010-01-15 Thread Lance Norskog
If your indexing software does not have the ability to retry after a failure, you might wish to change the timeout from 20 seconds to, say, 5 minutes. On Fri, Jan 15, 2010 at 1:20 PM, Jerome L Quinn wrote: > Otis Gospodnetic wrote on 01/14/2010 10:07:15 > PM: > >> See those "waitFlush=true,waitS

Re: Errors when registering MBeans

2010-01-15 Thread Lance Norskog
Does WAS have a custom security model? This might interfere with registering MBeans. On Fri, Jan 15, 2010 at 2:39 PM, Daniel Persson wrote: > Hi again Solr users. > > I forgot to describe environment and version. > > I'm running a 1.4.0 without any changes to the war file directly > deployed to a

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Lance Norskog
If your user query syntax has a small number of features, you could write your own query parser. On Fri, Jan 15, 2010 at 4:18 PM, Koji Sekiguchi wrote: > David, > > PatternReplaceCharFilterFactory accepts pattern and replacement args. > Please read PatternReplaceCharFilter javadoc to see few samp