Message = "The remote server returned an error: (500) Internal Server Error."

2008-03-27 Thread farhanali
Hi: when I post an XML file to Solr, the data is indexed, but if I post the same file again after a week or two I usually get this error: "The remote server returned an error: (500) Internal Server Error." I don't know what the problem is. If I create a new instance of Solr and place "solr.config"

RE: Highlight - get terms used by lucene

2008-03-27 Thread Chris Hostetter
: thanks for the answer, with that information I can pull out the term : frequency. Reason for all this, is that we want to use this scoring : algorithm: : http://download-uk.oracle.com/docs/cd/B19306_01/text.102/b14218/ascore.htm Uh why? Based on the description this sounds exactly like

Re: Making stop-words optional with DisMax?

2008-03-27 Thread Ronald K. Braun
> sure, but what logic would you suggest be used to decide when to make them > optional? :) Operationally, I was thinking a tokenizer could use the stop-word list (or an optional-word list) to mark tokens as optional rather than removing them from the token stream. DisMaxOptional would then gene

Re: hl.requireFieldMatch and idf

2008-03-27 Thread Mike Klaas
On 27-Mar-08, at 1:46 AM, Koji Sekiguchi wrote: Hello, If an index has (many) deleted docs and not optimized, when I set hl.requireFieldMatch=true, highlight doesn't work sometimes. cause: If hl.requireFieldMatch set to true, DefaultSolrHighlight.getQueryScorer() uses QueryScorer(Query,I

Re: Making stop-words optional with DisMax?

2008-03-27 Thread Ronald K. Braun
> We use two fields, one with and one without stopwords. The exact > field has a higher boost than the other. That works pretty well. Thanks for the tip, wunder! We are doing likewise for our pf parm of DisMax and that part works well -- exact matches are highly relevant and stopped-matches less
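A minimal sketch of the two-field setup described above, assuming hypothetical field names `title_exact` (stop words kept) and `title_stopped` (stop words removed) and illustrative boost values; none of these names come from the thread. It only builds the request parameters. `qf` and `pf` are standard DisMax parameters, while `qt=dismax` assumes a request handler registered under that name in solrconfig.xml.

```python
from urllib.parse import urlencode


def build_dismax_params(user_query,
                        exact_field="title_exact",      # hypothetical field, stop words kept
                        stopped_field="title_stopped",  # hypothetical field, stop words removed
                        exact_boost=2.0,
                        stopped_boost=1.0):
    """Return DisMax request parameters that boost the exact field higher."""
    boosted = f"{exact_field}^{exact_boost} {stopped_field}^{stopped_boost}"
    return {
        "q": user_query,
        "qt": "dismax",  # assumes a handler named "dismax" exists in solrconfig.xml
        "qf": boosted,   # fields (with boosts) used for term matching
        "pf": boosted,   # fields (with boosts) used for whole-phrase boosting
    }


params = build_dismax_params("the doors")
print(urlencode(params))
```

The resulting query string would be appended to the select URL of whatever Solr instance is in use; the boost ratio is something to tune against real queries.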

Re: Multiple schemas?

2008-03-27 Thread tim robertson
Thanks all, for the answers On Thu, Mar 27, 2008 at 10:04 PM, Yonik Seeley <[EMAIL PROTECTED]> wrote: > On Thu, Mar 27, 2008 at 4:56 PM, Otis Gospodnetic > <[EMAIL PROTECTED]> wrote: > > Or use the JNDI approach that's described on the Wiki. I've used it > with Jetty and it works nicely. Multip

Re: Making stop-words optional with DisMax?

2008-03-27 Thread Otis Gospodnetic
If you have "doors" in your index and a person enters: "the doors", why not just drop stop-words at query time? If a person searches for "music by the doors" and you have "music doors" in the index and really uses quotes to get the exact phrase, you can try it like Hoss said, and retry without s
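One possible reading of the query-time approach above, as a rough sketch: run the query as typed, and only if nothing comes back, drop the stop words and try again. The stop-word list, the function names, and the `search_fn` callback are all hypothetical stand-ins; a real client would call Solr where `search_fn` is invoked.

```python
# Illustrative stop-word list; a real deployment would reuse the one from its analyzer.
STOP_WORDS = frozenset({"the", "a", "an", "by", "of", "and"})


def strip_stop_words(query, stop_words=STOP_WORDS):
    """Remove stop words from a whitespace-separated query string."""
    kept = [term for term in query.split() if term.lower() not in stop_words]
    # If every term is a stop word, keep the original query rather than send an empty one.
    return " ".join(kept) if kept else query


def search_with_fallback(query, search_fn):
    """Try the query as typed, then retry once without stop words if it found nothing."""
    results = search_fn(query)
    if results:
        return results
    stripped = strip_stop_words(query)
    if stripped == query:
        return results  # nothing to strip, so a retry would be identical
    return search_fn(stripped)


def fake_search(query):
    """Stand-in for a Solr request: exact lookup in a tiny in-memory 'index'."""
    index = {"music doors": ["doc-1"]}
    return index.get(query, [])


print(search_with_fallback("music by the doors", fake_search))
```

The fallback costs a second request only for queries that return nothing, which keeps the common case at one round trip.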

Re: Multiple schemas?

2008-03-27 Thread Yonik Seeley
On Thu, Mar 27, 2008 at 4:56 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Or use the JNDI approach that's described on the Wiki. I've used it with > Jetty and it works nicely. Multiple webapp contexts, multiple Solr indices, > but a single JVM. With multiple smaller collections, one might

Re: document retrieval, nested field and HTMLStripStandardTokenizerFactory

2008-03-27 Thread Otis Gospodnetic
For a field to be searchable it has to be indexed (and not just stored). Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Vinci <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Thursday, March 27, 2008 4:43:02 AM Subject: Re: document

Re: Multiple schemas?

2008-03-27 Thread Otis Gospodnetic
Or use the JNDI approach that's described on the Wiki. I've used it with Jetty and it works nicely. Multiple webapp contexts, multiple Solr indices, but a single JVM. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Yonik Seeley <[EMAIL PROTE

Re: Multiple schemas?

2008-03-27 Thread Yonik Seeley
On Thu, Mar 27, 2008 at 3:47 PM, tim robertson <[EMAIL PROTECTED]> wrote: > Would I be correct in thinking that for each schema I want, I need a new > SOLR instance running? For different search collections, it's generally best to run a separate Solr instance. If you need to run multiple in the

Re: Multiple schemas?

2008-03-27 Thread Daniel Papasian
tim robertson wrote: Hi, Would I be correct in thinking that for each schema I want, I need a new SOLR instance running? Hey Tim, Documents aren't required to have all of the fields (it's not a database), so what I would do is just have all of the field definitions in a single schema.xml fil
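To illustrate the single-schema idea, here is a small sketch that builds one Solr XML `<add>` message containing two documents with different sets of fields. The field names (`id`, `title`, `author`, `sku`, `price`) are hypothetical and would each need a matching definition in schema.xml; the helper name is also made up for this example.

```python
import xml.etree.ElementTree as ET


def build_add_message(docs):
    """Build a Solr <add> update message from a list of {field_name: value} dicts."""
    add = ET.Element("add")
    for doc in docs:
        doc_el = ET.SubElement(add, "doc")
        for name, value in doc.items():
            field = ET.SubElement(doc_el, "field", name=name)
            field.text = str(value)
    return ET.tostring(add, encoding="unicode")


# Two kinds of records in one index; each document supplies only the fields it has.
docs = [
    {"id": "book-1", "title": "Lucene in Action", "author": "Hatcher"},
    {"id": "item-7", "sku": "A-100", "price": 19.99},
]
print(build_add_message(docs))
```

Whether one shared schema or separate instances works better depends on how different the collections are and whether they need to be searched together.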

Re: size limitation when adding document?

2008-03-27 Thread tim robertson
Today I added a single 9gig tab file into solr, with the resulting index being 16gig. It took 3 hours to load and is performing mighty fine (jvm -Xmx3G) On Thu, Mar 27, 2008 at 7:08 PM, Yonik Seeley <[EMAIL PROTECTED]> wrote: > On Thu, Mar 27, 2008 at 1:21 PM, Andrew Tan <[EMAIL PROTECTED]> > wrote: >

Multiple schemas?

2008-03-27 Thread tim robertson
Hi, Would I be correct in thinking that for each schema I want, I need a new SOLR instance running? Thanks Tim

Re: size limitation when adding document?

2008-03-27 Thread Yonik Seeley
On Thu, Mar 27, 2008 at 1:21 PM, Andrew Tan <[EMAIL PROTECTED]> wrote: > I am new to solr and just try it out. I copied solr.war (from 1.2.0 > distribution) into tomcat 5.5.26's webapps directory and started tomcat. > Then I use the java SimplePostTool to add documents. when the document > is s

size limitation when adding document?

2008-03-27 Thread Andrew Tan
Hi, I am new to Solr and am just trying it out. I copied solr.war (from the 1.2.0 distribution) into Tomcat 5.5.26's webapps directory and started Tomcat. Then I used the Java SimplePostTool to add documents. When the document is small, things are fine. However, when I tried to add document (greater than

Re: logging in 24hour time

2008-03-27 Thread Chris Hostetter
: Is there any way to get the logs to stderr/stdout to be in 24hour time? http://wiki.apache.org/solr/FAQ#head-ffe035452f21ffdb4e4658c2f8f6553bd6ca "How do I change the logging levels/files/format ?" -Hoss

RE: Highlight - get terms used by lucene

2008-03-27 Thread Tim Mahy
Hi, thanks for the answer, with that information I can pull out the term frequency. Reason for all this, is that we want to use this scoring algorithm: http://download-uk.oracle.com/docs/cd/B19306_01/text.102/b14218/ascore.htm but is there a performance cost on the explain, that can be painfull

logging in 24hour time

2008-03-27 Thread Doug Steigerwald
Is there any way to get the logs to stderr/stdout to be in 24hour time? Thanks. Doug

hl.requireFieldMatch and idf

2008-03-27 Thread Koji Sekiguchi
Hello, If an index has (many) deleted docs and is not optimized, highlighting sometimes doesn't work when I set hl.requireFieldMatch=true. Cause: If hl.requireFieldMatch is set to true, DefaultSolrHighlight.getQueryScorer() uses the QueryScorer(Query,IndexReader,String) constructor in the Lucene highlighter. The

Re: document retrieval, nested field and HTMLStripStandardTokenizerFactory

2008-03-27 Thread Vinci
Hi hossman, Thank you for your reply. A question about the searchable field: is declaring the field as indexed in the schema enough to make it searchable? (Assume I write my schema based on the default one.) Thank you, Vinci hossman wrote: > > > : 1. Can I limit the number of returned document i

Re: Adding custom field for sorting?

2008-03-27 Thread Chris Hostetter
: 1. Does Solr have a limit, e.g a % or a number to limit the number of : document involved in sorting? or just sort all document? it depends on the context of your question ... if you use any existing fieldtypes, then standard Lucene sorting kicks in, and a FieldCache is built containing the v

Re: Making stop-words optional with DisMax?

2008-03-27 Thread Chris Hostetter
: frequently get queried for "The Doors". Articles and prepositions : (the stuff of good stop-lists) seem to me to be in a fuzzier class -- : use 'em if you have 'em during matching, but don't kill your queries : because of them. Hence some desire to make them in some way : "optional" during mat