Re: SOLR 1.2 - Updates sent containing fields that are not on the Schema fail silently

2007-11-29 Thread Daniel Alheiros
Hi Hoss. Well I'll enable this ignore options for fields that aren't declared in my schema. Thanks. Exactly, you can try it really easily, just remove one of your fields on the example schema config and try to add content using the Java client API... Well I'm using SOLRJ and it returns no error c

Re: Schema class configuration syntax

2007-11-29 Thread Ryan McKinley
Norskog, Lance wrote: Hi- What is the element in an element that will load this class: org.apache.lucene.analysis.cn.ChineseFilter This did not work: This is in Solr 1.2. the class needs to point to a FilterFactory (not a Filter) 1.3-dev adds FilterFactories for all the lucne

Re: SOLR 1.2 - Updates sent containing fields that are not on the Schema fail silently

2007-11-29 Thread Ryan McKinley
To be clear: solr *should* fail with an error if you send an unknown field. I just tested this with a clean checkout of 1.3-dev and 1.2 and in both cases I get an error 400 "unknown field 'asgasdgasgd'" The suggestion to look at the "ignore option" is to make sure you don't have one -- this

Re: LowerCaseFilterFactory and spellchecker

2007-11-29 Thread Sean Timm
It seems the best thing to do would be to do a case-insensitive spellcheck, but provide the suggestion preserving the original case that the user provided--or at least make this an option. Users are often lazy about capitalization, especially with search where they've learned from web search e

How much disc space Solr consumes?

2007-11-29 Thread Evgeniy Strokin
Hello,.. If index size is 100Gb and I want to run optimize command, how much more space I need for this? Also,.. If I run snapshooter does it take more space during shooting than actual snapshoot? \Thank you Gene

can I do *thing* substring searches at all?

2007-11-29 Thread Brian Whitman
With a fieldtype of string, can I do any sort of *thing* search? I can do thing* but not *thing or *thing*. Workarounds?

Re: can I do *thing* substring searches at all?

2007-11-29 Thread Charles Hornberger
Store a copy with the string reversed in another field. Then you can search that field for gniht* ... Also, I believe I saw some comments about prefix wildcards being available in some upcoming release (1.3?) ... sorry I can't remember any better than that. Google may help ... -Charlie On Nov 29

Document field data not getting indexed

2007-11-29 Thread Phillip Farber
Hi, I have 22 documents. I index these by posting them using LWP::UserAgent all with http status 200 OK. One of my documents (id=44) contains the word "Campeau" in the "ocr" field. But according to luke this term does not appear in the index. Yet when I delete the index (delete by query *:

Distribution without SSH?

2007-11-29 Thread Justin Knoll
Hello, I recently set up Solr with distribution on a couple of servers. I just learned that our network policies do not permit us to use SSH with passphraseless keys, and the snappuller script uses SSH to examine the master Solr instance's state before it pulls the newest index via rsync.

Re: Document field data not getting indexed

2007-11-29 Thread Yonik Seeley
On Nov 29, 2007 7:29 PM, Phillip Farber <[EMAIL PROTECTED]> wrote: > One of my documents (id=44) contains the word "Campeau" in the "ocr" > field. But according to luke this term does not appear in the index. AFAIK the Luke handler lists the top terms, not necessarily all of them. Do a search for

Re: Document field data not getting indexed

2007-11-29 Thread Chris Hostetter
see yonik's comments regarding Luke and wether or not your term is indexedx, as for this point : Also I notice that the numTerms for 22 documents is 5579 and for just the doc : id=44 it's 2194. Hard to believe that 22 documents only increase the number : of terms by so little. this is not

Re: LowerCaseFilterFactory and spellchecker

2007-11-29 Thread Chris Hostetter
: think i'm just doing something wrong... : : was experimenting with the spellcheck handler with the nightly : checkout from 11-28; seems my spellchecking is case-sensitive, even : tho i think i'm adding the LowerCaseFilterFactory to both the index : and query analyzers. I'm not very familiar wi

Re: SOLR/Lucene sorting - Question/ requesting suggestion

2007-11-29 Thread Ryan McKinley
Kasi Sankaralingam wrote: When we have the following set of data, they are first sorted based on Capital letters and then lower case . Is there a way to make them sort regardless of character case? Avaneesh Bruce Veda caroleY jonathan junit So carole would come after Bruce. Thanks sorting i

Re: Distribution without SSH?

2007-11-29 Thread Matt Kangas
Your company's network policies seem to be a good thing. I've worked at places with this same policy, for good reason. But it does tend to complicate operations sometimes. Some options you might pursue: * Set up ssh-agent on the clients and use passphrase-protected keys. Downside to this, s

SOLR/Lucene sorting - Question/ requesting suggestion

2007-11-29 Thread Kasi Sankaralingam
When we have the following set of data, they are first sorted based on Capital letters and then lower case . Is there a way to make them sort regardless of character case? Avaneesh Bruce Veda caroleY jonathan junit So carole would come after Bruce. Thanks

Re: LowerCaseFilterFactory and spellchecker

2007-11-29 Thread Mike Klaas
On 29-Nov-07, at 5:40 PM, Chris Hostetter wrote: I'm not very familiar with the SpellCheckerRequestHandler, but i don't think you are doing anything wrong. a quick skim of the code indicates that the "q" param isn't being analyzed by that handler, so the raw input string is pased to the Spe

Re: SOLR 1.2 - Updates sent containing fields that are not on the Schema fail silently

2007-11-29 Thread Chris Hostetter
: Exactly, you can try it really easily, just remove one of your fields on the : example schema config and try to add content using the Java client API... : Well I'm using SOLRJ and it returns no error code for me. But anyway don't : you think the server should also have some logging informing tha