Re: Document field data not getting indexed

2007-11-30 Thread Phillip Farber
Well, this one falls into the category of bald-faced embarrassment. It's a bug in my process. Thanks to all for taking the time to respond. Have I said how great Solr support is? :-) Phil Phillip Farber wrote: Hi Yonik, Hoss, et al. I'm using numTerms=2000 in the luke url so I am seeing a

Re: Document field data not getting indexed

2007-11-30 Thread Yonik Seeley
On Nov 30, 2007 9:03 AM, Phillip Farber <[EMAIL PROTECTED]> wrote: > I'm using numTerms=2000 in the luke url so I am seeing all the items in > the index or at least up through c in the alphabet: If Luke is sorting by high term, you wouldn't necessarily see it. Regardless, the search you did below

Re: Document field data not getting indexed

2007-11-30 Thread Phillip Farber
Hi Yonik, Hoss, et al. I'm using numTerms=2000 in the luke url so I am seeing all the items in the index, or at least up through c in the alphabet: http://localhost:8983/solr/admin/luke?fl=ocr&numTerms=2000 When I index all 22 of my documents including doc id=44 which contains the word "Camp
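
For readers following along, the Luke request quoted above can be assembled programmatically. This is only a minimal sketch: the `/admin/luke` path and the `fl` and `numTerms` parameters come from the URL in the message, while the helper name `luke_url` and the `base` argument are hypothetical names introduced for illustration.

```python
from urllib.parse import urlencode


def luke_url(field, num_terms, base="http://localhost:8983/solr"):
    """Build a Luke handler URL that lists the top terms of one field.

    `luke_url` and `base` are illustrative names, not part of Solr.
    """
    query = urlencode({"fl": field, "numTerms": num_terms})
    return f"{base}/admin/luke?{query}"


# Reproduces the URL from the message for the "ocr" field.
print(luke_url("ocr", 2000))
```

As noted elsewhere in the thread, the handler reports top terms, so a term missing from this listing is not proof that it is missing from the index.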

Re: Document field data not getting indexed

2007-11-29 Thread Chris Hostetter
See Yonik's comments regarding Luke and whether or not your term is indexed. As for this point: > Also I notice that the numTerms for 22 documents is 5579 and for just the doc > id=44 it's 2194. Hard to believe that 22 documents only increase the number > of terms by so little. this is not

Re: Document field data not getting indexed

2007-11-29 Thread Yonik Seeley
On Nov 29, 2007 7:29 PM, Phillip Farber <[EMAIL PROTECTED]> wrote: > One of my documents (id=44) contains the word "Campeau" in the "ocr" > field. But according to luke this term does not appear in the index. AFAIK the Luke handler lists the top terms, not necessarily all of them. Do a search for
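
A direct search is a more reliable check than scanning a top-terms listing. The snippet above is cut off before the exact query, so the following is only a sketch under the assumption that a plain field query against `ocr` is what is meant; the helper name `term_search_url` and the `base` argument are hypothetical.

```python
from urllib.parse import urlencode


def term_search_url(field, term, base="http://localhost:8983/solr"):
    """Build a select URL that searches one field for one term.

    Assumes the standard `q` parameter; `term_search_url` is an
    illustrative name, not a Solr API.
    """
    query = urlencode({"q": f"{field}:{term}"})
    return f"{base}/select?{query}"


# The colon in the field query is percent-encoded as %3A.
print(term_search_url("ocr", "Campeau"))
```

If this query returns the document, the term is in the index even when the Luke listing does not show it.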

Document field data not getting indexed

2007-11-29 Thread Phillip Farber
Hi, I have 22 documents. I index these by posting them using LWP::UserAgent, and all return HTTP status 200 OK. One of my documents (id=44) contains the word "Campeau" in the "ocr" field. But according to Luke this term does not appear in the index. Yet when I delete the index (delete by query *:
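
Since the cause turned out to be in the posting process, it can help to inspect the payload that is actually sent. The original script used Perl's LWP::UserAgent and is not shown in the thread, so this is only a sketch in Python of building a Solr XML `<add>` message for one document. The field names `id` and `ocr` come from the message; the helper name `build_add_xml` and the sample text are hypothetical.

```python
import xml.etree.ElementTree as ET


def build_add_xml(doc):
    """Serialize one document as a Solr XML <add> message.

    ElementTree escapes characters such as & and < in field values,
    which hand-built XML strings can easily get wrong.
    """
    add = ET.Element("add")
    doc_el = ET.SubElement(add, "doc")
    for name, value in doc.items():
        field = ET.SubElement(doc_el, "field", name=name)
        field.text = str(value)
    return ET.tostring(add, encoding="unicode")


# Hypothetical sample text; the real OCR content is not in the thread.
payload = build_add_xml({"id": 44, "ocr": "Campeau & Co."})
print(payload)
```

Printing the payload before posting makes it easy to confirm that the expected text really is in the `ocr` field. Posted documents also need a commit before they become searchable.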