Hi Yonik, Hoss, et. al.

I'm using numItems=2000 in the luke url so I am seeing all the items in the index or at least up through c in the alphabet:

http://localhost:8983/solr/admin/luke?fl=ocr&numTerms=2000

When I index all 22 of my documents including doc id=44 which contains the word "Campeau" it is not in the index:

Luke:

<int name="call">22</int>
<int name="called">22</int>
<int name="calls">22</int>
<int name="cally">22</int>
<int name="cam">22</int>
<int name="cammi">22</int> <<<<<???
<int name="cams">22</int>
<int name="can">22</int>

and my search ocr:campeau does not return it:

<response>

<lst name="responseHeader">
 <int name="status">0</int>
 <int name="QTime">82</int>
 <lst name="params">
  <str name="indent">on</str>
  <str name="start">0</str>
  <str name="q">ocr:campeau</str>
  <str name="version">2.2</str>
  <str name="rows">10</str>
 </lst>
</lst>
<result name="response" numFound="0" start="0"/>
</response>


When I delete data/index and restarting solr and index just doc id=44 using the same process as for the 22 docs Campeau *is* in the index and I can retrieve it:

<response>
<lst name="responseHeader">
 <int name="status">0</int>
 <int name="QTime">90</int>
 <lst name="params">
  <str name="indent">on</str>
  <str name="start">0</str>
  <str name="q">ocr:campeau</str>
  <str name="version">2.2</str>
  <str name="rows">10</str>
 </lst>
</lst>
<result name="response" numFound="1" start="0">
 <doc>
  <str name="extern_id">mdp.39015015394847</str>
  <str name="id">44</str>
  <date name="timestamp">2007-11-30T13:59:45.783Z</date>
 </doc>
</result>
</response>

Luke:

<int name="call">1</int>
<int name="called">1</int>
<int name="came">1</int>
<int name="camerons">1</int>
<int name="campeau">1</int>  <<<<<<<<<<<<<
<int name="can">1</int>
<int name="canadian">1</int>


Yonik Seeley wrote:
On Nov 29, 2007 7:29 PM, Phillip Farber <[EMAIL PROTECTED]> wrote:
One of my documents (id=44) contains the word "Campeau" in the "ocr"
field.  But according to luke this term does not appear in the index.

AFAIK the Luke handler lists the top terms, not necessarily all of them.
Do a search for ocr:Campeau and see if it returns anything.

-Yonik

Reply via email to