I did a dataimport with 'clean' set to false.
The DIH status upon completion was:

<str name="status">idle</str>
<str name="importResponse"/>
<lst name="statusMessages">
<str name="Total Requests made to DataSource">1</str>
<str name="Total Rows Fetched">6843427</str>
<str name="Total Documents Processed">6843427</str>
<str name="Total Documents Skipped">0</str>
<str name="Full Dump Started">2015-08-16 16:50:54</str>
<str name="">
Indexing completed. Added/Updated: 6843427 documents. Deleted 0 documents.
</str>
Whereas when I query using 'query?q=*:*&rows=0', I get the following count
{
  "responseHeader":{
    "status":0,
    "QTime":1,
    "params":{
      "q":"*:*",
      "rows":"0"}},
  "response":{"numFound":1616376,"start":0,"docs":[]
  }}

There is a difference of 5 million records. Can anyone help me understand the 
behavior? The logs look fine.
Thanks

Reply via email to