On 26 February 2015 at 08:32, Gary Taylor <g...@inovem.com> wrote: > Alex, > > Same results on recursive=true / recursive=false. > > I also tried importing plain text files instead of epub (still using > TikeEntityProcessor though) and get exactly the same result - ie. all files > fetched, but only one document indexed in Solr.
To me, this would indicate that something is a problem with the inner DIH entity then. As a next set of steps, I would probably 1) remove both onError statements and see if there is an exception that is being swallowed. 2) run the import under ProcessMonitor and see if the other files are actually being read https://technet.microsoft.com/en-us/library/bb896645.aspx 3) Assume a Windows bug and test this on Mac/Linux 4) File a JIRA with a replication case. If there is a full replication setup, I'll test it machines I have access to with full debugger step-through For example, I wonder if FileBinDataSource is somehow not cleaning up after the first file properly on Windows and fails to open the second one. Regards, Alex. ---- Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: http://www.solr-start.com/