Package ruby-bdb caused index++ to choke on 'README_en.html': run-parts: executing /etc/cron.weekly/0anacron run-parts: executing /etc/cron.weekly/dhelp terminate called after throwing an instance of 'std::out_of_range' what(): basic_string::erase: __pos (which is 4294967295) > this->size() (which is 0) Dhelp::IndexerError: Broken pipe indexing /usr/share/doc/ruby-bdb/rdoc/README_en.html , /usr/share/doc/ruby-bdb/rdoc/BDB.html , /usr/share/doc/ruby-bdb/rdoc/Thread.html , /usr/share/doc/ruby-bdb/rdoc/unknown.html , /usr/share/doc/ruby-bdb/rdoc/table_of_contents.html , /usr/share/doc/ruby-bdb/rdoc/index.html , /usr/share/doc/ruby-bdb/rdoc/String.html , /usr/share/doc/ruby-bdb/rdoc/RbConfig.html , /usr/share/doc/ruby-bdb/rdoc/Object.html , /usr/share/doc/ruby-bdb/rdoc/index.html
<* snip *> I created another part ("14.part"), removed the README file from it, and indexed it as follows: cat 14.part | /usr/bin/index++ --config-file /usr/share/dhelp/config/swish++.conf --index-file ./14.index --follow-links -v4 - BDB.html (269 words) index.html (165 words) Object.html (175 words) RbConfig.html (25 words) String.html (40 words) table_of_contents.html (1004 words) Thread.html (77 words) unknown.html (27 words) index++: ranking index... "based" discarded (100%) "berkeley" discarded (100%) "classes" discarded (100%) "darkfish" discarded (100%) "generated" discarded (100%) "granger" discarded (100%) "home" discarded (100%) "interface" discarded (100%) "load" discarded (100%) "methods" discarded (100%) "michael" discarded (100%) "pages" discarded (100%) "rdoc" discarded (100%) "rdoc_rel_prefix" discarded (100%) "validate" discarded (100%) index++: writing index... index++: done: 00:00 (min:sec) elapsed time 8 files, 8 indexed 2342 words, 1782 indexed, 596 unique cat 14.part /usr/share/doc/ruby-bdb/rdoc/BDB.html /usr/share/doc/ruby-bdb/rdoc/index.html /usr/share/doc/ruby-bdb/rdoc/Object.html /usr/share/doc/ruby-bdb/rdoc/RbConfig.html /usr/share/doc/ruby-bdb/rdoc/String.html /usr/share/doc/ruby-bdb/rdoc/table_of_contents.html /usr/share/doc/ruby-bdb/rdoc/Thread.html /usr/share/doc/ruby-bdb/rdoc/unknown.html I looked at the README file and couldn't see any obvious reason why the indexer wouldn't like it. This bug is very difficult to find the root cause.