tag 569900 + fixed-upstream pending
thanks

On Sun, Feb 14, 2010 at 01:29:25PM -0800, Ian Zimmerman wrote:
> It seems there's no way to skip files that have not been modified
> since omindex last indexed them.  This looks like a huge missed
> opportunity to speed things up.

This has been implemented upstream:

http://trac.xapian.org/ticket/342

The fix makes use of a new backend feature (tracking upper and lower bounds
on each value slot).  On trunk, we have an upper bound on the lastmod value for
any document in the database, so a file whose timestamp is newer than that
single threshold must have been modified, and we can spot most modified
documents without fetching anything from the database.

A backported patch would have to check the timestamp of every file against its
current lastmod value in the database, which requires fetching that data.  This
would make the "most documents have changed" case slower.
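
To illustrate the difference, here is a rough sketch in Python.  It is not the
omindex code: FakeIndex, fetch_lastmod() and scan() are hypothetical names made
up for this example, and the rule "modified if the file's timestamp is newer
than the stored lastmod" is an assumption.  It only counts how many
per-document fetches each approach needs.

```python
class FakeIndex:
    """Hypothetical stand-in for the database: path -> stored lastmod."""

    def __init__(self, lastmod_by_path):
        self._lastmod = dict(lastmod_by_path)
        self.lookups = 0  # number of per-document fetches performed

    def upper_bound(self):
        # A real backend would track this bound itself; the sketch
        # simply computes it from the stored values.
        return max(self._lastmod.values(), default=0)

    def fetch_lastmod(self, path):
        self.lookups += 1
        return self._lastmod.get(path)


def scan(index, files, use_bound):
    """Return (paths needing reindexing, number of fetches used)."""
    index.lookups = 0
    bound = index.upper_bound() if use_bound else None
    changed = []
    for path, mtime in files:
        # Threshold check: newer than anything indexed, so it must be
        # new or modified and no fetch is needed.
        if use_bound and mtime > bound:
            changed.append(path)
            continue
        # Otherwise compare against the stored value (assumed rule).
        stored = index.fetch_lastmod(path)
        if stored is None or mtime > stored:
            changed.append(path)
    return changed, index.lookups
```

With two indexed files and a scan in which one is unchanged, one is modified
and one is new, use_bound=True needs a single fetch while use_bound=False
needs three, and both report the same files as changed.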

Since 1.2.0 is due out in the next few weeks, I don't intend to try to backport
this patch, but instead I'm going to focus on helping get 1.2.0 released and
packaged for Debian.

> In general, it is disconcerting that
> omindex can only be given a whole directory to process, and only one
> at a time.  The Unix Way (TM), and some existing utilities like cpio
> and pax, suggest another interface: accept the list of files to be
> added or updated on standard input, either newline or NUL separated.

Please don't raise unrelated issues in a single ticket!  This one is:

http://trac.xapian.org/ticket/376

Cheers,
    Olly


