Package: rdfind
Version: 1.3.5-1
Severity: normal

rdfind bareally uses single core on my 8 core machine.

I do have some highly duplicate files or similar size, about 500GB is
size, with few million files. Using async io, or multiple threads to do
things like tree traversal, stat, open/read/close, and for hashing, would
speed it up many many times, as the IO latency is a bottlneck often.

Not all parts can be easilly parallelized, and some additional
synchronization and granularity level needs to be decide, but still I
think it might be worth it.

Regards,
Witold


-- System Information:
Debian Release: buster/sid
  APT prefers unstable
  APT policy: (500, 'unstable'), (1, 'experimental')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 4.16.0-2-amd64 (SMP w/12 CPU cores)
Locale: LANG=pl_PL.utf8, LC_CTYPE=pl_PL.utf8 (charmap=UTF-8), 
LANGUAGE=pl_PL.utf8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages rdfind depends on:
ii  libc6       2.27-3
ii  libgcc1     1:8.1.0-5
ii  libnettle6  3.4-1
ii  libstdc++6  8.1.0-5

rdfind recommends no packages.

rdfind suggests no packages.

-- no debconf information

Reply via email to