Hi Kito,we discussed this in yesterday's patchwork sync. Would you mind sharing what the current LLVM implementation does and if this is written down/documented somewhere?
In particular the chunk size we split large vectors. Like for a 1024b vector with the "128b ABI", does LLVM use LMUL8, i.e. always the largest possible LMUL, or is there another limit?
-- Regards Robin