Hello,
I don't need to say how great GNU parallel is (GREAT!). But for the
first time, I have encountered a behavior I didn't expect from it. I
am trying to break up a big input FASTA file (DNA sequence) using the
--block and --recstart options. But it always seems to create ONE
more file than
Hi Ole, thanks for the reply.
Not quite. True, I am observing the same thing (empty files 12
through 20 below), but what is bothering me is file #11, which has 13
bytes, and could have easily fit into file #10 (1092 bytes) and still
been well below the 1200 threshold.
Another way to have asked t
I was using parallel to do blast searches on a large number of
biological sequences.
cat sequences.fasta | parallel --block 2k --recstart '>' --pipe -j 30
"blastn -db nt -task blastn -evalue 0.1 \
-outfmt '6 qseqid sseqid pident length mismatch gapopen qstart qend
sstart send evalue bitscore