I just discovered a fork of GNU Parallel:
https://github.com/stephen-fralich/parallel-sql/
It saves into PostgreSQL.
If GNU Parallel were to get an --sql option, it should be more general
than that. The obvious approach would be a DBURL that specifies which
driver, username, password, and database to use.
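To make the DBURL idea concrete, here is a minimal sketch of how one URL could carry the driver, username, password, and database. It is not how GNU Parallel or the fork works; the function name parse_dburl, the example URL, and the choice of the URL scheme as the driver name are all assumptions made for illustration.

```python
from urllib.parse import urlsplit, unquote


def parse_dburl(dburl):
    """Split a DBURL such as driver://user:password@host:port/database.

    Hypothetical helper: treats the URL scheme as the driver name.
    """
    parts = urlsplit(dburl)
    return {
        "driver": parts.scheme,
        # Credentials may be percent-encoded inside a URL, so decode them.
        "user": unquote(parts.username) if parts.username else None,
        "password": unquote(parts.password) if parts.password else None,
        "host": parts.hostname,
        "port": parts.port,
        # The path is "/database"; drop the leading slash.
        "database": parts.path.lstrip("/") or None,
    }


if __name__ == "__main__":
    # Made-up URL, for illustration only.
    print(parse_dburl("postgresql://rick:s3cret@db.example.com:5432/jobs"))
```

With this layout, switching databases would only mean changing the scheme and host in one string rather than adding a separate option per backend.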
The Unix 'way' of simple tools connected by pipes is good in my mind. And
Parallel is perhaps a bit complex for this model. Sorry if it sounds like a
complaint; Parallel is working great for me.
I use Parallel to launch a Perl program, and it writes to a DB. There are
other options too.
Rick
I created the fork for a very particular group of users
(http://www.bakerlab.org/), so the features I implemented are pretty
narrow. They run a serial protein folding code called Rosetta. They
use about 2-3 million CPU hours a month running it on the HPC system I
manage. Everyone in the lab uses GN
I have a process that I want to run on every folder on a file system. The
operation has to happen from the deepest child folder first and then work
back up to the parent. Some folders contain many subfolders, which is where
parallel fits well. However, all child folders must be processed before
their parent.
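One way to get that ordering is to group folders by depth and process the deepest level first, running the folders within a level in parallel. The sketch below does this in plain Python rather than with GNU Parallel; process_tree and handle are hypothetical names, and the thread pool is an assumption about how the per-folder work would be run. It waits for a whole level to finish before moving up, which is stricter than needed but guarantees every child is done before its parent.

```python
import os
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def process_tree(root, handle, workers=4):
    """Call handle(path) on every folder under root, children before parents."""
    root = os.path.abspath(root)

    # Group every folder by its depth below root (root itself is depth 0).
    by_depth = defaultdict(list)
    for dirpath, _dirnames, _filenames in os.walk(root):
        rel = os.path.relpath(dirpath, root)
        depth = 0 if rel == "." else rel.count(os.sep) + 1
        by_depth[depth].append(dirpath)

    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Deepest level first. Folders on the same level run in parallel.
        for depth in sorted(by_depth, reverse=True):
            # list() waits for the whole level (and re-raises any error)
            # before the next, shallower level starts.
            list(pool.map(handle, by_depth[depth]))


if __name__ == "__main__":
    process_tree(".", print)
```

The same grouping could feed GNU Parallel instead: write one list of folders per depth and run parallel once per list, deepest list first.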