On 3/24/20 12:49 PM, Barrett, Richard F via Chapel-users wrote:
Greetings, Chaps,
I have some questions regarding parallel IO, in the context of some basic (1d
block decomposed) arrays. Apologies in advance if I'm missing this information
in the documentation or examples:
1) Can order be enforced for writing? Writing within a forall loop, each core
writes its part of an array, apparently simultaneously, and so things are
interleaved. I tried this:
const IOHINT_PARALLEL = QIO_HINT_PARALLEL;
var MatrixOutput = open(output_matrix_filename, iomode.cw, IOHINT_PARALLEL);
var AdjMatChannel = MatrixOutput.writer();
forall i in AdjMatrix.dom_nnz {
  // AdjMatrix is a record; can't "ref" a field? Guess it doesn't really matter.
  AdjMatChannel.writeln(AdjMatrix.rowidx[i], " ", AdjMatrix.colidx[i]);
}
I'd recommend using start/end offsets when creating the output channels, so
that each task in the forall writes to a different portion of the file. This
kind of thing is easier to do with binary I/O, but it can be done with text
I/O by first computing the number of bytes that would be written by each
task, doing a scan to compute the starting offsets, and then actually writing
in a second pass. It would be nice if the I/O system supported "appending",
and issue #9992 covers some future work in this area. (But I'm not so sure it
would do what you want in that setting either.)
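The two-pass idea above can be sketched as follows. This is a minimal,
self-contained sketch (the arrays rowidx/colidx stand in for the fields of
AdjMatrix from the original question, and the filename is made up); it opens
one channel per element for clarity, whereas in practice you would compute
per-task chunks and open one channel per task:

```chapel
use IO;

config const n = 8;
var rowidx, colidx: [0..#n] int;

// Pass 1: compute how many bytes each entry will occupy as text.
var lineBytes: [0..#n] int;
forall i in 0..#n do
  lineBytes[i] = "%i %i\n".format(rowidx[i], colidx[i]).numBytes;

// An exclusive scan gives each entry's starting byte offset in the file.
const offsets = (+ scan lineBytes) - lineBytes;

// Pass 2: write each entry at its precomputed offset; the start/end
// arguments give each channel a disjoint region of the file.
var f = open("matrix.txt", iomode.cw);
forall i in 0..#n {
  var ch = f.writer(start=offsets[i], end=offsets[i]+lineBytes[i]);
  ch.writef("%i %i\n", rowidx[i], colidx[i]);
  ch.close();
}
f.close();
```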
2) So far I'm only running on-node. Any expectations/tips for multinode, in
particular useful means you've found for controlling writing, reading, and
otherwise managing I/O?
The I/O system currently supports writing from one node to a channel/file on
another node, but it is slow. I have long had a TODO to add "remote file,
local channel" support, where you could buffer locally but operate on a
remote file. That would help a lot with performance in such a case. But until
we have that, for now you need to create a file per locale (and probably a
channel per task).
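The file-per-locale pattern can be sketched like this (the filenames are
made up for illustration); each locale opens and writes its own file, so no
channel ever operates on a remote file:

```chapel
use IO;

// One file per locale: each locale writes only to local storage, avoiding
// the slow remote-channel path described above.
coforall loc in Locales do on loc {
  var f = open("part-%i.dat".format(here.id), iomode.cw);
  var ch = f.writer();
  ch.writeln("data from locale ", here.id);
  ch.close();
  f.close();
}
```

For more parallelism within a locale, each task could additionally open its
own channel on the locale's file with disjoint start/end offsets.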
3) For multinode, is it possible to configure writing to N files, where N =
one per node, one per subset of nodes, or one global file?
a) I do intend to read the file back in and operate on it; if reading with
the same number of locales, I expect N = one per node to work, but otherwise
I expect N = 1 to be necessary. Correct?
In addition to using the built-in IO functionality, the HDF5 module
https://chapel-lang.org/docs/latest/modules/packages/HDF5.html provides
an interface for reading/writing arrays from/to HDF5 files in parallel.
It can read/write a block distributed array into one file in parallel
using:
https://chapel-lang.org/docs/master/modules/packages/HDF5/IOusingMPI.html
This requires both HDF5 and MPI as it uses MPI I/O to do the
parallel/distributed operations.
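Under the assumption that the IOusingMPI interface works as described on the
linked docs page (the function name hdf5WriteDistributedArray and the exact
signature should be checked against the docs for your Chapel version), using
it might look roughly like this:

```chapel
use BlockDist, HDF5.IOusingMPI;

// Sketch: write a Block-distributed array to a single HDF5 file in
// parallel via MPI I/O. Requires building with HDF5 and MPI available.
const D = {1..100} dmapped Block({1..100});
var A: [D] real;
hdf5WriteDistributedArray(A, "out.h5", "/dataset");
```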
4) At this point I’m writing text, but will switch to binary once confident
things are working. Any tips in this regard?
If you use the HDF5 interface it will write the arrays in binary by
default. If you're using the built-in I/O system switching to binary
should make things easier since the array elements will all be the same
size.
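With the built-in I/O system, switching to binary is mostly a matter of
opening a binary channel; a minimal sketch (filename made up):

```chapel
use IO;

// Fixed-size binary records make file offsets trivial: element i lives
// at byte offset i * numBytes(int), which simplifies parallel writes.
var A: [0..#10] int;
var f = open("data.bin", iomode.cw);
var ch = f.writer(kind=iokind.native);  // native-endian binary channel
ch.write(A);
ch.close();
f.close();
```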
David
Suggestions, experiences, etc. much appreciated.
Richard
_______________________________________________
Chapel-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/chapel-users