Re: [gdal-dev] does gdal support multiple simultaneous writers to raster

Joaquim Luis Fri, 11 Jan 2013 18:04:12 -0800

On 12-01-2013 01:38, Kennedy, Paul wrote:

Hi,
Yes, we are pretty sure we will see a significant benefit. The processing algorithms are CPU bound, not I/O bound. Our digital terrain model interpolations often run for many hours (we do them overnight), but the underlying file is only a few gigabytes. If we split them into multiple files of tiles and run each on a dedicated process, the whole thing is quicker, but this is messy and results in a stitching error.

Many years ago, when I had to do that type of operation due to memory limitations, the trick was to compute each tile larger than needed, let's say 10% wider on each of the 4 sides (except at the borders, of course). The extra zone works as a boundary condition and is stripped at the end. The stripped tiles can then be pasted together to build the final mosaic. I did that with minimum curvature (GMT) interpolation and the final 'gluing' was perfect: it couldn't be noticed, not even with shaded illumination.
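
A minimal sketch of the overlap trick described above, in plain Python. Everything here is an illustrative assumption: `smooth` is a hypothetical 3x3 mean filter standing in for the real interpolator, and `process_tiled` and its `halo` parameter are made-up names. With a purely local filter a one-cell halo reproduces the untiled result exactly; a global method such as minimum curvature would need a wider halo and would only approximate it.

```python
def smooth(grid):
    """3x3 mean filter; neighbours outside the grid are ignored.

    Hypothetical stand-in for a real (much more expensive) interpolator.
    """
    rows, cols = len(grid), len(grid[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            vals = [grid[rr][cc]
                    for rr in range(max(r - 1, 0), min(r + 2, rows))
                    for cc in range(max(c - 1, 0), min(c + 2, cols))]
            out[r][c] = sum(vals) / len(vals)
    return out


def process_tiled(grid, tile, halo):
    """Process the grid tile by tile, padding each tile with `halo` cells."""
    rows, cols = len(grid), len(grid[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r0 in range(0, rows, tile):
        for c0 in range(0, cols, tile):
            r1, c1 = min(r0 + tile, rows), min(c0 + tile, cols)
            # Enlarge the tile on every side, except at the grid borders.
            pr0, pc0 = max(r0 - halo, 0), max(c0 - halo, 0)
            pr1, pc1 = min(r1 + halo, rows), min(c1 + halo, cols)
            padded = [row[pc0:pc1] for row in grid[pr0:pr1]]
            result = smooth(padded)
            # Strip the halo and paste only the tile core into the mosaic.
            for r in range(r0, r1):
                out[r][c0:c1] = result[r - pr0][c0 - pc0:c1 - pc0]
    return out


if __name__ == "__main__":
    grid = [[float((r * 7 + c * 3) % 11) for c in range(10)] for r in range(9)]
    print(process_tiled(grid, tile=4, halo=1) == smooth(grid))
```

Each padded tile depends only on its own slice of the input, so the tiles could be handed to separate processes and only the final paste step needs to be coordinated.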


Joaquim

Another example is gdalwarp. It takes quite some time with a large data set and would be a good candidate for parallelisation, as would gdaladdo.
I believe slower cores, but more of them, in PCs are the future. My PC has 8 but they rarely get used to their potential.
I am certain there are some challenges here, that's why it is interesting ;)
Regards
pk
On 11/01/2013, at 6:54 PM, "Even Rouault" <even.roua...@mines-paris.org> wrote:
Hi,
This is an interesting topic, with many "intersecting" issues to deal with at different levels.
First, are you confident that in the use cases you imagine I/O access won't be the limiting factor? In that case serialization of I/O could be acceptable, and this would just require an API with a dataset-level mutex.
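
As a rough illustration of the dataset-level mutex idea, here is a sketch in Python. It is not GDAL API: `LockedRaster`, `write_rows`, `compute_block` and `fill` are hypothetical names, and the raster is just an in-memory list of rows. Workers compute their blocks independently and only the write itself is serialized by one lock. Threads are used purely to show the locking pattern; in CPython, CPU-bound work would normally need processes to run in parallel.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class LockedRaster:
    """Hypothetical in-memory raster whose writes share one dataset-level lock."""

    def __init__(self, rows, cols):
        self._lock = threading.Lock()
        self._data = [[0.0] * cols for _ in range(rows)]

    def write_rows(self, row_off, block):
        # Only one writer at a time may touch the dataset.
        with self._lock:
            for i, row in enumerate(block):
                self._data[row_off + i] = list(row)

    def snapshot(self):
        with self._lock:
            return [list(row) for row in self._data]


def compute_block(row_off, nrows, cols):
    """Stand-in for the expensive per-block computation (no lock held)."""
    return [[float(r * cols + c) for c in range(cols)]
            for r in range(row_off, row_off + nrows)]


def fill(raster, rows, cols, block_rows, workers=4):
    """Compute blocks concurrently; writes go through the raster's lock."""
    def job(row_off):
        nrows = min(block_rows, rows - row_off)
        raster.write_rows(row_off, compute_block(row_off, nrows, cols))

    with ThreadPoolExecutor(max_workers=workers) as pool:
        list(pool.map(job, range(0, rows, block_rows)))


if __name__ == "__main__":
    raster = LockedRaster(10, 4)
    fill(raster, rows=10, cols=4, block_rows=3)
    print(raster.snapshot()[9])
```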

There are several places where parallel write should be addressed:
- The GDAL core mechanisms that deal with the block cache
- Each GDAL driver where parallel write would be supported. I guess that GDAL drivers should advertise a specific capability
- The low-level library used by the driver. In the case of GTiff, libtiff
And finally, as Frank underlined, there are intrinsic limitations due to the format itself. For a compressed TIFF, at some point, you have to serialize the writing of the tile, because you cannot know in advance the size of the compressed data, or at least have some coordination of the writers so that a "next offset available" is properly synchronized between them. The compression itself could be serialized.
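
The "next offset available" coordination could look roughly like the Python sketch below. It is an assumption-laden toy, not libtiff or GDAL code: `OffsetAllocator` and `write_tiles` are hypothetical names, `zlib` stands in for the tile codec, and the output is an in-memory buffer rather than a TIFF file. Each worker compresses its tile on its own and, once the compressed size is known, reserves a byte range under a lock.

```python
import threading
import zlib
from concurrent.futures import ThreadPoolExecutor


class OffsetAllocator:
    """Hands out non-overlapping byte ranges; the lock guards the next free offset."""

    def __init__(self, start=0):
        self._lock = threading.Lock()
        self._next = start

    def reserve(self, nbytes):
        with self._lock:
            offset = self._next
            self._next += nbytes
            return offset


def write_tiles(tiles, workers=4):
    """Compress tiles concurrently and return (buffer, index).

    `index` maps tile id -> (offset, compressed size).
    """
    alloc = OffsetAllocator()

    def job(item):
        tile_id, raw = item
        packed = zlib.compress(raw)          # size unknown until this returns
        offset = alloc.reserve(len(packed))  # the only synchronized step
        return tile_id, offset, packed

    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(job, tiles.items()))

    # For simplicity the bytes are placed after all workers finish; each
    # range is already private, so the order of placement does not matter.
    buf = bytearray(sum(len(packed) for _, _, packed in results))
    index = {}
    for tile_id, offset, packed in results:
        buf[offset:offset + len(packed)] = packed
        index[tile_id] = (offset, len(packed))
    return bytes(buf), index


if __name__ == "__main__":
    tiles = {i: bytes([i]) * (100 + 50 * i) for i in range(6)}
    buf, index = write_tiles(tiles)
    off, size = index[3]
    print(zlib.decompress(buf[off:off + size]) == tiles[3])
```

Tiles end up in whatever order the reservations happen, which is why an index of offsets and sizes has to be kept alongside the data.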
I'm not sure, however, whether what Jan mentioned, different processes writing the same dataset, is doable.
_______________________________________________
gdal-dev mailing list
gdal-dev@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/gdal-dev
