Joe McDonnell created IMPALA-12745:
--------------------------------------

             Summary: dump_breakpad_symbols.py's parallelism doesn't work with 
RPM/DEBs
                 Key: IMPALA-12745
                 URL: https://issues.apache.org/jira/browse/IMPALA-12745
             Project: IMPALA
          Issue Type: Bug
          Components: Infrastructure
    Affects Versions: Impala 4.4.0
            Reporter: Joe McDonnell


When using the "-r" or "--pkg" option, dump_breakpad_symbols.py is extracting 
the RPM/DEB into a temporary directory. The lifetime of that temporary 
directory is maintained by yielding tuples from enumerate_pkg_files(). When 
using parallelism (added in IMPALA-11511), the yield doesn't keep the temporary 
directory around while the parallel threads are processing, so they fail with:

 
{noformat}
Found debugging info in 
/tmp/tmpqfZ9MZ/usr/lib/debug/usr/lib/impala/sbin-retail/impalad.debug
Failed to open ELF file 
'/tmp/tmpqfZ9MZ/usr/lib/debug/usr/lib/impala/sbin-retail/impalad.debug': No 
such file or directory
Failed to write symbol file.
{noformat}
 

Testing shows that this is still a problem with num_processes=1, so there 
should also be a change to be able to turn off the ThreadPool entirely. 
Processing OS packages can force the parallelism off for now as they don't 
benefit much from parallelism.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to