Package: libopenmpi-dev Version: 4.1.3-1 Severity: serious Justification: debci failure (mpi4py) Control: affects -1 src:mpi4py
mpi4py is failing IO tests with the new openmpi 4.1.3. i386 has actually been failing test_io.TestIOSelf for some time, but now the problem has expanded to amd64 and other arches. The problem (exposed in mpi4py tests) has been reported at https://github.com/mpi4py/mpi4py/issues/105 We checked that the same tests are passing with mpich, which I why I'm filing this bug against openmpi rather than mpi4py The mpi4py maintainer can reproduce the error (after building on Fedora without ucx, which was causing other problems) using OMPI_MCA_io=romio321. He observes that the default OMPIO component failed to load at runtime, so ROMIO is being used instead. He provides a backtrace: $ OMPI_MCA_io=romio321 valgrind -q python test/main.py -q -i test_io -k TestIOSelf.testIReadIWriteAll -v [0@kw61149] Python 3.10.4 (/usr/bin/python) [0@kw61149] numpy 1.21.5 (/usr/lib64/python3.10/site-packages/numpy) [0@kw61149] MPI 3.1 (Open MPI 4.1.3) [0@kw61149] mpi4py 4.0.0.dev0 (/home/dalcinl/Devel/mpi4py/build/lib.linux-x86_64-3.10/mpi4py) testIReadIWriteAll (test_io.TestIOSelf) ... ==3623039== Jump to the invalid address stated on the next line ==3623039== at 0x0: ??? ==3623039== by 0x1DA8E8B3: mca_io_romio_dist_MPI_File_iwrite_all (iwrite_all.c:58) ==3623039== by 0x1DA8BAF5: mca_io_romio321_file_iwrite_all (io_romio321_file_write.c:204) ==3623039== by 0x1600D5D5: PMPI_File_iwrite_all (pfile_iwrite_all.c:83) ==3623039== by 0x15DBACA9: PyMPI_File_iwrite_all_c (largecnt.h:2377) ==3623039== by 0x15ECAA01: __pyx_pf_6mpi4py_3MPI_4File_62Iwrite_all (MPI.c:171765) ==3623039== by 0x15ECA843: __pyx_pw_6mpi4py_3MPI_4File_63Iwrite_all (MPI.c:171700) ==3623039== by 0x498A50F: method_vectorcall_VARARGS_KEYWORDS (descrobject.c:344) ==3623039== by 0x497CBA2: UnknownInlinedFun (abstract.h:114) ==3623039== by 0x497CBA2: UnknownInlinedFun (abstract.h:123) ==3623039== by 0x497CBA2: UnknownInlinedFun (ceval.c:5867) ==3623039== by 0x497CBA2: _PyEval_EvalFrameDefault (ceval.c:4198) ==3623039== by 0x497B5FF: UnknownInlinedFun (pycore_ceval.h:46) ==3623039== by 0x497B5FF: _PyEval_Vector (ceval.c:5065) ==3623039== by 0x49918F7: UnknownInlinedFun (call.c:342) ==3623039== by 0x49918F7: UnknownInlinedFun (abstract.h:114) ==3623039== by 0x49918F7: method_vectorcall (classobject.c:53) ==3623039== by 0x497C7C6: UnknownInlinedFun (abstract.h:114) ==3623039== by 0x497C7C6: UnknownInlinedFun (abstract.h:123) ==3623039== by 0x497C7C6: UnknownInlinedFun (ceval.c:5867) ==3623039== by 0x497C7C6: _PyEval_EvalFrameDefault (ceval.c:4213) ==3623039== Address 0x0 is not stack'd, malloc'd or (recently) free'd ==3623039== [kw61149:3623039] *** Process received signal *** [kw61149:3623039] Signal: Segmentation fault (11) [kw61149:3623039] Signal code: Invalid permissions (2) [kw61149:3623039] Failing at address: (nil) [kw61149:3623039] [ 0] /lib64/libc.so.6(+0x42750)[0x4bff750] [kw61149:3623039] *** End of error message *** Segmentation fault (core dumped) -- System Information: Debian Release: bookworm/sid APT prefers unstable APT policy: (500, 'unstable'), (1, 'experimental') Architecture: amd64 (x86_64) Foreign Architectures: i386 Kernel: Linux 5.16.0-6-amd64 (SMP w/8 CPU threads; PREEMPT) Kernel taint flags: TAINT_OOT_MODULE Locale: LANG=en_AU.UTF-8, LC_CTYPE=en_AU.UTF-8 (charmap=UTF-8), LANGUAGE=en_AU:en Shell: /bin/sh linked to /bin/dash Init: systemd (via /run/systemd/system) LSM: AppArmor: enabled Versions of packages libopenmpi-dev depends on: ii gfortran [gfortran-mod-15] 4:11.2.0-2 ii gfortran-11 [gfortran-mod-15] 11.2.0-19 ii gfortran-9 [gfortran-mod-15] 9.4.0-5 ii libevent-dev 2.1.12-stable-1 ii libhwloc-dev 2.7.1-1 ii libibverbs-dev 39.0-1+b1 ii libjs-jquery 3.6.0+dfsg+~3.5.13-1 ii libjs-jquery-ui 1.13.1+dfsg-1 ii libopenmpi3 4.1.3-1 ii libpmix-dev 4.1.2-2 ii openmpi-bin 4.1.3-1 ii openmpi-common 4.1.3-1 ii zlib1g-dev 1:1.2.11.dfsg-4 Versions of packages libopenmpi-dev recommends: ii libcoarrays-openmpi-dev 2.9.3-1 Versions of packages libopenmpi-dev suggests: pn openmpi-doc <none> -- no debconf information