Package: libopenmpi-dev
Version: 4.1.3-1
Severity: serious
Justification: debci failure (mpi4py)
Control: affects -1 src:mpi4py

mpi4py is failing IO tests with the new openmpi 4.1.3.

i386 has actually been failing test_io.TestIOSelf for some time, but
now the problem has expanded to amd64 and other arches.

The problem (exposed in mpi4py tests) has been reported at
https://github.com/mpi4py/mpi4py/issues/105
We checked that the same tests are passing with mpich, which I why I'm
filing this bug against openmpi rather than mpi4py

The mpi4py maintainer can reproduce the error (after building on
Fedora without ucx, which was causing other problems) using
OMPI_MCA_io=romio321.  He observes that the default OMPIO component
failed to load at runtime, so ROMIO is being used instead.

He provides a backtrace:

$ OMPI_MCA_io=romio321 valgrind -q python test/main.py -q -i test_io -k 
TestIOSelf.testIReadIWriteAll -v
[0@kw61149] Python 3.10.4 (/usr/bin/python)
[0@kw61149] numpy 1.21.5 (/usr/lib64/python3.10/site-packages/numpy)
[0@kw61149] MPI 3.1 (Open MPI 4.1.3)
[0@kw61149] mpi4py 4.0.0.dev0 
(/home/dalcinl/Devel/mpi4py/build/lib.linux-x86_64-3.10/mpi4py)
testIReadIWriteAll (test_io.TestIOSelf) ... ==3623039== Jump to the invalid 
address stated on the next line
==3623039==    at 0x0: ???
==3623039==    by 0x1DA8E8B3: mca_io_romio_dist_MPI_File_iwrite_all 
(iwrite_all.c:58)
==3623039==    by 0x1DA8BAF5: mca_io_romio321_file_iwrite_all 
(io_romio321_file_write.c:204)
==3623039==    by 0x1600D5D5: PMPI_File_iwrite_all (pfile_iwrite_all.c:83)
==3623039==    by 0x15DBACA9: PyMPI_File_iwrite_all_c (largecnt.h:2377)
==3623039==    by 0x15ECAA01: __pyx_pf_6mpi4py_3MPI_4File_62Iwrite_all 
(MPI.c:171765)
==3623039==    by 0x15ECA843: __pyx_pw_6mpi4py_3MPI_4File_63Iwrite_all 
(MPI.c:171700)
==3623039==    by 0x498A50F: method_vectorcall_VARARGS_KEYWORDS 
(descrobject.c:344)
==3623039==    by 0x497CBA2: UnknownInlinedFun (abstract.h:114)
==3623039==    by 0x497CBA2: UnknownInlinedFun (abstract.h:123)
==3623039==    by 0x497CBA2: UnknownInlinedFun (ceval.c:5867)
==3623039==    by 0x497CBA2: _PyEval_EvalFrameDefault (ceval.c:4198)
==3623039==    by 0x497B5FF: UnknownInlinedFun (pycore_ceval.h:46)
==3623039==    by 0x497B5FF: _PyEval_Vector (ceval.c:5065)
==3623039==    by 0x49918F7: UnknownInlinedFun (call.c:342)
==3623039==    by 0x49918F7: UnknownInlinedFun (abstract.h:114)
==3623039==    by 0x49918F7: method_vectorcall (classobject.c:53)
==3623039==    by 0x497C7C6: UnknownInlinedFun (abstract.h:114)
==3623039==    by 0x497C7C6: UnknownInlinedFun (abstract.h:123)
==3623039==    by 0x497C7C6: UnknownInlinedFun (ceval.c:5867)
==3623039==    by 0x497C7C6: _PyEval_EvalFrameDefault (ceval.c:4213)
==3623039==  Address 0x0 is not stack'd, malloc'd or (recently) free'd
==3623039== 
[kw61149:3623039] *** Process received signal ***
[kw61149:3623039] Signal: Segmentation fault (11)
[kw61149:3623039] Signal code: Invalid permissions (2)
[kw61149:3623039] Failing at address: (nil)
[kw61149:3623039] [ 0] /lib64/libc.so.6(+0x42750)[0x4bff750]
[kw61149:3623039] *** End of error message ***
Segmentation fault (core dumped)





-- System Information:
Debian Release: bookworm/sid
  APT prefers unstable
  APT policy: (500, 'unstable'), (1, 'experimental')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 5.16.0-6-amd64 (SMP w/8 CPU threads; PREEMPT)
Kernel taint flags: TAINT_OOT_MODULE
Locale: LANG=en_AU.UTF-8, LC_CTYPE=en_AU.UTF-8 (charmap=UTF-8), 
LANGUAGE=en_AU:en
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages libopenmpi-dev depends on:
ii  gfortran [gfortran-mod-15]     4:11.2.0-2
ii  gfortran-11 [gfortran-mod-15]  11.2.0-19
ii  gfortran-9 [gfortran-mod-15]   9.4.0-5
ii  libevent-dev                   2.1.12-stable-1
ii  libhwloc-dev                   2.7.1-1
ii  libibverbs-dev                 39.0-1+b1
ii  libjs-jquery                   3.6.0+dfsg+~3.5.13-1
ii  libjs-jquery-ui                1.13.1+dfsg-1
ii  libopenmpi3                    4.1.3-1
ii  libpmix-dev                    4.1.2-2
ii  openmpi-bin                    4.1.3-1
ii  openmpi-common                 4.1.3-1
ii  zlib1g-dev                     1:1.2.11.dfsg-4

Versions of packages libopenmpi-dev recommends:
ii  libcoarrays-openmpi-dev  2.9.3-1

Versions of packages libopenmpi-dev suggests:
pn  openmpi-doc  <none>

-- no debconf information

Reply via email to