Package: src:dbcsr
Version: 2.8.0-2
Severity: important
Tags: ftbfs trixie sid

Dear team:

This package used to FTBFS on single-cpu systems, so I did a team upload
and added this line to debian/rules:

export PRTE_MCA_rmaps_default_mapping_policy=:oversubscribe

which is completely standard among scientific packages dealing with OpenMPI.


Now the package finally builds ok on the AWS instances with a single-CPU
on which I tried, but paradoxically, now it fails with timeout on
AWS instances with 2 CPUs, as in the log below.

So, I request the help from other team members to find a solution which works
everywhere.

Note: Before the change, the package used to take 3 minutes and a half to build.
Now it takes 40 minutes to build. Should I just extend the timeout?

--------------------------------------------------------------------------------
[...]
 debian/rules clean
dh clean --buildsystem=cmake
   dh_auto_clean -O--buildsystem=cmake
   dh_autoreconf_clean -O--buildsystem=cmake
   dh_clean -O--buildsystem=cmake
 debian/rules binary
dh binary --buildsystem=cmake
   dh_update_autotools_config -O--buildsystem=cmake
   dh_autoreconf -O--buildsystem=cmake
   dh_auto_configure -O--buildsystem=cmake
        cd obj-x86_64-linux-gnu && DEB_PYTHON_INSTALL_LAYOUT=deb 
PKG_CONFIG=/usr/bin/pkg-config cmake -DCMAKE_INSTALL_PREFIX=/usr 
-DCMAKE_BUILD_TYPE=None -DCMAKE_INSTALL_SYSCONFDIR=/etc 
-DCMAKE_INSTALL_LOCALSTATEDIR=/var -DCMAKE_EXPORT_NO_PACKAGE_REGISTRY=ON 
-DCMAKE_FIND_USE_PACKAGE_REGISTRY=OFF 
-DCMAKE_FIND_PACKAGE_NO_PACKAGE_REGISTRY=ON 
-DFETCHCONTENT_FULLY_DISCONNECTED=ON "-GUnix Makefiles" 
-DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_INSTALL_LIBDIR=lib/x86_64-linux-gnu ..
-- The C compiler identification is GNU 14.2.0
-- The CXX compiler identification is GNU 14.2.0
-- Detecting C compiler ABI info

[... snipped ...]

18:  
-------------------------------------------------------------------------------
18:  # multiplications                              1
18:  max memory usage/rank             278.441984E+06
18:  # max total images/rank                        2
18:  # max 3D layers                                1
18:  # MPI messages exchanged                       2
18:  MPI messages size (bytes):
18:   total size                         1.808000E+03
18:   min size                         840.000000E+00
18:   max size                         968.000000E+00
18:   average size                     904.000000E+00
18:  MPI breakdown and total messages size (bytes):
18:              size <=      128                   0                        0
18:        128 < size <=     8192                   2                     1808
18:       8192 < size <=    32768                   0                        0
18:      32768 < size <=   131072                   0                        0
18:     131072 < size <=  4194304                   0                        0
18:    4194304 < size <= 16777216                   0                        0
18:   16777216 < size                               0                        0
18:  
-------------------------------------------------------------------------------
18: 
18:  *** WARNING in dbcsr_mm.F:301 :: Using a non-square number of MPI ranks ***
18:  *** might lead to poor performance. Used ranks: 2 Suggested: 1 4        ***
18: 
18:  
-------------------------------------------------------------------------------
18:  -                                                                          
   -
18:  -                      DBCSR MESSAGE PASSING PERFORMANCE                   
   -
18:  -                                                                          
   -
18:  
-------------------------------------------------------------------------------
18:  ROUTINE             CALLS      AVE VOLUME [Bytes]
18:  MP_Bcast               19                     12.
18:  MP_Allreduce           59                     26.
18:  MP_Alltoall             2                     48.
18:  MP_ISend                6                    315.
18:  MP_IRecv                6                    287.
18:  
-------------------------------------------------------------------------------
18/19 Test #18: dbcsr_test ............................................   
Passed    1.00 sec
test 19
      Start 19: dbcsr_tensor_test

19: Test command: /usr/bin/mpiexec "-n" "2" "./dbcsr_tensor_test"
19: Working Directory: /<<PKGBUILDDIR>>/obj-x86_64-linux-gnu/tests
19: Environment variables: 
19:  OMP_NUM_THREADS=2
19: Test timeout computed to be: 1500
19: 
[c7a-large-1742981929][[11101,1],0][../../../../../../opal/mca/btl/tcp/btl_tcp_proc.c:266:mca_btl_tcp_proc_create_interface_graph]
 Unable to find reachable pairing between local and remote interfaces
19: 
[c7a-large-1742981929][[11101,1],1][../../../../../../opal/mca/btl/tcp/btl_tcp_proc.c:266:mca_btl_tcp_proc_create_interface_graph]
 Unable to find reachable pairing between local and remote interfaces
19: pgrid3-dimensions:
19: 2 1 1 
19: 
19: pgrid4-dimensions:
19: 2 1 1 1 
19: 
19: dist11:
19: 0 1 0 1 
19: 
19: dist12:
19: 0 0 0 0 0 0 0 0 0 0 0 
19: 
19: dist13:
19: 0 0 0 0 0 0 0 0 0 
19: 
19: dist21:
19: 0 1 0 1 
19: 
19: dist22:
19: 0 0 0 0 0 0 0 0 0 0 0 
19: 
19: dist23:
19: 0 0 0 0 0 
19: 
19: dist24:
19: 0 0 0 
19: 
19: dist31:
19: 0 1 0 1 0 1 0 1 0 
19: 
19: dist32:
19: 0 0 0 0 0 
19: 
19: dist33:
19: 0 0 0 
19: 
19: Creating dist objects...
19: 
19: Creating tensors...
19: Tensor 1
19: 
19: Filling Tensor...
19: Dimension: 3
19: Reserving blocks...
19: Tensor 2
19: 
19: Filling Tensor...
19: Dimension: 4
19: Reserving blocks...
19: Tensor 3
19: 
19: Filling Tensor...
19: Dimension: 3
19: Reserving blocks...
19: Contracting...
19: 
--------------------------------------------------------------------------------
19: DBCSR TENSOR CONTRACTION: (13|2) x (54|21) = (3|45)
19: 
--------------------------------------------------------------------------------
19:  GLOBAL INFO OF (13|2)
19:    block dimensions:      4    11     9
19:    full dimensions:       25      83      74
19:    process grid dimensions:      2     1     1
19:    Block sizes:
19:        Dim 1:  3  9 12  1 
19:        Dim 2:  4  2  3  1  9  2 32 10  5  8  7 
19:        Dim 3:  7  3  8  7  9  5 10 23  2 
19:    Block distribution:
19:        Dim 1:   0   1   0   1 
19:        Dim 2:   0   0   0   0   0   0   0   0   0   0   0 
19:        Dim 3:   0   0   0   0   0   0   0   0   0 
19:  DISTRIBUTION OF (13|2)
19:               Number of non-zero blocks:                                    
  32
19:               Percentage of non-zero blocks:                                
8.08
19:               Average number of blocks per CPU:                             
  16
19:               Maximum number of blocks per CPU:                             
  20
19:               Average number of matrix elements per CPU:                    
8155
19:               Maximum number of matrix elements per CPU:                    
9756
19:  GLOBAL INFO OF (54|21)
19:    block dimensions:      4    11     5     3
19:    full dimensions:       25      83      32      28
19:    process grid dimensions:      2     1     1     1
19:    Block sizes:
19:        Dim 1:  3  9 12  1 
19:        Dim 2:  4  2  3  1  9  2 32 10  5  8  7 
19:        Dim 3:  8  1  4 13  6 
19:        Dim 4:  4  2 22 
19:    Block distribution:
19:        Dim 1:   0   1   0   1 
19:        Dim 2:   0   0   0   0   0   0   0   0   0   0   0 
19:        Dim 3:   0   0   0   0   0 
19:        Dim 4:   0   0   0 
19:  DISTRIBUTION OF (54|21)
19:               Number of non-zero blocks:                                    
  36
19:               Percentage of non-zero blocks:                                
5.45
19:               Average number of blocks per CPU:                             
  18
19:               Maximum number of blocks per CPU:                             
  21
19:               Average number of matrix elements per CPU:                   
74745
19:               Maximum number of matrix elements per CPU:                  
132456
19:  INDEX INFO
19:               tensor index: (bca) x (bcde) = (ade)
19:               matrix index: (ba|c) x (ed|cb) = (a|de)
19:  aligning tensor index with data
19:  INDEX INFO
19:               tensor index: (bac) x (edcb) = (ade)
19:               matrix index: (ba|c) x (ed|cb) = (a|de)
19:  large tensors: 1, 2; small tensor: 3
19:  sorting contraction indices
19:  compatibility of (13|2): Not compatible
19:  compatibility of (54|21): Transposed
19:  No redistribution of (54|21)
19:  Redistribution of (13|2) compatible with (54|21)
19:  compatibility of (13|2): Transposed
19:  compatibility of (3|45): Not compatible
19:  Redistribution of (3|45)
19:  compatibility of (3|45): Normal
19:  INDEX INFO
19:               tensor index: (bac) x (edcb) = (ade)
19:               matrix index: (a|cb) x (ed|cb) = (a|ed)
19:  GLOBAL INFO OF (13|2)
19:    block dimensions:      4     9    11
19:    full dimensions:       25      74      83
19:    process grid dimensions:      2     1     1
19:    Block sizes:
19:        Dim 1:  3  9 12  1 
19:        Dim 2:  7  3  8  7  9  5 10 23  2 
19:        Dim 3:  4  2  3  1  9  2 32 10  5  8  7 
19:    Block distribution:
19:        Dim 1:   0   1   0   1 
19:        Dim 2:   0   0   0   0   0   0   0   0   0 
19:        Dim 3:   0   0   0   0   0   0   0   0   0   0   0 
19:  DISTRIBUTION OF (13|2)
19:               Number of non-zero blocks:                                    
  32
19:               Percentage of non-zero blocks:                                
8.08
19:               Average number of blocks per CPU:                             
  16
19:               Maximum number of blocks per CPU:                             
  20
19:               Average number of matrix elements per CPU:                    
8155
19:               Maximum number of matrix elements per CPU:                    
9756
19: 
--------------------------------------------------------------------------------
19: DBCSR TAS MATRIX MULTIPLICATION: (13|2) matrix x (54|21) matrix = (3|45) 
matrix
19: 
--------------------------------------------------------------------------------
19:  mm dims:            9           44           15
19:  MM PARAMETERS
19:    Est. number of matrix elements per CPU of result matrix:                 
5269
19:    Est. optimal split factor:                                               
   2
19:  No redistribution of (13|2) matrix and (54|21) matrix
19:  Change split factor of (13|2) matrix : No
19:  Change split factor of (54|21) matrix : No
19:  mm case: -- x --T = +
19:  SPLIT / PARALLELIZATION INFO
19:    splitting columns by factor   2
19:    global grid sizes:   1x   2
19:    grid sizes on subgroups:   1x   1
19:  GLOBAL INFO OF (13|2) matrix
19:    block dimensions:            9          44
19:    full dimensions:             74          2075
19:    process grid dimensions:          1         2
19:    Block sizes:
19:        Row:   7    3    8    7    9    5   10   23    2 
19:        Column:  12    6    9    3   27    6   96   30   15   24   21   36   
18   27    9   81   18  288   90   45   72   63   48   24   36   12  108   24  
384  120   60   96   84    4    2    3    1    9    2   32   10    5    8    7 
19:    Block distribution:
19:        Row:   0    0    0    0    0    0    0    0    0 
19:        Column:   0    0    0    0    0    0    0    0    0    0    0    1   
 1    1    1    1    1    1    1    1    1    1    0    0    0    0    0    0   
 0    0    0    0    0    1    1    1    1    1    1    1    1    1    1    1 
19:  GLOBAL INFO OF (54|21) matrix
19:    block dimensions:           15          44
19:    full dimensions:            896          2075
19:    process grid dimensions:          1         2
19:    Block sizes:
19:        Row:  32   16  176    4    2   22   16    8   88   52   26  286   24 
  12  132 
19:        Column:  12    6    9    3   27    6   96   30   15   24   21   36   
18   27    9   81   18  288   90   45   72   63   48   24   36   12  108   24  
384  120   60   96   84    4    2    3    1    9    2   32   10    5    8    7 
19:    Block distribution:
19:        Row:   0    0    0    0    0    0    0    0    0    0    0    0    0 
   0    0 
19:        Column:   0    0    0    0    0    0    0    0    0    0    0    1   
 1    1    1    1    1    1    1    1    1    1    0    0    0    0    0    0   
 0    0    0    0    0    1    1    1    1    1    1    1    1    1    1    1 
19:  GLOBAL INFO OF (3|45) matrix
19:    block dimensions:            9          15
19:    full dimensions:             74           896
19:    process grid dimensions:          1         2
19:    Block sizes:
19:        Row:   7    3    8    7    9    5   10   23    2 
19:        Column:  32   16  176    4    2   22   16    8   88   52   26  286   
24   12  132 
19:    Block distribution:
19:        Row:   0    0    0    0    0    0    0    0    0 
19:        Column:   0    1    0    0    1    1    0    1    1    0    1    1   
 0    0    0 
19:    Change process grid: No
19:  DISTRIBUTION OF (13|2) matrix
19:               Number of non-zero blocks:                                    
  32
19:               Percentage of non-zero blocks:                                
8.08
19:               Average number of blocks per group:                           
  16
19:               Maximum number of blocks per group:                           
  20
19:               Average number of matrix elements per group:                  
8155
19:               Maximum number of matrix elements per group:                  
9756
19:               Average number of blocks per CPU:                             
  16
19:               Maximum number of blocks per CPU:                             
  20
19:               Average number of matrix elements per CPU:                    
8155
19:               Maximum number of matrix elements per CPU:                    
9756
19:  DISTRIBUTION OF (54|21) matrix
19:               Number of non-zero blocks:                                    
  36
19:               Percentage of non-zero blocks:                                
5.45
19:               Average number of blocks per group:                           
  18
19:               Maximum number of blocks per group:                           
  21
19:               Average number of matrix elements per group:                 
74745
19:               Maximum number of matrix elements per group:                
132456
19:               Average number of blocks per CPU:                             
  18
19:               Maximum number of blocks per CPU:                             
  21
19:               Average number of matrix elements per CPU:                   
74745
19:               Maximum number of matrix elements per CPU:                  
132456
19:  DISTRIBUTION OF (3|45) matrix replicated
19:               Number of non-zero blocks:                                    
  23
19:               Percentage of non-zero blocks:                                
8.52
19:               Average number of blocks per group:                           
  12
19:               Maximum number of blocks per group:                           
  13
19:               Average number of matrix elements per group:                  
5351
19:               Maximum number of matrix elements per group:                  
7306
19:               Average number of blocks per CPU:                             
  12
19:               Maximum number of blocks per CPU:                             
  13
19:               Average number of matrix elements per CPU:                    
5351
19:               Maximum number of matrix elements per CPU:                    
7306
19:               Row distribution on subgroup:
19:                 0   0   0   0   0   0   0   0   0 
19:               Column distribution on subgroup:
19:                 0   0   0   0   0   0   0   0   0   0   0   0   0   0   0 
19:  MM PARAMETERS
19:    Number of matrix elements per CPU of result matrix:                      
3653
19:    Optimal split factor:                                                    
   2
19: 
--------------------------------------------------------------------------------
19: TAS MATRIX MULTIPLICATION DONE
19: 
--------------------------------------------------------------------------------
19:  GLOBAL INFO OF (3|45)
19:    block dimensions:      9     5     3
19:    full dimensions:       74      32      28
19:    process grid dimensions:      2     1     1
19:    Block sizes:
19:        Dim 1:  7  3  8  7  9  5 10 23  2 
19:        Dim 2:  8  1  4 13  6 
19:        Dim 3:  4  2 22 
19:    Block distribution:
19:        Dim 1:   0   0   0   1   0   1   0   1   1 
19:        Dim 2:   0   0   0   0   0 
19:        Dim 3:   0   0   0 
19:  DISTRIBUTION OF (3|45)
19:               Number of non-zero blocks:                                    
  21
19:               Percentage of non-zero blocks:                               
15.56
19:               Average number of blocks per CPU:                             
  11
19:               Maximum number of blocks per CPU:                             
  13
19:               Average number of matrix elements per CPU:                    
5269
19:               Maximum number of matrix elements per CPU:                    
9052
19:  GLOBAL INFO OF (3|45)
19:    block dimensions:      9     5     3
19:    full dimensions:       74      32      28
19:    process grid dimensions:      2     1     1
19:    Block sizes:
19:        Dim 1:  7  3  8  7  9  5 10 23  2 
19:        Dim 2:  8  1  4 13  6 
19:        Dim 3:  4  2 22 
19:    Block distribution:
19:        Dim 1:   0   1   0   1   0   1   0   1   0 
19:        Dim 2:   0   0   0   0   0 
19:        Dim 3:   0   0   0 
19:  DISTRIBUTION OF (3|45)
19:               Number of non-zero blocks:                                    
  29
19:               Percentage of non-zero blocks:                               
21.48
19:               Average number of blocks per CPU:                             
  15
19:               Maximum number of blocks per CPU:                             
  15
19:               Average number of matrix elements per CPU:                    
6431
19:               Maximum number of matrix elements per CPU:                    
9322
19: 
--------------------------------------------------------------------------------
19: TENSOR CONTRACTION DONE
19: 
--------------------------------------------------------------------------------
19: Testing get_info for Tensor 1...
19: Name: (13|2)
19: Data_type: 3
19: ======= Process: 0 ========
19: Total number of blocks:
19: 4 11 9 
19: 
19: Total number of elements:
19: 25 83 74 
19: 
19: Total number of local blocks:
19: 2 11 9 
19: 
19: Total number of local elements:
19: 15 83 74 
19: 
19: Pgrid dimensions:
19: 2 1 1 
19: 
19: Process coordinates:
19: 0 0 0 
19: 
19: blks_local:
19: 0 2 
19: 
19: 0 1 2 3 4 5 6 7 8 9 10 
19: 
19: 0 1 2 3 4 5 6 7 8 
19: 
19: proc_dist:
19: 0 1 0 1 
19: 
19: 0 0 0 0 0 0 0 0 0 0 0 
19: 
19: 0 0 0 0 0 0 0 0 0 
19: 
19: blk_size:
19: 3 9 12 1 
19: 
19: 4 2 3 1 9 2 32 10 5 8 7 
19: 
19: 7 3 8 7 9 5 10 23 2 
19: 
19: blk_offset:
19: 0 3 12 24 
19: 
19: 0 4 6 9 10 19 21 53 63 68 76 
19: 
19: 0 7 10 18 25 34 39 49 72 
19: 
19: ======= Process: 1 ========
19: Total number of blocks:
19: 4 11 9 
19: 
19: Total number of elements:
19: 25 83 74 
19: 
19: Total number of local blocks:
19: 2 11 9 
19: 
19: Total number of local elements:
19: 10 83 74 
19: 
19: Pgrid dimensions:
19: 2 1 1 
19: 
19: Process coordinates:
19: 1 0 0 
19: 
19: blks_local:
19: 1 3 
19: 
19: 0 1 2 3 4 5 6 7 8 9 10 
19: 
19: 0 1 2 3 4 5 6 7 8 
19: 
19: proc_dist:
19: 0 1 0 1 
19: 
19: 0 0 0 0 0 0 0 0 0 0 0 
19: 
19: 0 0 0 0 0 0 0 0 0 
19: 
19: blk_size:
19: 3 9 12 1 
19: 
19: 4 2 3 1 9 2 32 10 5 8 7 
19: 
19: 7 3 8 7 9 5 10 23 2 
19: 
19: blk_offset:
19: 0 3 12 24 
19: 
19: 0 4 6 9 10 19 21 53 63 68 76 
19: 
19: 0 7 10 18 25 34 39 49 72 
19: 
19: Testing get_mapping_info for Tensor 1...
19: ndim_nd = 3
19: ndim1_2d = 2
19: ndim2_2d = 1
19: dims_2d_i8: 36 11 
19: 
19: dims_2d: 36 11 
19: 
19: dims_nd: 
19: 3 10 8 
19: 
19: dims1_2d: 
19: 3 8 
19: 
19: dims2_2d: 
19: 10 
19: 
19: map1_2d: 
19: 0 2 
19: 
19: map2_2d: 
19: 1 
19: 
19: map_nd: 
19: 0 2 1 
19: 
19: Base: 1
19: col_major 1
19: 
19: Testing c_dbcsr_t_contract_index...
19: 
19: Result Indices on Rank 0
19: 0 3 1 
19: 1 0 0 
19: 1 3 2 
19: 2 1 0 
19: 2 3 1 
19: 4 1 0 
19: 4 2 1 
19: 4 3 1 
19: 4 3 2 
19: 6 0 0 
19: 6 3 1 
19: 6 3 2 
19: 6 4 2 
19: Result Indices on Rank 1
19: 3 2 1 
19: 5 0 0 
19: 5 2 0 
19: 5 2 1 
19: 7 1 0 
19: 7 1 2 
19: 7 2 0 
19: 7 2 1 
19/19 Test #19: dbcsr_tensor_test .....................................   
Passed    2.55 sec

95% tests passed, 1 tests failed out of 19

Total Test time (real) = 2334.56 sec

The following tests FAILED:
         11 - dbcsr_unittest1 (Timeout)
Errors while running CTest
make[2]: *** [Makefile:74: test] Error 8
make[2]: Leaving directory '/<<PKGBUILDDIR>>/obj-x86_64-linux-gnu'
dh_auto_test: error: cd obj-x86_64-linux-gnu && make -j1 test ARGS\+=--verbose 
ARGS\+=-j1 returned exit code 2
make[1]: *** [debian/rules:21: override_dh_auto_test] Error 25
make[1]: Leaving directory '/<<PKGBUILDDIR>>'
make: *** [debian/rules:12: binary] Error 2
dpkg-buildpackage: error: debian/rules binary subprocess returned exit status 2
--------------------------------------------------------------------------------

The above is just how the build ends and not necessarily the most relevant part.
If required, the full build log is available here:

https://people.debian.org/~sanvila/build-logs/202503/

About the archive rebuild: The build was made on virtual machines from AWS,
using sbuild and a reduced chroot with only build-essential packages.

If you could not reproduce the bug please contact me privately, as I
am willing to provide ssh access to a virtual machine where the bug is
fully reproducible.

If this is really a bug in one of the build-depends, please use
reassign and add an affects on src:dbcsr, so that this is still
visible in the BTS web page for this package.

Thanks.

Reply via email to