raulcd opened a new issue, #46343: URL: https://github.com/apache/arrow/issues/46343
### Describe the bug, including details regarding any error messages, version, and platform. The CI job `AMD64 Conda Python 3.10 Without Pandas` has started failing on `test_gdb.py` with several `UnicodeDecodeError` `invalid continuation byte`. Example of job failures from different PRs and main: - https://github.com/apache/arrow/actions/runs/14878658706/job/41781302772?pr=46261 - https://github.com/apache/arrow/actions/runs/14878500696/job/41780789650 - https://github.com/apache/arrow/actions/runs/14876650269/job/41775137379 The log is quite long so I am adding only one of the test failures but there are several so please check the output of the jobs: ``` ______________________________ test_scalars_heap _______________________________ gdb_arrow = <pyarrow.tests.test_gdb.GdbSession object at 0x7f722c8add10> def test_scalars_heap(gdb_arrow): > check_heap_repr(gdb_arrow, "heap_null_scalar", "arrow::NullScalar") opt/conda/envs/arrow/lib/python3.10/site-packages/pyarrow/tests/test_gdb.py:728: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ opt/conda/envs/arrow/lib/python3.10/site-packages/pyarrow/tests/test_gdb.py:242: in check_heap_repr s = gdb.print_value(f"*{expr}") opt/conda/envs/arrow/lib/python3.10/site-packages/pyarrow/tests/test_gdb.py:143: in print_value out = self.run_command(f"p {expr}") opt/conda/envs/arrow/lib/python3.10/site-packages/pyarrow/tests/test_gdb.py:137: in run_command return self.wait_until_ready() _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ self = <pyarrow.tests.test_gdb.GdbSession object at 0x7f722c8add10> def wait_until_ready(self): """ Record output until the gdb prompt displays. Return recorded output. """ # TODO: add timeout? while (not self.last_stdout_line.startswith(b"(gdb) ") and self.proc.poll() is None): block = self.proc.stdout.read(4096) if self.verbose: sys.stdout.buffer.write(block) sys.stdout.buffer.flush() block, sep, last_line = block.rpartition(b"\n") if sep: self.last_stdout.append(self.last_stdout_line) self.last_stdout.append(block + sep) self.last_stdout_line = last_line else: assert block == b"" self.last_stdout_line += last_line if self.proc.poll() is not None: raise IOError("gdb session terminated unexpectedly") > out = b"".join(self.last_stdout).decode('utf-8') E UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 46: invalid continuation byte opt/conda/envs/arrow/lib/python3.10/site-packages/pyarrow/tests/test_gdb.py:122: UnicodeDecodeError ----------------------------- Captured stdout call ----------------------------- p *heap_null_scalar _______________________________ test_array_data ________________________________ ``` ### Component(s) Continuous Integration, Python -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org