ForceBru opened a new issue, #540:
URL: https://github.com/apache/arrow-julia/issues/540

   Python code that writes the file:
   ```python
   #!/usr/bin/env -S uv run --script
   # /// script
   # requires-python = ">=3.11"
   # dependencies = ["polars"]
   # ///
   import polars as pl
   
   pl.DataFrame({'text': "this is some text".split()}).write_ipc("data.arrow")
   ```
   
   Polars can read this file:
   ```
   >>> import polars as pl
   >>> pl.read_ipc("data.arrow")
   shape: (4, 1)
   ┌──────┐
   │ text │
   │ ---  │
   │ str  │
   ╞══════╡
   │ this │
   │ is   │
   │ some │
   │ text │
   └──────┘
   >>>
   ```
   
   Arrow.jl reads garbage:
   ```julia
   julia> import Pkg; Pkg.status()
   Status `~/tmp/Project.toml`
     [69666777] Arrow v2.8.0
     [a93c6f00] DataFrames v1.7.0
   
   julia> using DataFrames; import Arrow
   
   julia> DataFrame(Arrow.Table("./data.arrow"))
   4×1 DataFrame
    Row │ text     
        │ String?  
   ─────┼──────────
      1 │ W1\0\0
      2 │ \xf2\xff
      3 │ \v\0\b\0
      4 │ \b\0\b\0
   
   julia> 
   ```
   
   __Issue__: this is not at all what Polars wrote to the file


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to