jimjam-slam opened a new issue, #45901:
URL: https://github.com/apache/arrow/issues/45901

   ### Describe the enhancement requested
   
   The `arrow` R package presently (and reasonably) appears to reject ragged 
CSV input:
   
   ```r
   library(arrow)
   
   csv_string <- "name,group,score
   North,A,5
   East,A
   West,B,7
   South
   "
   
   read_csv_arrow(I(csv_string))
   # Error:
   # ! Invalid: CSV parse error: Expected 3 columns, got 2: East,A
   # Run `rlang::last_trace()` to see where the error occurred.
   ```
   
   There is a convert option `include_missing_columns`, but this appears to 
refer to columns that are completely absent from the file, not ones that are 
specified I the header but missing in some (or even all) rows. The 
`strings_can_be_null` option also does not apply to ragged rows.
   
   It would be great if users could supply an option to fill ragged rows with a 
type-relevant missing value (in R I assume that would either be an `NA_*_` for 
atomic types or `NULL` for non-atomic ones, but perhaps a user-supplied value 
would be more appropriate).
   
   R: 4.4.1
   Arrow: 19.0.1
   OS: macOS 15.3.2
   
   ### Component(s)
   
   R


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to