Re: [PR] Add IPC fuzz regression file [arrow-testing]

2024-08-29 Thread via GitHub
mapleFU commented on PR #101: URL: https://github.com/apache/arrow-testing/pull/101#issuecomment-2318233262 > By the way, do you want to have access to OSS-Fuzz? Aha, I would like to -- This is an automated message from the Apache Git Service. To respond to the message, please log o

Re: [PR] Add IPC fuzz regression file [arrow-testing]

2024-08-29 Thread via GitHub
pitrou commented on PR #101: URL: https://github.com/apache/arrow-testing/pull/101#issuecomment-2318256265 > > By the way, do you want to have access to OSS-Fuzz? > > Aha, I would like to Ok, you need a Google or GMail address that we can add to https://github.com/google/oss-fu

Re: [PR] Add IPC fuzz regression file [arrow-testing]

2024-08-29 Thread via GitHub
mapleFU commented on PR #101: URL: https://github.com/apache/arrow-testing/pull/101#issuecomment-2318298735 Added here: https://github.com/google/oss-fuzz/pull/12425

[I] Add CODEOWNERS [arrow-go]

2024-08-29 Thread via GitHub
assignUser opened a new issue, #26: URL: https://github.com/apache/arrow-go/issues/26 Unless you are just going to watch the whole repo, @zeroshade?

[I] Update .asf.yaml [arrow-go]

2024-08-29 Thread via GitHub
assignUser opened a new issue, #27: URL: https://github.com/apache/arrow-go/issues/27 Enable branch protection and other gh features as desired (e.g. enable only squash+merge). Documentation is here: https://cwiki.apache.org/confluence/display/INFRA/Git+-+.asf.yaml+features

Re: [I] Update .asf.yaml [arrow-go]

2024-08-29 Thread via GitHub
assignUser closed issue #27: Update .asf.yaml URL: https://github.com/apache/arrow-go/issues/27

Re: [I] [CI][C++] Potential integer overflow in PoolBuffer [arrow]

2024-08-29 Thread via GitHub
pitrou closed issue #43885: [CI][C++] Potential integer overflow in PoolBuffer URL: https://github.com/apache/arrow/issues/43885

Re: [I] [Python] Create Python examples of HTTP GET Arrow client/server supporting multipart/mixed response [arrow]

2024-08-29 Thread via GitHub
ianmcook closed issue #40598: [Python] Create Python examples of HTTP GET Arrow client/server supporting multipart/mixed response URL: https://github.com/apache/arrow/issues/40598

[PR] Add .asf.yaml [arrow-testing]

2024-08-29 Thread via GitHub
kou opened a new pull request, #102: URL: https://github.com/apache/arrow-testing/pull/102 This is based on https://github.com/apache/arrow-go/blob/main/.asf.yaml . I want to use github@ not issues@ for PR related notifications like our other repositories do. For example,

Re: [I] adbc_ingest for snowflake dropping rows when called repeatedly [arrow-adbc]

2024-08-29 Thread via GitHub
lidavidm closed issue #2094: adbc_ingest for snowflake dropping rows when called repeatedly URL: https://github.com/apache/arrow-adbc/issues/2094

[I] [Go] Fatal Error in pqarrow.writeDenseArrow : invalid pointer [arrow-go]

2024-08-29 Thread via GitHub
Niivii opened a new issue, #28: URL: https://github.com/apache/arrow-go/issues/28 Got a runtime error for this code in Python with polars and adbc engine to Snowflake: ```python with adbc_driver_snowflake.dbapi.connect(db_kwargs={ "username": user, "adbc.snowflake.sql.acc

[I] [Go] Schema inference on `RecordFromJSON` and `TableFromJSON` functions [arrow-go]

2024-08-29 Thread via GitHub
agchang opened a new issue, #30: URL: https://github.com/apache/arrow-go/issues/30 ### Describe the enhancement requested I am interested in support for schema inference in the `RecordFromJSON` and `TableFromJSON` functions, as these currently require an `arrow.Schema` up front. I ca

[I] [Go] Improve 32-bit support (`GOARCH=386`) [arrow-go]

2024-08-29 Thread via GitHub
joellubi opened a new issue, #32: URL: https://github.com/apache/arrow-go/issues/32 ### Describe the enhancement requested There are several parts of the code that currently fail on 32-bit systems like 386. 1. One problem is that the `refCount int64` we currently use for refer
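One well-known 32-bit hazard with a plain `int64` reference count is alignment: the `sync/atomic` documentation notes that on 386 and 32-bit ARM the caller must arrange 64-bit alignment for 64-bit words accessed atomically. Whether this is exactly the failure the issue goes on to describe is not visible in the truncated text above, so the following is only a sketch under that assumption. The `refCounted` type and its methods are hypothetical and are not the arrow-go implementation; `atomic.Int64` (Go 1.19+) is documented to be 64-bit aligned on all platforms.

```go
package main

import (
	"fmt"
	"sync/atomic"
)

// refCounted is a hypothetical stand-in for a reference-counted object.
type refCounted struct {
	// On 386, a small leading field like this can leave a plain int64
	// at a 4-byte offset, which is not safe for 64-bit atomic access.
	released bool
	// atomic.Int64 is guaranteed to be 64-bit aligned on every platform.
	refCount atomic.Int64
}

// newRefCounted returns an object holding one reference.
func newRefCounted() *refCounted {
	r := &refCounted{}
	r.refCount.Store(1)
	return r
}

// Retain adds one reference.
func (r *refCounted) Retain() { r.refCount.Add(1) }

// Release drops one reference and marks the object released at zero.
func (r *refCounted) Release() {
	if r.refCount.Add(-1) == 0 {
		r.released = true
	}
}

func main() {
	r := newRefCounted()
	r.Retain()
	r.Release()
	r.Release()
	fmt.Println(r.released, r.refCount.Load())
}
```

Using the typed atomic avoids having to keep the counter as the first field of every struct that embeds it.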

[I] [Go][Parquet] Does this implementation support page indexes? [arrow-go]

2024-08-29 Thread via GitHub
jhump opened a new issue, #33: URL: https://github.com/apache/arrow-go/issues/33 I would like to create page indexes (per [this doc](https://github.com/apache/parquet-format/blob/master/PageIndex.md)) for a column when writing a parquet file, and then use that index to seek to a particular

[I] [Go] FlightSQL Stateless Prepared Statements handling of `Any` protobuf messages [arrow-go]

2024-08-29 Thread via GitHub
matthewmturner opened a new issue, #34: URL: https://github.com/apache/arrow-go/issues/34 ### Describe the bug, including details regarding any error messages, version, and platform. We just upgraded our Go FlightSQL client to point to master so that we can use the recently added sta

[I] [Go] Allow adding existing arrays into structs [arrow-go]

2024-08-29 Thread via GitHub
sfc-gh-pfus opened a new issue, #35: URL: https://github.com/apache/arrow-go/issues/35 ### Describe the enhancement requested Hi! We have a use case in which we have existing Arrow arrays and we want to compose them into a struct. Unfortunately, currently `StructBuilder` does not suppor

[I] [Go] BinaryBuilder AppendValues/AppendStringValues don't invoke ReserveData internally [arrow-go]

2024-08-29 Thread via GitHub
drahnreb1 opened a new issue, #36: URL: https://github.com/apache/arrow-go/issues/36 ### Describe the usage question you have. Please include as many useful details as possible. AppendValues and AppendStringValues of BinaryBuilder invoke `Reserve` internally, but don't invoke `R

[I] [Go] Need help on reading parquet from S3 [arrow-go]

2024-08-29 Thread via GitHub
Zeeyi13 opened a new issue, #37: URL: https://github.com/apache/arrow-go/issues/37 Hi team, I would like to read a parquet file from S3 with high performance. Is there any hint or an example for me to start with? I have some ideas, but not sure which one is recommended or any better

[I] [Go][Parquet] Looking for Memory-friendly way to seek & extract data from parquet columns [arrow-go]

2024-08-29 Thread via GitHub
jo-me opened a new issue, #38: URL: https://github.com/apache/arrow-go/issues/38 ### Describe the usage question you have. Please include as many useful details as possible. Hi, being new to Apache Arrow I'm a little confused about the different options to interact with

[I] [Go] How to use prepared statements with timestamps [arrow-go]

2024-08-29 Thread via GitHub
si-fernandes opened a new issue, #39: URL: https://github.com/apache/arrow-go/issues/39 ### Describe the usage question you have. Please include as many useful details as possible. Hello, I'm trying to implement a prepared statement that has a timestamp parameter. I hav

[I] [Go] Random segmentation faults when calling Read() on a pqarrow.RecordReader [arrow-go]

2024-08-29 Thread via GitHub
reiades opened a new issue, #29: URL: https://github.com/apache/arrow-go/issues/29 Hello! I am currently using `github.com/apache/arrow/go/v16/parquet` to read the records of a downloaded s3 parquet file (75KB, stored in `bytes.Buffer`). My implementation is the following:

[I] [Go] reflect.SliceHeader is deprecated as of go-1.20, unsafe.SliceData is recommended instead. [arrow-go]

2024-08-29 Thread via GitHub
dr2chase opened a new issue, #40: URL: https://github.com/apache/arrow-go/issues/40 ### Describe the bug, including details regarding any error messages, version, and platform. `bitutils.go` and `type_traits_*.go` cast a slice address to a `*reflect.SliceHeader` to extract the poi
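As a rough illustration of the replacement named in the issue title, the sketch below reinterprets a `[]int32` as a `[]byte` using `unsafe.SliceData` and `unsafe.Slice` (Go 1.20+) rather than a `*reflect.SliceHeader` cast. The helper name `int32sToBytes` is hypothetical and is not taken from `bitutils.go` or `type_traits_*.go`.

```go
package main

import (
	"fmt"
	"unsafe"
)

// int32sToBytes returns a []byte view over the memory of v without copying.
// The result aliases v, so writes through either slice are visible in both.
func int32sToBytes(v []int32) []byte {
	if len(v) == 0 {
		return nil
	}
	// unsafe.SliceData yields the pointer that a SliceHeader cast
	// would have read from the Data field.
	p := unsafe.SliceData(v)
	n := len(v) * int(unsafe.Sizeof(int32(0)))
	return unsafe.Slice((*byte)(unsafe.Pointer(p)), n)
}

func main() {
	vals := []int32{1, 2, 3}
	b := int32sToBytes(vals)
	fmt.Println(len(b)) // 3 elements * 4 bytes each
}
```

Because the pointer never passes through a `uintptr` field, the garbage collector keeps tracking the backing array for as long as the byte view is live.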

[I] [Go] Estimate physical size of data accumulated in array.Builder [arrow-go]

2024-08-29 Thread via GitHub
vitalyisaev2 opened a new issue, #42: URL: https://github.com/apache/arrow-go/issues/42 ### Describe the usage question you have. Please include as many useful details as possible. I'm working on a streaming service that accumulates blocks of Arrow data, dumps them into Arrow St

[I] [Go] Replace buffer builders with utilities developed for compute kernels [arrow-go]

2024-08-29 Thread via GitHub
felipecrv opened a new issue, #43: URL: https://github.com/apache/arrow-go/issues/43 ### Describe the enhancement requested Replace `bufBuilder` and `bufferBuilder` with the utilities (by moving them) used by compute kernels [1]. The existing builders have confusing APIs and do

[I] [Go][Parquet] Writing a Parquet file from a slice of structs [arrow-go]

2024-08-29 Thread via GitHub
tschaub opened a new issue, #44: URL: https://github.com/apache/arrow-go/issues/44 ### Describe the usage question you have. Please include as many useful details as possible. I'm hoping to get suggestions on the best way to use the library to write a Parquet file given a slice

[I] [Go] array.RecordFromJSON() method does not return for a significant amount of time for large JSON file [arrow-go]

2024-08-29 Thread via GitHub
AlwaysZhi opened a new issue, #45: URL: https://github.com/apache/arrow-go/issues/45 ### Describe the usage question you have. Please include as many useful details as possible. When I attempt to use the array.RecordFromJSON() method to read a relatively large JSON file (approxi

[I] [Go] Allow prepending dictionary [arrow-go]

2024-08-29 Thread via GitHub
brancz opened a new issue, #47: URL: https://github.com/apache/arrow-go/issues/47 ### Describe the enhancement requested The dictionary builders already have methods to insert whole arrays, but unfortunately they cause a lot of potentially unnecessary CPU time. Take the followi

[I] [Go] Dictionary compaction [arrow-go]

2024-08-29 Thread via GitHub
brancz opened a new issue, #49: URL: https://github.com/apache/arrow-go/issues/49 ### Describe the enhancement requested I don't know if other languages offer this, but I assume this is a common thing needed. My scenario: I have dictionary arrays that traverse a query plan and

[I] [Go][Arrow] Remove `panic` from `arrow.Schema` and `arrow.Metadata` construction [arrow-go]

2024-08-29 Thread via GitHub
ggodik opened a new issue, #46: URL: https://github.com/apache/arrow-go/issues/46 ### Describe the enhancement requested Prefer to return an `error` instead of `panic` as this situation is due to bad user input. Change the signature to return a `Schema`/`Metadata` and an `error`
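The requested signature change might look roughly like the following. This is a sketch only: `metadata`, `newMetadata`, and `errLenMismatch` are hypothetical stand-ins, not the `arrow.Metadata` API, and the length check is just one example of input that a constructor could reject.

```go
package main

import (
	"errors"
	"fmt"
)

// metadata is a hypothetical stand-in for a key/value metadata container.
type metadata struct {
	keys, values []string
}

// errLenMismatch is returned when keys and values cannot be paired up.
var errLenMismatch = errors.New("metadata: keys and values have different lengths")

// newMetadata reports bad input through an error instead of panicking,
// so callers can handle user-supplied data without recover().
func newMetadata(keys, values []string) (metadata, error) {
	if len(keys) != len(values) {
		return metadata{}, fmt.Errorf("%w: %d keys, %d values",
			errLenMismatch, len(keys), len(values))
	}
	// Copy the inputs so later changes by the caller do not leak in.
	return metadata{
		keys:   append([]string(nil), keys...),
		values: append([]string(nil), values...),
	}, nil
}

func main() {
	if _, err := newMetadata([]string{"k"}, nil); err != nil {
		fmt.Println("rejected:", err)
	}
}
```

Wrapping a sentinel error with `%w` lets callers test for the specific failure with `errors.Is` while still seeing the detailed message.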

[I] [Go] Missing example how to read nested data [arrow-go]

2024-08-29 Thread via GitHub
AndreasBergmeier6176 opened a new issue, #51: URL: https://github.com/apache/arrow-go/issues/51 ### Describe the usage question you have. Please include as many useful details as possible. From the API using ColumnChunkReaders it is absolutely incomprehensible to me how one is s

[I] [Go] How to approach to implement Parquet-Go file format [arrow-go]

2024-08-29 Thread via GitHub
hkpeaks opened a new issue, #54: URL: https://github.com/apache/arrow-go/issues/54 ### Describe the usage question you have. Please include as many useful details as possible. Based on current testing my Golang dataframe project which is achieving outstanding performance when da

[I] [Go]: ApproxEqual for strings [arrow-go]

2024-08-29 Thread via GitHub
yevgenypats opened a new issue, #56: URL: https://github.com/apache/arrow-go/issues/56 ### Describe the enhancement requested In lots of databases null characters (`\x00`) are not accepted in strings (for example [postgres](https://stackoverflow.com/questions/28813409/are-null-bytes
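One possible reading of an approximate string comparison, given the point about `\x00` above, is to treat two strings as equal once null bytes are ignored. The truncated text does not show the semantics the issue actually proposes, so both helpers below are hypothetical.

```go
package main

import (
	"fmt"
	"strings"
)

// stripNulls removes every NUL byte from s.
func stripNulls(s string) string {
	return strings.ReplaceAll(s, "\x00", "")
}

// approxEqualStrings is a hypothetical helper: it reports whether a and b
// are equal once NUL bytes are ignored.
func approxEqualStrings(a, b string) bool {
	return stripNulls(a) == stripNulls(b)
}

func main() {
	fmt.Println(approxEqualStrings("ab\x00c", "abc"))
	fmt.Println(approxEqualStrings("abc", "abd"))
}
```

A comparison like this would let a round-trip test pass when the destination database silently drops null bytes on write.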

[I] [Go][FlightSQL] Add support for "grpc+tcp", "grpc+unix" and "grpc+tls" schemes in driver [arrow-go]

2024-08-29 Thread via GitHub
zeroshade opened a new issue, #57: URL: https://github.com/apache/arrow-go/issues/57 ### Describe the enhancement requested Currently the scheme expected for the `database/sql` driver introduced with apache/arrow#34331 only accepts a URI scheme of `flightsql://`. In order to maintain
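A scheme check along the lines requested could be sketched with `net/url`, which accepts `+` inside a scheme. The `supported` set combines the existing `flightsql` scheme with the three from the issue title; `schemeOf` is a hypothetical helper and not part of the driver.

```go
package main

import (
	"fmt"
	"net/url"
)

// supported lists the existing scheme plus the three named in the issue title.
var supported = map[string]bool{
	"flightsql": true,
	"grpc+tcp":  true,
	"grpc+unix": true,
	"grpc+tls":  true,
}

// schemeOf is a hypothetical helper that parses a DSN and returns its
// scheme, or an error if the scheme is not in the supported set.
func schemeOf(dsn string) (string, error) {
	u, err := url.Parse(dsn)
	if err != nil {
		return "", err
	}
	if !supported[u.Scheme] {
		return "", fmt.Errorf("unsupported scheme %q", u.Scheme)
	}
	return u.Scheme, nil
}

func main() {
	for _, dsn := range []string{
		"flightsql://localhost:12345",
		"grpc+tls://localhost:443",
		"http://localhost:80",
	} {
		scheme, err := schemeOf(dsn)
		fmt.Println(scheme, err)
	}
}
```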

[I] [Go]: Add IndexField and InsertField for Schema [arrow-go]

2024-08-29 Thread via GitHub
yevgenypats opened a new issue, #60: URL: https://github.com/apache/arrow-go/issues/60 ### Describe the enhancement requested Similar to the cpp and python library - https://arrow.apache.org/docs/python/generated/pyarrow.Schema.html#pyarrow.Schema `IndexField` and InsertField c

[I] [Go] Support for casting to extension types [arrow-go]

2024-08-29 Thread via GitHub
hermanschaaf opened a new issue, #61: URL: https://github.com/apache/arrow-go/issues/61 ### Describe the enhancement requested Right now it seems like there is support for casting from extension types to their underlying storage types, but not vice versa. The `cast_test.go` te

[I] [Go] Implement Usage of Golang arenas (go1.20+) [arrow-go]

2024-08-29 Thread via GitHub
ivanmemruk opened a new issue, #62: URL: https://github.com/apache/arrow-go/issues/62 ### Describe the enhancement requested Golang v1.20 has added an experimental `arenas` package, which allows more "manual" memory allocation and could be potentially useful for the golang arrow libr

[I] [Go] Date32 and Date64 String() methods return numbers instead of dates [arrow-go]

2024-08-29 Thread via GitHub
hermanschaaf opened a new issue, #63: URL: https://github.com/apache/arrow-go/issues/63 ### Describe the bug, including details regarding any error messages, version, and platform. The String methods on the Go `array.Date32` and `array.Date64` types return strings formatted as the un
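In the Arrow format, Date32 stores days since the UNIX epoch and Date64 stores milliseconds since the UNIX epoch, so a date-formatted string can be derived with the standard `time` package. The helpers below are a hypothetical sketch of that conversion, not the `String` methods in the Go library, and the `2006-01-02` layout is an assumed output format.

```go
package main

import (
	"fmt"
	"time"
)

// formatDate32 renders a Date32 value (days since 1970-01-01) as YYYY-MM-DD.
func formatDate32(days int32) string {
	return time.Unix(int64(days)*86400, 0).UTC().Format("2006-01-02")
}

// formatDate64 renders a Date64 value (milliseconds since 1970-01-01 UTC)
// as YYYY-MM-DD.
func formatDate64(ms int64) string {
	return time.UnixMilli(ms).UTC().Format("2006-01-02")
}

func main() {
	fmt.Println(formatDate32(365))      // 1971-01-01
	fmt.Println(formatDate64(86400000)) // 1970-01-02
}
```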

[I] [Go] Improved building of structs into arrow record [arrow-go]

2024-08-29 Thread via GitHub
gmintoco opened a new issue, #64: URL: https://github.com/apache/arrow-go/issues/64 ### Describe the enhancement requested Hi, I recently made a post on the mailing list but I thought this might make more sense as a location to communicate. I am using Arrow for Go mostly to rea

[I] [Go][Builder] Add AppendFromText method [arrow-go]

2024-08-29 Thread via GitHub
yevgenypats opened a new issue, #65: URL: https://github.com/apache/arrow-go/issues/65 ### Describe the enhancement requested AppendFromText can be useful in CSV parsing and various other scenarios. Full discussion in this [PR](https://github.com/apache/arrow/pull/34585#discussion_r1

[I] [Go][Table] Implement sort function [arrow-go]

2024-08-29 Thread via GitHub
yevgenypats opened a new issue, #66: URL: https://github.com/apache/arrow-go/issues/66 ### Describe the enhancement requested It seems the Python library has sort functionality for tables - https://arrow.apache.org/cookbook/py/data.html#sort-a-table Would it make sense to implemen

[I] [Go][Parquet] reader panics when selectively decrypting columns [arrow-go]

2024-08-29 Thread via GitHub
ahmb84 opened a new issue, #69: URL: https://github.com/apache/arrow-go/issues/69 ### Describe the bug, including details regarding any error messages, version, and platform. Hello, following up on issue apache/arrow#14940, I am leveraging the go arrow library to encrypt/decrypt parq

[I] [Go][Parquet] PlainFixedLenByteArrayEncoder behaves differently from DictFixedLenByteArrayEncoder with null values where schema has Nullable: false [arrow-go]

2024-08-29 Thread via GitHub
asfimport opened a new issue, #71: URL: https://github.com/apache/arrow-go/issues/71 I have created a small repro to illustrate this bug: https://gist.github.com/phillipleblanc/5e3e2d0e6914d276cf9fd79e019581de When writing a Decimal128 array to a Parquet file the pqarrow package will

[I] [Go] Support external schema in ipc readers [arrow-go]

2024-08-29 Thread via GitHub
asfimport opened a new issue, #72: URL: https://github.com/apache/arrow-go/issues/72 (Apologies if I'm imprecise here, I'm still coming up to speed on the arrow details.) The IPC message format describes how data and metadata messages are encapsulated, but it is not a requir

[I] [Go] Optimize memory usage for CSV writer [arrow-go]

2024-08-29 Thread via GitHub
asfimport opened a new issue, #78: URL: https://github.com/apache/arrow-go/issues/78 perhaps not for this PR, but, depending on the number of rows and cols this record contains, this may be a very large allocation, and very big memory chunk. it could be more interesting performance wise t

[I] [Go] tensor doesn't support access to Null bitmap data? [arrow-go]

2024-08-29 Thread via GitHub
asfimport opened a new issue, #79: URL: https://github.com/apache/arrow-go/issues/79 It seems that the tensor interface does not provide access to the Null bitmap data in the underlying array. The root cause appears to be that Data does not have the null bitmap data and associated IsNull /

[I] [Go] Dependency pmezard/go-difflib is unmaintained [arrow-go]

2024-08-29 Thread via GitHub
asfimport opened a new issue, #75: URL: https://github.com/apache/arrow-go/issues/75 I saw this error (transient?) in CI ```Java Parsing meta tags from https://golang.org/x/xerrors?go-get=1 (status code 200) get "golang.org/x/xerrors": found meta tag get.metaImport{Prefix:"

[I] [Go] Panic when importing different versions of Arrow Flight in the same project (v12, v13) [arrow-go]

2024-08-29 Thread via GitHub
phillipleblanc opened a new issue, #48: URL: https://github.com/apache/arrow-go/issues/48 ### Describe the bug, including details regarding any error messages, version, and platform. Importing both `github.com/apache/arrow/go/v12/arrow/flight` and `github.com/apache/arrow/go/v13/arro

[I] [Go] the output is not passed to emitResult in scalarExecutor.executeSpans which may cause the function execute result is wrong [arrow-go]

2024-08-29 Thread via GitHub
bigwhite opened a new issue, #52: URL: https://github.com/apache/arrow-go/issues/52 ### Describe the bug, including details regarding any error messages, version, and platform. When I read the code in executor.go, I found a possible problem: ![image](https://github.com/apache/arr

[I] [IPC/Go] Flatbuffers package declared as internal - Useful with custom MessageReaders [arrow-go]

2024-08-29 Thread via GitHub
peetie2k opened a new issue, #53: URL: https://github.com/apache/arrow-go/issues/53 ### Describe the enhancement requested Hi, While building a custom MessageReader I realized that the flatbuffers package in go is declared internal. Yet, this is of importance when reading the

[I] [Go] ipc writer is not concurrency safe [arrow-go]

2024-08-29 Thread via GitHub
sunpe opened a new issue, #55: URL: https://github.com/apache/arrow-go/issues/55 ### Describe the bug, including details regarding any error messages, version, and platform. The IPC writer is not concurrency safe. When the IPC writer is used concurrently, schema data may be writ

[I] [Go][CI]: Run `gofmt` check in CI [arrow-go]

2024-08-29 Thread via GitHub
yevgenypats opened a new issue, #58: URL: https://github.com/apache/arrow-go/issues/58 ### Describe the enhancement requested This would help save contributors and reviewers time so they can focus on code logic and bugs and offload formatting issues to the CI. ### Component(s)

[I] [Go] enhancement request to expose AnyValue() on Scalar [arrow-go]

2024-08-29 Thread via GitHub
chrisirhc opened a new issue, #67: URL: https://github.com/apache/arrow-go/issues/67 ### Describe the enhancement requested I wanted to gauge interest in a method on the Scalar interface to expose the value via any/interface{} like: ```go type Scalar interface { …

[I] [Go] array.String.Value is unsafe [arrow-go]

2024-08-29 Thread via GitHub
lidavidm opened a new issue, #68: URL: https://github.com/apache/arrow-go/issues/68 ### Describe the bug, including details regarding any error messages, version, and platform. String.Value slices a backing string: ```go func (a *String) Value(i int) string { i = i +

[I] [Go] C Data Interface implementation violates cgo rules by default [arrow-go]

2024-08-29 Thread via GitHub
lidavidm opened a new issue, #70: URL: https://github.com/apache/arrow-go/issues/70 ### Describe the bug, including details regarding any error messages, version, and platform. [cgo requires that C memory cannot contain persistent pointers to Go memory](https://pkg.go.dev/cmd/cgo):

[I] [Go] Use Feature enum [arrow-go]

2024-08-29 Thread via GitHub
asfimport opened a new issue, #73: URL: https://github.com/apache/arrow-go/issues/73 **Reporter**: [Micah Kornfield](https://issues.apache.org/jira/browse/ARROW-9314) / @emkornfield **Note**: *This issue was originally created as [ARROW-9314](https://issues.apache.org/jira/br

[I] [Go][Packaging] Revisit the license file attachment to the go packages [arrow-go]

2024-08-29 Thread via GitHub
asfimport opened a new issue, #74: URL: https://github.com/apache/arrow-go/issues/74 As per https://github.com/apache/arrow/pull/7355#issuecomment-639560475 A nicer solution would be to rename the top level LICENSE.txt to LICENSE, so we wouldn't need to maintain another copy of it.

[I] [Go] ipc.Writer Option to skip appending data buffers [arrow-go]

2024-08-29 Thread via GitHub
asfimport opened a new issue, #76: URL: https://github.com/apache/arrow-go/issues/76 For cases where we have a known shared memory region, it would be great if the ipc.Writer (and by extension ipc.Reader?) had the ability to write out everything but the actual buffers holding the data. That

[I] [Go] Test fuzzer inputs [arrow-go]

2024-08-29 Thread via GitHub
asfimport opened a new issue, #77: URL: https://github.com/apache/arrow-go/issues/77 We are developing a fuzzer-based corpus of malformed IPC inputs https://github.com/apache/arrow-testing/tree/master/data/arrow-ipc-stream The Go implementation should also test against these to

Re: [I] [Java][CI] Flight related failure in the AMD64 Windows Server 2022 Java JDK 11 CI [arrow]

2024-08-29 Thread via GitHub
lidavidm closed issue #43869: [Java][CI] Flight related failure in the AMD64 Windows Server 2022 Java JDK 11 CI URL: https://github.com/apache/arrow/issues/43869

Re: [I] Import commits from apache/arrow [arrow-go]

2024-08-29 Thread via GitHub
kou closed issue #1: Import commits from apache/arrow URL: https://github.com/apache/arrow-go/issues/1

Re: [I] Transfer Go related open issues from apache/arrow [arrow-go]

2024-08-29 Thread via GitHub
kou closed issue #25: Transfer Go related open issues from apache/arrow URL: https://github.com/apache/arrow-go/issues/25

[I] Enable delete branch on merge [arrow-go]

2024-08-29 Thread via GitHub
kou opened a new issue, #81: URL: https://github.com/apache/arrow-go/issues/81 https://cwiki.apache.org/confluence/pages/viewpage.action?spaceKey=INFRA&title=Git+-+.asf.yaml+features#Git.asf.yamlfeatures-Deletebranchonmerge > Add this snippet below so branches get auto-deleted upon PR

[I] Add a package for bigquery plugin [arrow-adbc]

2024-08-30 Thread via GitHub
josevalim opened a new issue, #2110: URL: https://github.com/apache/arrow-adbc/issues/2110 ### What feature or improvement would you like to see? BigQuery support was recently added but there is no Python package for it. :) Thank you!

[I] PyArrow: Parquet files with fp16 columns should be faster to read than fp32 [arrow]

2024-08-30 Thread via GitHub
marcin-krystianc opened a new issue, #43891: URL: https://github.com/apache/arrow/issues/43891 ### Describe the enhancement requested We want to start using fp16 data for our ML workflows. We hoped for disk space savings, reduced RAM consumption, and doubled reading performance. Parq

Re: [I] [Integration][Go] IPC files do not embed a valid IPC stream [arrow]

2024-08-30 Thread via GitHub
joellubi closed issue #43837: [Integration][Go] IPC files do not embed a valid IPC stream URL: https://github.com/apache/arrow/issues/43837

[I] BUG: Process hangs Indefinitely on UnicodeDecodeError When use_threads=True in pyarrow.csv.read_csv [arrow]

2024-08-30 Thread via GitHub
dxdc opened a new issue, #43892: URL: https://github.com/apache/arrow/issues/43892 ### Describe the bug, including details regarding any error messages, version, and platform. ## Summary When using `pyarrow.csv.read_csv` with `ReadOptions(use_threads=True)` and encountering a `Uni

Re: [I] [Python] Hangs on pyarrow.csv.read_csv with use_threads=True and UnicodeDecodeError [arrow]

2024-08-30 Thread via GitHub
dxdc closed issue #43741: [Python] Hangs on pyarrow.csv.read_csv with use_threads=True and UnicodeDecodeError URL: https://github.com/apache/arrow/issues/43741

[I] Getting a 'pyarrow.lib.ArrowTypeError' error following a schema merge on delta table with nested columns [arrow]

2024-08-30 Thread via GitHub
liamphmurphy opened a new issue, #43893: URL: https://github.com/apache/arrow/issues/43893 ### Describe the bug, including details regarding any error messages, version, and platform. Following a schema merge operation involving nested columns, PyArrow seems to struggle with loading

Re: [I] python/adbc_driver_manager: test_lowlevel.py::test_pycapsule failed with PyArrow >= 17 [arrow-adbc]

2024-09-01 Thread via GitHub
lidavidm closed issue #2111: python/adbc_driver_manager: test_lowlevel.py::test_pycapsule failed with PyArrow >= 17 URL: https://github.com/apache/arrow-adbc/issues/2111

Re: [I] R/C++ bindings documentation missing (Error 404) [arrow]

2024-09-01 Thread via GitHub
nealrichardson closed issue #43665: R/C++ bindings documentation missing (Error 404) URL: https://github.com/apache/arrow/issues/43665

[I] [R] format_aggregation() should print options too [arrow]

2024-08-31 Thread via GitHub
nealrichardson opened a new issue, #43894: URL: https://github.com/apache/arrow/issues/43894 ### Describe the enhancement requested We rely on C++ ToString methods for printing Arrow Expressions, which include functions, arguments, and options. But when we print `arrow_dplyr_query` o

[I] python/adbc_driver_manager: test_lowlevel.py::test_pycapsule failed with PyArrow >= 17 [arrow-adbc]

2024-08-31 Thread via GitHub
kou opened a new issue, #2111: URL: https://github.com/apache/arrow-adbc/issues/2111 ### What feature or improvement would you like to see? ```text ../venv-source/lib/python3.12/site-packages/_pytest/python_api.py:1019: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

[I] [Java] Support for Long memory addresses [arrow]

2024-09-02 Thread via GitHub
vibhatha opened a new issue, #43902: URL: https://github.com/apache/arrow/issues/43902 ### Describe the enhancement requested The memory-core module still uses `Integer` instead of `Long` in places to represent memory addresses, indexes, and sizes. This must be

[I] [Python][Packaging] Stop uploading nightly wheels to gemfury [arrow]

2024-09-02 Thread via GitHub
raulcd opened a new issue, #43904: URL: https://github.com/apache/arrow/issues/43904 ### Describe the enhancement requested Once we start uploading nightly wheels to the scientific python channel on anaconda: - https://github.com/apache/arrow/issues/40216 we can drop uploadi

[I] [CI][Python] python_test_emscripten fails with missing PARQUET_TEST_DATA environment variable [arrow]

2024-09-02 Thread via GitHub
raulcd opened a new issue, #43905: URL: https://github.com/apache/arrow/issues/43905 ### Describe the bug, including details regarding any error messages, version, and platform. The [test-conda-python-emscripten](https://github.com/ursacomputing/crossbow/actions/runs/10658679057/job/

[I] [C#][Flight] Add Grpc Call Options support on Flight Client [arrow]

2024-09-02 Thread via GitHub
qmmk opened a new issue, #43907: URL: https://github.com/apache/arrow/issues/43907 ### Describe the enhancement requested Add parameter for Grpc.CallOptions including the following: - Grpc.Core.Metadata headers - System.DateTime deadline -

[I] BigQuery Reads Null Values from Parquet Files Generated with pyarrow Versions > 12.0.1 [arrow]

2024-09-02 Thread via GitHub
matteosdocsity opened a new issue, #43908: URL: https://github.com/apache/arrow/issues/43908 ### Describe the bug, including details regarding any error messages, version, and platform. When using `pyarrow` versions greater than `12.0.1` to write Parquet files that are then loaded in

Re: [I] [R] format_aggregation() should print options too [arrow]

2024-09-02 Thread via GitHub
thisisnic closed issue #43894: [R] format_aggregation() should print options too URL: https://github.com/apache/arrow/issues/43894

[I] [C++] Compute: RowEncoder supports a unsafe_encoded_row to get the std::string_view for row [arrow]

2024-09-02 Thread via GitHub
mapleFU opened a new issue, #43909: URL: https://github.com/apache/arrow/issues/43909 ### Describe the enhancement requested Currently, a RowTable always gets a row through `encoded_row`, which returns an `std::string`. We could support an unsafe_encoded_row, like below: ```c++

Re: [I] [Python] Make NumPy an optional runtime dependency [arrow]

2024-09-02 Thread via GitHub
pitrou closed issue #25118: [Python] Make NumPy an optional runtime dependency URL: https://github.com/apache/arrow/issues/25118

Re: [I] [C++] Compute: Adding more comments for RowEncoder [arrow]

2024-09-02 Thread via GitHub
mapleFU closed issue #43758: [C++] Compute: Adding more comments for RowEncoder URL: https://github.com/apache/arrow/issues/43758

Re: [I] [C++] Compute: Why some ScalarAggregator handles scalar ExecSpan like single value, other like groups value? [arrow]

2024-09-02 Thread via GitHub
mapleFU closed issue #43768: [C++] Compute: Why some ScalarAggregator handles scalar ExecSpan like single value, other like groups value? URL: https://github.com/apache/arrow/issues/43768

[I] [C++] Compute: KeyEncoder support List Type [arrow]

2024-09-02 Thread via GitHub
mapleFU opened a new issue, #43911: URL: https://github.com/apache/arrow/issues/43911 ### Describe the enhancement requested This is part of https://github.com/apache/arrow/issues/43716 . We first need KeyEncoder to support ListType. ### Component(s) C++

Re: [I] [Python] Uploading nightly wheels to the scientific python nightly channel [arrow]

2024-09-02 Thread via GitHub
assignUser closed issue #40216: [Python] Uploading nightly wheels to the scientific python nightly channel URL: https://github.com/apache/arrow/issues/40216

[I] [Python]: OSError: Write out of bounds (offset = 13784, size = 1496) in file of size 14638 [arrow]

2024-09-02 Thread via GitHub
u3Izx9ql7vW4 opened a new issue, #43929: URL: https://github.com/apache/arrow/issues/43929 ### Describe the bug, including details regarding any error messages, version, and platform. Hello, I'm trying to save a table to a memory mapped file, but I'm getting an error: ``

Re: [I] [CI] Remove Python version guard when installing GCS testbench [arrow]

2024-09-02 Thread via GitHub
pitrou closed issue #43883: [CI] Remove Python version guard when installing GCS testbench URL: https://github.com/apache/arrow/issues/43883

[I] [C++] Make ChunkResolver::ResolveMany output a list of ChunkLocations [arrow]

2024-09-02 Thread via GitHub
felipecrv opened a new issue, #43927: URL: https://github.com/apache/arrow/issues/43927 ### Describe the enhancement requested Instead of two index buffers. This helps with cache locality and performance of kernels using it (so far, only the non-merged `"array_take"` implementation

Re: [I] [C++] Add support for Boost 1.86 [arrow]

2024-09-02 Thread via GitHub
kou closed issue #43746: [C++] Add support for Boost 1.86 URL: https://github.com/apache/arrow/issues/43746

Re: [I] [C++] Attach `arrow::ArrayStatistics` to `arrow::ArrayData` [arrow]

2024-09-02 Thread via GitHub
kou closed issue #43797: [C++] Attach `arrow::ArrayStatistics` to `arrow::ArrayData` URL: https://github.com/apache/arrow/issues/43797

Re: [I] python: add CI build with debug python [arrow-adbc]

2024-09-02 Thread via GitHub
lidavidm closed issue #2077: python: add CI build with debug python URL: https://github.com/apache/arrow-adbc/issues/2077

[I] [C#] Use "asf" organization in NuGet [arrow]

2024-09-02 Thread via GitHub
kou opened a new issue, #43930: URL: https://github.com/apache/arrow/issues/43930 ### Describe the enhancement requested https://www.nuget.org/profiles/asf We don't use the organization feature in NuGet now. Some PMC members are owners of https://www.nuget.org/packages/Apache.Ar

Re: [I] [C++] Attach `arrow::ArrayStatistics` to `arrow::Array` [arrow]

2024-09-02 Thread via GitHub
kou closed issue #43666: [C++] Attach `arrow::ArrayStatistics` to `arrow::Array` URL: https://github.com/apache/arrow/issues/43666

Re: [I] [CI][Python] python_test_emscripten fails with missing PARQUET_TEST_DATA environment variable [arrow]

2024-09-03 Thread via GitHub
raulcd closed issue #43905: [CI][Python] python_test_emscripten fails with missing PARQUET_TEST_DATA environment variable URL: https://github.com/apache/arrow/issues/43905

[I] Docker warnings about docker-compose version and PYTHON_IMAGE_TAG default value [arrow]

2024-09-03 Thread via GitHub
lysnikolaou opened a new issue, #43933: URL: https://github.com/apache/arrow/issues/43933 ### Describe the bug, including details regarding any error messages, version, and platform. Running `docker-compose` locally with any of the services in the `docker-compose.yml` file results in

[I] Add Flight JDBC Connection String example [arrow-flight-sql-postgresql]

2024-09-03 Thread via GitHub
edmondop opened a new issue, #190: URL: https://github.com/apache/arrow-flight-sql-postgresql/issues/190 I have tried to connect using the [JDBC driver](https://arrow.apache.org/docs/java/flight_sql_jdbc_driver.html) added to my IntelliJ IDEA, with different combinations of parameters. Th

Re: [I] [C#][Flight] Add Grpc Call Options support on Flight Client [arrow]

2024-09-03 Thread via GitHub
CurtHagenlocher closed issue #43907: [C#][Flight] Add Grpc Call Options support on Flight Client URL: https://github.com/apache/arrow/issues/43907
