Re: [I] JWT token is invalid with Snowflake Go Driver - Python [arrow-adbc]

2024-11-26 Thread via GitHub
davlee1972 closed issue #1422: JWT token is invalid with Snowflake Go Driver - Python URL: https://github.com/apache/arrow-adbc/issues/1422 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specif

[I] R package fails to build against arrow 18.1.0: `error: 'ChunkLocation' in namespace 'arrow::internal' does not name a type` [arrow]

2024-11-26 Thread via GitHub
barracuda156 opened a new issue, #44863: URL: https://github.com/apache/arrow/issues/44863 ### Describe the bug, including details regarding any error messages, version, and platform. Worked fine with 18.0.0, now fails: ``` ---> Configuring R-arrow Executing: cd "/opt/loca

Re: [I] [JS][Release] Wrong -.tar.gz name on arrow2csv fails when publishing [arrow]

2024-11-26 Thread via GitHub
amoeba closed issue #44585: [JS][Release] Wrong -.tar.gz name on arrow2csv fails when publishing URL: https://github.com/apache/arrow/issues/44585 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [I] Reading multiple file corrupt values and is also order dependent [arrow-julia]

2024-11-26 Thread via GitHub
quinnj closed issue #534: Reading multiple file corrupt values and is also order dependent URL: https://github.com/apache/arrow-julia/issues/534 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the s

Re: [I] Reading multiple file corrupt values and is also order dependent [arrow-julia]

2024-11-26 Thread via GitHub
quinnj closed issue #534: Reading multiple file corrupt values and is also order dependent URL: https://github.com/apache/arrow-julia/issues/534 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the s

[I] [Feature Request] Support lazy load/zero copy when using shared memory [arrow-go]

2024-11-26 Thread via GitHub
vtk9 opened a new issue, #207: URL: https://github.com/apache/arrow-go/issues/207 ### Describe the enhancement requested **Use case**: there is an arrow table in shared memory. It was written using Python's Arrow `RecordBatchStreamWriter` (although even if it was written using `Recor

[I] [Python] Rewrite pyarrow.jvm using the C data interface [arrow]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #44860: URL: https://github.com/apache/arrow/issues/44860 The `pyarrow.jvm` is currently a custom-written bridge between PyArrow and Arrow Java, with limited datatype support. Now that Java implements the C data interface (see ARROW-12965), we should be able to

Re: [I] Change the default merge message [arrow-java]

2024-11-26 Thread via GitHub
kou closed issue #23: Change the default merge message URL: https://github.com/apache/arrow-java/issues/23 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-ma

Re: [I] [Java] Test fails due to JSON key order [arrow-java]

2024-11-26 Thread via GitHub
assignUser closed issue #228: [Java] Test fails due to JSON key order URL: https://github.com/apache/arrow-java/issues/228 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

[I] [Java] Integrate Maven Spotless to CIs [arrow-java]

2024-11-26 Thread via GitHub
vibhatha opened a new issue, #113: URL: https://github.com/apache/arrow-java/issues/113 ### Describe the enhancement requested Once Maven spotless plugin is integrated to all Java modules as stated https://github.com/apache/arrow/issues/40757 This should be enabled in the CIs along

[I] [Java] Avro to Arrow converter should reuse the same VectorSchemaRoot [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #328: URL: https://github.com/apache/arrow-java/issues/328 When originally reviewing the code I suggested a new VectorSchemaRoot on each call to the iterator.  After further discussions on the mailing list, it seems that this is an anit-pattern for working with

[I] [Java] Development Java Substrait to call JNI Wrappers [arrow-java]

2024-11-26 Thread via GitHub
davisusanibar opened a new issue, #216: URL: https://github.com/apache/arrow-java/issues/216 ### Describe the enhancement requested Review last PoC https://github.com/apache/arrow/issues/34223 Define if this new functionality will be created on: - Extend current Java componen

[I] [Java] VectorSchemaRoot#addVector() cannot add a vector to the end of the current vector collection [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #301: URL: https://github.com/apache/arrow-java/issues/301 The current implementation of Java VectorSchemaRoot cannot add a vector at the end of the current list (which is the generally understood meaning of "add"). The Precondition check in the method's

[I] [Java] JDBC to Arrow converter iterator should reuse the same VectorSchemaRoot [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #327: URL: https://github.com/apache/arrow-java/issues/327 When originally reviewing the code I suggested a new VectorSchemaRoot on each call to the iterator.  After further discussions on the mailing list, it seems that this is an anit-pattern for working with

[I] [Java] Add forward compatibility checks for unrecognized future MetadataVersion [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #330: URL: https://github.com/apache/arrow-java/issues/330 We should have no need of these checks in theory, but they present a safeguard should some years in the future it became necessary to increment the MetadataVersion. **Reporter**: [Bryan Cutler]

[I] [Java] Replace multiple calls to getShort in NettyArrowBuf [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #335: URL: https://github.com/apache/arrow-java/issues/335 As per https://github.com/apache/arrow/pull/7347#discussion_r437116925 replace two calls to getShort with one **Reporter**: [Ryan Murray](https://issues.apache.org/jira/browse/ARROW-9148) / @rymu

[I] JVM failed when use gandiva udf with dynamic libraries [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #326: URL: https://github.com/apache/arrow-java/issues/326 Hi there, Recently I'm trying to add some UDF with dynamic link libaries. It is fine compiling and running test in cpp, but when I call the udf from java, JVM failed with errors. Steps to

[I] [Java] Remove Netty dependency from arrow-vector [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #332: URL: https://github.com/apache/arrow-java/issues/332 Cleanup remaining usage of Netty from arrow-vector and remove as a dependency after ARROW-9300. **Reporter**: [Bryan Cutler](https://issues.apache.org/jira/browse/ARROW-9356) / @BryanCutler ##

[I] [Java] MapWithOrdinal javadoc doesn't describe actual behaviour [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #336: URL: https://github.com/apache/arrow-java/issues/336 MapWithOrdinal states that ordinals are recycled when keys are removed, it does not currently do this and grows unbounded **Reporter**: [Ryan Murray](https://issues.apache.org/jira/browse/ARROW-8

[I] [FlightRPC][Java] CallStatus.metadata should get copied into the gRPC StatusRuntimeException [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #325: URL: https://github.com/apache/arrow-java/issues/325 Reported on the mailing list. ``` I had been doing it that way, but I found that If I was sending something along the lines of: ErrorFlightMetadata errorFlightMetadata = new ErrorFligh

[I] Arrow vector in Java(Scala) allocate byteBuffer error while read the bytes from Python pyarrow [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #321: URL: https://github.com/apache/arrow-java/issues/321 I am using scala arrow 1.0.1 and pyarrow 1.0.1 Following error occurs when scala decode the byte that encoded from python. tried to downgrade pyarrow to 0.17.0, 0.14.1, error still exists.

[I] [Java] ListVector's setValueCount does not set inner vector's value count correctly [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #338: URL: https://github.com/apache/arrow-java/issues/338 In ListVector#setValueCount, we always compute data vector's value count based on lastSet - this is incorrect when valueCount is less than lastSet **Reporter**: [Prudhvi Porandla](https://issues.

[I] Start using feature enums to value (umbrella issue) [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #334: URL: https://github.com/apache/arrow-java/issues/334 Proposed logic: 1.  Add flag where appropriate for supports dictionary replacement if there is a possibility it can be used. 2.  Only add compressed buffers when requested.  

[I] [Java] Improve documentation in regards to basic-usage / memory-management [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #331: URL: https://github.com/apache/arrow-java/issues/331 I'm experimenting with Arrow using Java, C+ and Python  IPC format  (Bytestream, File) and Parquet: I am struggling alot on the Java-side, even  after looking for external resources and some code-reading 

[I] [Flight][Java] Add optional session headers [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #320: URL: https://github.com/apache/arrow-java/issues/320 - Add a client middleware which receives and re-uses a session token from the server. - Add a server middleware which, when encountering a request with no session header, creates a session on the bac

[I] [Java] Use feature enum [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #333: URL: https://github.com/apache/arrow-java/issues/333 **Reporter**: [Micah Kornfield](https://issues.apache.org/jira/browse/ARROW-9310) / @emkornfield Related issues: - [[JS] Use feature enum in javascript](https://github.com/apache/arro

[I] [FlightRPC][Java] Allow using a per-call Arrow allocator [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #329: URL: https://github.com/apache/arrow-java/issues/329 We've been running into issues with Flight and gRPC leaking direct memory at scale. One thing we'd like to do is have a (child) allocator per DoGet/DoPut call, so we can more accurately track memory usa

[I] Arrow vector in Java(Scala) allocate byteBuffer error while read the bytes from Python pyarrow [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #322: URL: https://github.com/apache/arrow-java/issues/322 I am using scala arrow 1.0.1 and pyarrow 1.0.1 Following error occurs when scala decode the byte that encoded from python. tried to downgrade pyarrow to 0.17.0, 0.14.1, error still exists.

[I] [Java] getObject API for temporal types is inconsistent and in some cases incorrect [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #324: URL: https://github.com/apache/arrow-java/issues/324 It seems that the work which has been tracked in ARROW-2015 and merged in to change the return types of the various Time and Date vector types when using the

[I] [Java] Implement non-sparse tensors [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #323: URL: https://github.com/apache/arrow-java/issues/323 We'd like to be able to round-trip NumPy ndarrays through Java, and create tensors in Java that can be eventually mapped to ndarrays in Python. Having even a basic Tensor implementation, with extension

[I] [FlightRPC][C++][Python] Header-based auth in servers [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #319: URL: https://github.com/apache/arrow-java/issues/319 Add support for a middleware that executes header-based authorization to C++ and Python servers. The design should be such that a Flight server can validate incoming headers then generate a peer

[I] Create build scripts and documentation for producing DLLs for JNI interfaces [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #318: URL: https://github.com/apache/arrow-java/issues/318 **Reporter**: [Larry White](https://issues.apache.org/jira/browse/ARROW-16465) / @lwhite1 Related issues: - [Bundle DLLs for JNI interfaces into Maven Jars](https://github.com/apache/

[I] [FlightRPC][Java] Provide standard way to get client IP address [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #316: URL: https://github.com/apache/arrow-java/issues/316 Obtaining the IP address of the client can be [accomplished via gRPC](https://github.com/apache/arrow-cookbook/issues/196), but the gRPC interface exposing this information is considered unstable.  It w

[I] Bundle DLLs for JNI interfaces into Maven Jars [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #317: URL: https://github.com/apache/arrow-java/issues/317 **Reporter**: [Larry White](https://issues.apache.org/jira/browse/ARROW-16466) / @lwhite1 Related issues: - [Create build scripts and documentation for producing DLLs for JNI interfac

[I] [Java][CI] Add testing to current java-jars builds and extend to JDK 11 and 17 [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #315: URL: https://github.com/apache/arrow-java/issues/315 As discussed on when we execute the crossbow build to the java-jars task we are not testing the built jars. Currently we are also on

[I] [Java][C++] Reorganize JNI code from several modules [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #314: URL: https://github.com/apache/arrow-java/issues/314 We have several modules implementing JNI wrappers: 1. gandiva 1. plasma 1. dataset 1. orc We can reorganize all the JNI-related codes, maybe into a single parent module, to s

[I] [CI][Java] Automatically extract dependent library versions from pom.xml for s390x [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #302: URL: https://github.com/apache/arrow-java/issues/302 Current script `java_build.sh` requires a manual update for dependent library versions. **Reporter**: [Kazuaki Ishizaki](https://issues.apache.org/jira/browse/ARROW-17369) / @kiszk PRs an

[I] [Java] Improve testing of Dataset bindings [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #304: URL: https://github.com/apache/arrow-java/issues/304 From https://github.com/apache/arrow/pull/13811 - We should ensure all types are tested - We should organize tests in a way that Parquet, IPC, and eventually CSV/ORC can mostly share test code

[I] [Java][C++] Separate JNI compilation & linking from main arrow CMakeLists [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #311: URL: https://github.com/apache/arrow-java/issues/311 We need to separate the JNI elements from CMakeLists, with related modifications to the CI build scripts likely. Separating the JNI portion serves two related purposes: 1. Simplify building JNI code

[I] [Java] Further Consolidate JNI compilation [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #312: URL: https://github.com/apache/arrow-java/issues/312 See https://issues.apache.org/jira/browse/ARROW-15174 for the original effort that shipped in version 9.0.0 **Reporter**: [Alessandro Molina](https://issues.apache.org/jira/browse/ARROW-16979) /

[I] [CI][Java] Use current cookbooks projects to validate new release versions [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #308: URL: https://github.com/apache/arrow-java/issues/308 Current there are recipes to validate Arrow components at: There are a CI process to create components for new Arrow Java releases:

[I] [Java] Ensure Flight with native Netty transport is actually being tested [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #309: URL: https://github.com/apache/arrow-java/issues/309 There's only one test that exercises the domain socket path and it appears it's getting skipped on CI ``` [INFO] Running org.apache.arrow.flight.TestServerOptions Warning:  Tests run: 5, Fai

[I] [C++][Java] Dataset: Support reading from fixed offset of a file for Parquet format [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #307: URL: https://github.com/apache/arrow-java/issues/307 This adds property start_offset_ and length_ to FileSource and should be functional for Parquet dataset format. Supporting Java and C++ dataset API at this time. **Reporter**: [Jin Chengcheng](h

[I] [Java][Docs] Compare/contrast the Netty and Unsafe memory backends [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #305: URL: https://github.com/apache/arrow-java/issues/305 We should compare why you might want to use each. Are there benchmarks in the Java benchmark suite that might also be useful? **Reporter**: [David Li](https://issues.apache.org/jira/browse/

[I] [Java] ListVector to return same UnionListWriter instead of returning new UnionListWriter [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #306: URL: https://github.com/apache/arrow-java/issues/306 `The {color:#FF}getWriter{color} {color:#4c9aff}ListVector{color} API should return same {color:#4c9aff}UnionListWriter {color:#172b4d}instead of returning new {color:#4c9aff}UnionListWriter{color}

[I] [FlightRPC][Integration] Ensure all RPC methods are covered by integration testing [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #299: URL: https://github.com/apache/arrow-java/issues/299 This would help catch issues like https://github.com/apache/arrow/issues/13853 **Reporter**: [David Li](https://issues.apache.org/jira/browse/ARROW-17568) / @lidavidm Related issues: -

[I] [Java][Docs] Document the use of the batchSize argument in Dataset ScanOptions [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #303: URL: https://github.com/apache/arrow-java/issues/303 Several ScanOptions methods take a batchSize argument as shown:  `public ScanOptions(long batchSize) {` `    this(batchSize, Optional.empty());` ``} Since the scanner reads one ArrowRe

[I] [Java][FlightRPC] Update benchmark to be on par with C++ [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #300: URL: https://github.com/apache/arrow-java/issues/300 See https://github.com/apache/arrow/issues/13980 The Java benchmark isn't comparable out of the box (and it seems like there's an unexplained gap between it and the C++ benchmark) **Reporte

Re: [I] [Java] ArrowFileReader.getRecordBlocks() seems incompatibility with JDK 8 [arrow-java]

2024-11-26 Thread via GitHub
assignUser closed issue #49: [Java] ArrowFileReader.getRecordBlocks() seems incompatibility with JDK 8 URL: https://github.com/apache/arrow-java/issues/49 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

[I] [Java][Memory][Core]: Initialize DEBUG variable in BaseAllocator class separately [arrow-java]

2024-11-26 Thread via GitHub
davisusanibar opened a new issue, #142: URL: https://github.com/apache/arrow-java/issues/142 ### Describe the enhancement requested As we were implementing Java Nullability in https://github.com/apache/arrow/pull/37723, we found that there are many variables that are all tied to the

[I] [FlightRPC][Java] Properties of JDBC driver are not autodiscovered by clients like DBeaver [arrow-java]

2024-11-26 Thread via GitHub
lidavidm opened a new issue, #133: URL: https://github.com/apache/arrow-java/issues/133 ### Describe the enhancement requested When using the JDBC Flight SQL driver with DBeaver, some options are shown, but not the Arrow Flight SQL specific ones like useEncryption. They have to be ad

[I] [Java] NullVector constructor should not accept an arbitrary field type [arrow-java]

2024-11-26 Thread via GitHub
barronw opened a new issue, #119: URL: https://github.com/apache/arrow-java/issues/119 ### Describe the bug, including details regarding any error messages, version, and platform. NullVector constructor should not accept an arbitrary field type. https://github.com/apache/arrow

Re: [I] [Java] Allow Arrow API to read file contents from inputStream instead of only Path [arrow-java]

2024-11-26 Thread via GitHub
assignUser closed issue #47: [Java] Allow Arrow API to read file contents from inputStream instead of only Path URL: https://github.com/apache/arrow-java/issues/47 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL a

[I] [Java] ErrorProne Warnings to be considered as Errors [arrow-java]

2024-11-26 Thread via GitHub
vibhatha opened a new issue, #59: URL: https://github.com/apache/arrow-java/issues/59 ### Describe the enhancement requested Even though we have integrated the ErrorProne module in each module, we still see iterative development cycle introduces more warnings, and to avoid this it wo

[I] [Java] Netty memory release [arrow-java]

2024-11-26 Thread via GitHub
Vincent-bin opened a new issue, #215: URL: https://github.com/apache/arrow-java/issues/215 ### Describe the bug, including details regarding any error messages, version, and platform. Since when we use netty as my memory allocator manager, I found that direct memory is not released a

[I] [Java] Introduce memory leak detector/handler utility to hook on unused unreleased buffers [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #252: URL: https://github.com/apache/arrow-java/issues/252 See previous discussions in mail thread: https://lists.apache.org/thread.html/re9896b902cddc0931e4efbdecf27203710fb87505b63e927eef7ea77%40%3Cdev.arrow.apache.org%3E **Reporter**: [Hongze Zhang](h

[I] [Java] Make sure alignment and memory padding conform to spec [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #298: URL: https://github.com/apache/arrow-java/issues/298 Per spec 8 byte alignment and padding for buffers. **Reporter**: [Micah Kornfield](https://issues.apache.org/jira/browse/ARROW-186) / @emkornfield PRs and other links: - [GitHub Pull Re

[I] [Java] Fix getField() for NullableMapVector [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #291: URL: https://github.com/apache/arrow-java/issues/291 Needs to call super.getField() and return a nullable version of that field. **Reporter**: [Steven Phillips](https://issues.apache.org/jira/browse/ARROW-790) / @StevenMPhillips **Note**: *Th

[I] [Java] Include compiled Flatbuffers sources in release artifacts [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #293: URL: https://github.com/apache/arrow-java/issues/293 I'm seeing the flatc that's available in Maven foul up users because it depends on a version of glibc (2.14) which is too new for many enterprise Linux users. It would be better for the Arrow Java artif

[I] [Java] UnionListWriter does not behave properly when multiple lists are written in the same list [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #296: URL: https://github.com/apache/arrow-java/issues/296 if you run the following code: ```Java ListVector listVector = new ListVector("list", allocator, null); listVector.allocateNew(); UnionListWriter listWriter = new UnionListWriter(l

[I] [Java] Allow loading/unloading vectors without using FieldNodes [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #289: URL: https://github.com/apache/arrow-java/issues/289 The information stored in FieldNode structure is not strictly necessary for serializing/deserializing vectors. We should allow loading/unloading of vectors without it. **Reporter**: [Steven Phil

[I] [Java] Refactor metrics system [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #294: URL: https://github.com/apache/arrow-java/issues/294 Java Arrow has some support for metrics (backed by yammers metrics), but by default, no reporter is registered. Also, it means that Arrow is tied to a specific metric system. I suggest to replace

[I] [Java] ComplexWriter should initialize nested writers when container vector is already populated [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #295: URL: https://github.com/apache/arrow-java/issues/295 It's possible and sometimes useful to use reuse a nested vector that was populated in a previous ComplexWriter. The new ComplexWriter should be aware of the fields that are present in the vector.

[I] [Java] Read and write record batches to shared memory [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #292: URL: https://github.com/apache/arrow-java/issues/292 It would be useful for a Java application to be able to read a record batch as a set of memory mapped byte buffers given a file name and a memory address for the metadata. **Reporter**: [Wes Mc

[I] [Java] Check if ArrowBuf is empty buffer in getActualConsumedMemory() and getPossibleConsumedMemory() [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #290: URL: https://github.com/apache/arrow-java/issues/290 Most of the methods related to memory accounting in ArrowBuf have special handling for the case when then Buffer is the empty buffer instance. This check is missing in these two methods. **Report

[I] [Java] Provide guidance in documentation for using Arrow in an uberjar setting [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #288: URL: https://github.com/apache/arrow-java/issues/288 Currently, ArrowBuf class directly access the package-private fields of AbstractByteBuf class which makes shading Apache Arrow problematic. If we relocate io.netty namespace excluding io.netty.buffer.Ar

[I] [Java] Test fuzzer inputs [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #287: URL: https://github.com/apache/arrow-java/issues/287 We are developing a fuzzer-based corpus of malformed IPC inputs https://github.com/apache/arrow-testing/tree/master/data/arrow-ipc The Java implementation should also test against these to v

[I] [Java] Provide default setting of io.netty.tryReflectionSetAccessible=true [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #274: URL: https://github.com/apache/arrow-java/issues/274 After ARROW-3191, consumers of Arrow Java with a JDK 9 and above are required to set the JVM property "io.netty.tryReflectionSetAccessible=true" at startup, each time Arrow code is run, as documented at

[I] [Java] Handle exceptions when flight service processes put requests [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #286: URL: https://github.com/apache/arrow-java/issues/286 The current way of processing is to swallow the exception silently and print a log. However, this way is not friendly to debugging and problem diagnosis. We need a way to process it explicitly. 

[I] [Java] Incorporate jmh benchmarks into archery [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #284: URL: https://github.com/apache/arrow-java/issues/284 We should be able to detect performance regressions using archery for java related benchmarks. **Reporter**: [Micah Kornfield](https://issues.apache.org/jira/browse/ARROW-6207) / @emkornfield

[I] [Java] How to use RootAllocator in a low memory setting? [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #282: URL: https://github.com/apache/arrow-java/issues/282 When I run this simple code with JVM setting: "-Xmx64m" ```java package com.snowflake; import org.apache.arrow.memory.RootAllocator; public class TestArrow { public static void

[I] [Java] Represent a data element of a vector as a tree of ArrowBufPointer [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #275: URL: https://github.com/apache/arrow-java/issues/275 For a fixed/variable width vector, each of its data element can be represented as an ArrowBufPointer object, which represents a contiguous memory segment. This makes many tasks easier and more efficient

[I] [Java] sbt docker publish fails due to Arrow dependecies [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #283: URL: https://github.com/apache/arrow-java/issues/283 Hello guys I'm using Arrow in my Scala project and included Maven deps in sbt as required. However, when I try to publish a Docker container with sbt 'docker:publish', I get the following

[I] [Java] Experiment with performance difference of avoiding the use of Avro Decoder [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #281: URL: https://github.com/apache/arrow-java/issues/281 It has been posited that the Decoder object (and on-heap work in general) is potentially slow for decoding.   The scope of this Jira is to add a new method that instead of consuming from D

[I] [JAVA][C++]Support Parquet Read and Write in Java [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #279: URL: https://github.com/apache/arrow-java/issues/279 We added a new java interface to support parquet read and write from hdfs or local file. The purpose of this implementation is that when we loading and dumping parquet data in Java, we can only u

[I] [Java][CI] Pre-install the java dependencies in the dockerfile [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #277: URL: https://github.com/apache/arrow-java/issues/277 See the comments: https://github.com/apache/arrow/blob/master/ci/docker/debian-9-java.dockerfile#L25-L28 Or perhaps just use the mounted maven m2 repository. **Reporter**: [Krisztian Szucs](ht

[I] [Java] Bump checkstyle from 8.18 to 8.29 [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #269: URL: https://github.com/apache/arrow-java/issues/269 **Reporter**: [Fokko Driesprong](https://issues.apache.org/jira/browse/ARROW-7688) / @Fokko PRs and other links: - [GitHub Pull Request apache/arrow#6290](https://github.com/apache/ar

[I] [Java] offset buffer for vector of variable-width type with zero value count is empty [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #273: URL: https://github.com/apache/arrow-java/issues/273 I am reporting what I think might be two related bugs in `org.apache.arrow.vector.BaseVariableWidthVector` 1. The offset buffer is initialized as empty. I expect that it to have 4 bytes that represen

[I] [Java] Move ArrowBufPointer out of the java the memory package [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #271: URL: https://github.com/apache/arrow-java/issues/271 The memory package is focused on memory access and management. ArrowBufPointer should be moved to algorithm package as it isn't core to the Arrow memory management primitives. I would further suggest th

[I] [Java] Remove reader index and writer index from ArrowBuf [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #272: URL: https://github.com/apache/arrow-java/issues/272 Reader and writer index and functionality doesn't belong on a chunk of memory and is due to inheritance from ByteBuf. As part of removing ByteBuf inheritance, we should also remove reader and writer ind

[I] [Java] FieldVector getFieldBuffers API should not set reader/writer indices [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #270: URL: https://github.com/apache/arrow-java/issues/270 Per discussion . The fact that we have reader/writer settings in `getFieldBuffers` is wrong. To clarify, `getFieldBuffers` is dist

[I] [Java] Explore using Avatica as basis for Flight JDBC Driver [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #268: URL: https://github.com/apache/arrow-java/issues/268 Explore using Avatica as basis for Flight JDBC Driver to see how suitable it is compared to building the JDBC driver from the ground up. **Reporter**: [Andy Grove](https://issues.apache.org/jira/b

[I] [Python] [ADBC_DRIVER] [FlightSQL] flight: no authorization header on the response (Unknown; AuthenticateBasicToken) when creating new connection. [arrow-java]

2024-11-26 Thread via GitHub
amir-bashir opened a new issue, #116: URL: https://github.com/apache/arrow-java/issues/116 ### Describe the usage question you have. Please include as many useful details as possible. Below is my python code to connect to my flight sql jdbc server. ``` with flight_sql.c

[I] [Java] How to implement cross database queries similar to presto [arrow-java]

2024-11-26 Thread via GitHub
smileyboy2019 opened a new issue, #90: URL: https://github.com/apache/arrow-java/issues/90 ### Describe the usage question you have. Please include as many useful details as possible. The document explanation is not very clear. If I want to cross database query and join two data

[I] [Java][CI] Move java-jars crossbow nightly build to Apache/arrow ASF runners [arrow-java]

2024-11-26 Thread via GitHub
danepitkin opened a new issue, #135: URL: https://github.com/apache/arrow-java/issues/135 ### Describe the enhancement requested This will allow the java-jars job to upload to Develocity[1]. There is enough capacity in the ASF runner pool to accommodate this. Note that the MacOS buil

[I] [Java][Float16] Add IPC integration tests for float16 [arrow-java]

2024-11-26 Thread via GitHub
davisusanibar opened a new issue, #127: URL: https://github.com/apache/arrow-java/issues/127 ### Describe the enhancement requested The Java support for half floats has been implemented at https://github.com/apache/arrow/pull/39681, but it would be helpful if we could add IPC integra

[I] Create a stubbed FlightSQL sample application [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #265: URL: https://github.com/apache/arrow-java/issues/265 The goal of the sample application is to demonstrate how the FlightSQL API can be used for other common business intelligence application with a SQL like interface. **Reporter**: [Tiffany Lam](h

[I] [Python] Rewrite pyarrow.jvm using the C data interface [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #254: URL: https://github.com/apache/arrow-java/issues/254 The `pyarrow.jvm` is currently a custom-written bridge between PyArrow and Arrow Java, with limited datatype support. Now that Java implements the C data interface (see ARROW-12965), we should be able t

[I] [Java] PySpark3 with pandas 1.1.5 and pyarrow 2.0.0 getting the below error [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #256: URL: https://github.com/apache/arrow-java/issues/256 While running pyspark3 with pandas 1.1.5 and pyarrow 2.0.0 getting the below error: **Spark Code:** ```java import pyarrow import pandas as pd df = pd.DataFrame({'col1' : [1,2,

[I] [Java][FlightSQL] Implement GetInfo (DatabaseMetadata) [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #264: URL: https://github.com/apache/arrow-java/issues/264 **Reporter**: [Tiffany Lam](https://issues.apache.org/jira/browse/ARROW-12928) / @tifflhl **Note**: *This issue was originally created as [ARROW-12928](https://issues.apache.org/jira/brow

[I] [Java] MurmurHasher doesn't take the final bytes into account if size % 4 != 0 [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #248: URL: https://github.com/apache/arrow-java/issues/248 Minimal test case: all buffers of size 1 currently hash to the same value (-1684229222) To my (admittedly untrained) eye, it looks like

[I] [Java] AllocationListener should be called during ownership transferring [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #249: URL: https://github.com/apache/arrow-java/issues/249 If we transfer balance of buffer1 from allocator1 to allocator2, onRelease() should be called from allocator1.listener as well as the case that buffer1 is closed without transferring. **Reporter*

[I] [Java][FlightSQL] Implement prepared statement [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #262: URL: https://github.com/apache/arrow-java/issues/262 **Acceptance Criteria** - FlightSQLClient can request a prepared statement for a single query. Parameter binding is not part of this ticket. - FlightSQLProducer can return a FlightSQLClient's prepa

[I] [Java][Dataset] Enable CSV file format [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #259: URL: https://github.com/apache/arrow-java/issues/259 CSV was already supported in C++ Dataset API. This is to enable the format in Java side. **Reporter**: [Hongze Zhang](https://issues.apache.org/jira/browse/ARROW-13569) / @zhztheplayer **N

[I] [Java][FlightSQL] Implement INSERT and UPDATE statements [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #260: URL: https://github.com/apache/arrow-java/issues/260 **Acceptance Criteria** - TODO **Reporter**: [Tiffany Lam](https://issues.apache.org/jira/browse/ARROW-12933) / @tifflhl **Note**: *This issue was originally created as [ARROW-12933](ht

[I] [Java] Dummy NPEs in DefaultAllocationManagerOption [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #258: URL: https://github.com/apache/arrow-java/issues/258 Looking at DefaultAllocationManagerOption [1], I see that we could prevent dummy NPEs when trying to convert an often not-set environment option into enums. In my case, it was a bit painful to have t

[I] [Java] ZSTD compression issue [arrow-java]

2024-11-26 Thread via GitHub
shujiewu opened a new issue, #169: URL: https://github.com/apache/arrow-java/issues/169 ### Describe the bug, including details regarding any error messages, version, and platform. The second parameter of the compressUnsafe function represents the available space in the destination b

[I] [Java] Request for enhancements to BufferAllocator.close() [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #253: URL: https://github.com/apache/arrow-java/issues/253 We are using Arrow in a production environment and running into some issues with memory allocation. Specifically, we would like to be able to call `BufferAllocator.close()` and have that clean up the al

[I] [Java] Add support for sliced arrays in C Data Interface [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #251: URL: https://github.com/apache/arrow-java/issues/251 The Java implementation of the C Data Interface does not support non-0-offset arrays. This means that arrays like pyarrow.array([0, None, 2, 3, 4]).slice(1, 2) cannot be moved to a Java process. This is

[I] [Java] Unexpected Allocator states created after allocating buffer whose AllocationManager has different size from the requested size [arrow-java]

2024-11-26 Thread via GitHub
asfimport opened a new issue, #257: URL: https://github.com/apache/arrow-java/issues/257 If a buffer with altered AllocationManager size (which means, the size is different from request size) is created from a Allocator, then allocator still adds the request size rather than the altered siz

  1   2   3   >