[GitHub] [pinot] codecov-commenter commented on pull request #11710: Fix Direct Memory OOM on broker by limiting query response size

2023-09-28 Thread via GitHub
codecov-commenter commented on PR #11710: URL: https://github.com/apache/pinot/pull/11710#issuecomment-1740388935 ## [Codecov](https://app.codecov.io/gh/apache/pinot/pull/11710?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache) R

[GitHub] [pinot] gortiz commented on pull request #11710: Fix Direct Memory OOM on broker by limiting query response size

2023-09-28 Thread via GitHub
gortiz commented on PR #11710: URL: https://github.com/apache/pinot/pull/11710#issuecomment-1740385931 I've added some notes, but I think the PR is in good shape -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [pinot] gortiz commented on a diff in pull request #11710: Fix Direct Memory OOM on broker by limiting query response size

2023-09-28 Thread via GitHub
gortiz commented on code in PR #11710: URL: https://github.com/apache/pinot/pull/11710#discussion_r1340953692 ## pinot-integration-tests/src/test/java/org/apache/pinot/integration/tests/OfflineClusterIntegrationTest.java: ## @@ -602,6 +602,16 @@ public void testTimeFunc(boolean

[GitHub] [pinot] gortiz commented on a diff in pull request #11710: Fix Direct Memory OOM on broker by limiting query response size

2023-09-28 Thread via GitHub
gortiz commented on code in PR #11710: URL: https://github.com/apache/pinot/pull/11710#discussion_r1340952925 ## pinot-core/src/main/java/org/apache/pinot/core/query/scheduler/QueryScheduler.java: ## @@ -168,6 +169,19 @@ protected byte[] processQueryAndSerialize(ServerQueryRequ

[GitHub] [pinot] gortiz commented on a diff in pull request #11710: Fix Direct Memory OOM on broker by limiting query response size

2023-09-28 Thread via GitHub
gortiz commented on code in PR #11710: URL: https://github.com/apache/pinot/pull/11710#discussion_r1340952382 ## pinot-core/src/main/java/org/apache/pinot/core/query/scheduler/QueryScheduler.java: ## @@ -168,6 +169,19 @@ protected byte[] processQueryAndSerialize(ServerQueryRequ

[GitHub] [pinot] codecov-commenter commented on pull request #11709: update map initial capacity

2023-09-28 Thread via GitHub
codecov-commenter commented on PR #11709: URL: https://github.com/apache/pinot/pull/11709#issuecomment-1740361099 ## [Codecov](https://app.codecov.io/gh/apache/pinot/pull/11709?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache) R

[GitHub] [pinot] xiangfu0 commented on a diff in pull request #11574: Fix schema name in table config during controller startup

2023-09-28 Thread via GitHub
xiangfu0 commented on code in PR #11574: URL: https://github.com/apache/pinot/pull/11574#discussion_r1340932616 ## pinot-controller/src/main/java/org/apache/pinot/controller/BaseControllerStarter.java: ## @@ -549,6 +555,69 @@ protected void configure() { _serviceStatusCall

[GitHub] [pinot] dang-stripe commented on pull request #11650: Support omitting time values in SimpleSegmentNameGenerator

2023-09-28 Thread via GitHub
dang-stripe commented on PR #11650: URL: https://github.com/apache/pinot/pull/11650#issuecomment-1740356183 Yes, we still want to use the global sequence ID which isn't supported by the `FixedSegmentNameGenerator`. Another option here is to update `FixedSegmentNameGenerator` to support appe

[GitHub] [pinot] vvivekiyer opened a new pull request, #11710: Throw exception if response size exceeds thresholds

2023-09-28 Thread via GitHub
vvivekiyer opened a new pull request, #11710: URL: https://github.com/apache/pinot/pull/11710 When a server responds to a query with a large response, the broker can potentially crash with a direct memory OOM. In PR https://github.com/apache/pinot/pull/11496 - a fix was added to restar

[GitHub] [pinot] zhtaoxiang opened a new pull request, #11709: update map initial capacity

2023-09-28 Thread via GitHub
zhtaoxiang opened a new pull request, #11709: URL: https://github.com/apache/pinot/pull/11709 label: `performance` update initial capacity to HashUtil.getHashMapCapacity(desired size) to avoid map resizing
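A minimal sketch of the idea behind sizing a map up front, assuming the default `java.util.HashMap` load factor of 0.75. The `capacityFor` helper is a hypothetical stand-in written for illustration; it is not taken from Pinot's `HashUtil.getHashMapCapacity` and may differ from it.

```java
import java.util.HashMap;
import java.util.Map;

public class MapCapacityExample {
  // Hypothetical helper (an assumption, not Pinot's implementation).
  // A HashMap resizes once its size exceeds capacity * loadFactor, so asking
  // for expectedSize / 0.75 buckets lets expectedSize entries fit without a resize.
  public static int capacityFor(int expectedSize) {
    return (int) Math.ceil(expectedSize / 0.75);
  }

  public static void main(String[] args) {
    int expectedSize = 100;
    // Passing expectedSize directly could still trigger a resize while filling;
    // passing the adjusted capacity avoids that.
    Map<String, Integer> map = new HashMap<>(capacityFor(expectedSize));
    for (int i = 0; i < expectedSize; i++) {
      map.put("key" + i, i);
    }
    System.out.println(map.size());
  }
}
```

The adjustment matters because the constructor argument is a bucket count, not an entry count.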

[pinot] branch master updated: Allow GcsPinotFS to work with granular permissions (#11655)

2023-09-28 Thread saurabhd336
This is an automated email from the ASF dual-hosted git repository. saurabhd336 pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/pinot.git The following commit(s) were added to refs/heads/master by this push: new 54a1ca01b5 Allow GcsPinotFS to work with gra

[GitHub] [pinot] saurabhd336 merged pull request #11655: Allow GcsPinotFS to work with granular permissions

2023-09-28 Thread via GitHub
saurabhd336 merged PR #11655: URL: https://github.com/apache/pinot/pull/11655

[GitHub] [pinot] swaminathanmanish commented on pull request #11650: Support omitting time values in SimpleSegmentNameGenerator

2023-09-28 Thread via GitHub
swaminathanmanish commented on PR #11650: URL: https://github.com/apache/pinot/pull/11650#issuecomment-1740308268 > Related to #11649 > > We use the `normalizedDate` segment name generator for an append table w/ a time column. The generators add a min/max time value to the segment nam

[GitHub] [pinot] tibrewalpratik17 commented on a diff in pull request #11705: Add more test for broker jersey bounded thread pool

2023-09-28 Thread via GitHub
tibrewalpratik17 commented on code in PR #11705: URL: https://github.com/apache/pinot/pull/11705#discussion_r1340861053 ## pinot-broker/src/test/java/org/apache/pinot/broker/broker/BrokerManagedAsyncExecutorProviderTest.java: ## @@ -92,6 +94,23 @@ public void testGet() Bl

[GitHub] [pinot] dang-stripe commented on pull request #11650: Support omitting time values in SimpleSegmentNameGenerator

2023-09-28 Thread via GitHub
dang-stripe commented on PR #11650: URL: https://github.com/apache/pinot/pull/11650#issuecomment-1740236292 @swaminathanmanish thanks for reviewing! updated the description to include motivation and responded to your comment

[GitHub] [pinot] dang-stripe commented on a diff in pull request #11650: Support omitting time values in SimpleSegmentNameGenerator

2023-09-28 Thread via GitHub
dang-stripe commented on code in PR #11650: URL: https://github.com/apache/pinot/pull/11650#discussion_r1340840866 ## pinot-plugins/pinot-batch-ingestion/pinot-batch-ingestion-common/src/main/java/org/apache/pinot/plugin/ingestion/batch/common/SegmentGenerationTaskRunner.java: #

[GitHub] [pinot] xiangfu0 merged pull request #11707: correct default log4j2 config file path in helm chart

2023-09-28 Thread via GitHub
xiangfu0 merged PR #11707: URL: https://github.com/apache/pinot/pull/11707

[pinot] branch master updated: correct default log4j2 config file path in helm chart (#11707)

2023-09-28 Thread xiangfu
This is an automated email from the ASF dual-hosted git repository. xiangfu pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/pinot.git The following commit(s) were added to refs/heads/master by this push: new e08e66c399 correct default log4j2 config file pa

[GitHub] [pinot] walterddr commented on issue #11651: LOOKUP not supported in v2 engine

2023-09-28 Thread via GitHub
walterddr commented on issue #11651: URL: https://github.com/apache/pinot/issues/11651#issuecomment-1740146801 here is a proposal for creating lookup join support in the v2 engine: https://docs.google.com/document/d/1cyFrW21MlcbHEEfl46paKRs6v6qHzwOZbIsnFL7UVnw/edit#heading=h.xdbe7h7na0kx

[GitHub] [pinot] Jackie-Jiang commented on pull request #11708: Add more information in RequestContext class

2023-09-28 Thread via GitHub
Jackie-Jiang commented on PR #11708: URL: https://github.com/apache/pinot/pull/11708#issuecomment-1740144719 @siddharthteotia @jackjlli Can you take a look?

[GitHub] [pinot] Jackie-Jiang commented on a diff in pull request #11705: Add more test for broker jersey bounded thread pool

2023-09-28 Thread via GitHub
Jackie-Jiang commented on code in PR #11705: URL: https://github.com/apache/pinot/pull/11705#discussion_r1340762657 ## pinot-broker/src/test/java/org/apache/pinot/broker/broker/BrokerManagedAsyncExecutorProviderTest.java: ## @@ -92,6 +94,23 @@ public void testGet() Blocki

[GitHub] [pinot] kirkrodrigues commented on a diff in pull request #11604: JSON index: Add support for ignoring values longer than a given length.

2023-09-28 Thread via GitHub
kirkrodrigues commented on code in PR #11604: URL: https://github.com/apache/pinot/pull/11604#discussion_r1340749635 ## pinot-segment-local/src/test/java/org/apache/pinot/segment/local/segment/index/JsonIndexTest.java: ## @@ -233,6 +210,83 @@ public void testLargeIndex() }

[GitHub] [pinot] chenboat commented on a diff in pull request #11604: JSON index: Add support for ignoring values longer than a given length.

2023-09-28 Thread via GitHub
chenboat commented on code in PR #11604: URL: https://github.com/apache/pinot/pull/11604#discussion_r1340745847 ## pinot-segment-local/src/test/java/org/apache/pinot/segment/local/segment/index/JsonIndexTest.java: ## @@ -233,6 +210,83 @@ public void testLargeIndex() } }

[pinot] branch master updated: add ignored test capability (#11703)

2023-09-28 Thread rongr
This is an automated email from the ASF dual-hosted git repository. rongr pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/pinot.git The following commit(s) were added to refs/heads/master by this push: new 15988f6016 add ignored test capability (#11703) 15

[GitHub] [pinot] walterddr merged pull request #11703: [multistage][test] add ignored test capability

2023-09-28 Thread via GitHub
walterddr merged PR #11703: URL: https://github.com/apache/pinot/pull/11703

[GitHub] [pinot] jasperjiaguo commented on issue #11631: Documentation for argmax/argmin is inconsistent with actual function name exprmax/exprmin

2023-09-28 Thread via GitHub
jasperjiaguo commented on issue #11631: URL: https://github.com/apache/pinot/issues/11631#issuecomment-1740106821 fixed https://github.com/apache/pinot/pull/11700 https://docs.pinot.apache.org/configuration-reference/functions/arg_min-arg_max

[GitHub] [pinot] codecov-commenter commented on pull request #11708: Add more information in RequestContext class

2023-09-28 Thread via GitHub
codecov-commenter commented on PR #11708: URL: https://github.com/apache/pinot/pull/11708#issuecomment-1740068915 ## [Codecov](https://app.codecov.io/gh/apache/pinot/pull/11708?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache) R

[GitHub] [pinot] tibrewalpratik17 commented on a diff in pull request #11705: Add more test for broker jersey bounded thread pool

2023-09-28 Thread via GitHub
tibrewalpratik17 commented on code in PR #11705: URL: https://github.com/apache/pinot/pull/11705#discussion_r1340704103 ## pinot-broker/src/test/java/org/apache/pinot/broker/broker/BrokerManagedAsyncExecutorProviderTest.java: ## @@ -92,6 +94,23 @@ public void testGet() Bl

[GitHub] [pinot] Jackie-Jiang commented on a diff in pull request #11705: Add more test for broker jersey bounded thread pool

2023-09-28 Thread via GitHub
Jackie-Jiang commented on code in PR #11705: URL: https://github.com/apache/pinot/pull/11705#discussion_r1340700894 ## pinot-broker/src/test/java/org/apache/pinot/broker/broker/BrokerManagedAsyncExecutorProviderTest.java: ## @@ -92,6 +94,23 @@ public void testGet() Blocki

[GitHub] [pinot] Jackie-Jiang commented on issue #11704: Failed to query string column in noDictionaryColumns

2023-09-28 Thread via GitHub
Jackie-Jiang commented on issue #11704: URL: https://github.com/apache/pinot/issues/11704#issuecomment-1740050114 Unfortunately I cannot tell what is causing the decompression failure by the stack trace itself. To debug the issue, we need to dig into the problematic segment and find the

[GitHub] [pinot] siddharthteotia commented on pull request #11699: Make ResourceLimitPolicy functions public to be used from scheduler plugins.

2023-09-28 Thread via GitHub
siddharthteotia commented on PR #11699: URL: https://github.com/apache/pinot/pull/11699#issuecomment-1740042082 @Jackie-Jiang - @vvivekiyer and I will provide you with the context. Will setup something soon. TLDR - We have started thinking about the scheduling sub-problem described

[GitHub] [pinot] Jackie-Jiang merged pull request #11695: use SlidingTimeWindowArrayReservoir instead of default ExponentiallyDecayingReservoir in dropwizard metrics

2023-09-28 Thread via GitHub
Jackie-Jiang merged PR #11695: URL: https://github.com/apache/pinot/pull/11695

[pinot] branch master updated: refactor code and doc occurrences of argmin/max -> exprmin/max (#11700)

2023-09-28 Thread jlli
This is an automated email from the ASF dual-hosted git repository. jlli pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/pinot.git The following commit(s) were added to refs/heads/master by this push: new 81e1143f39 refactor code and doc occurrences of arg

[pinot] branch master updated: use SlidingTimeWindowArrayReservoir instead of default ExponentiallyDecayingReservoir in dropwizard metrics (#11695)

2023-09-28 Thread jackie
This is an automated email from the ASF dual-hosted git repository. jackie pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/pinot.git The following commit(s) were added to refs/heads/master by this push: new c642db0666 use SlidingTimeWindowArrayReservoir in

[GitHub] [pinot] jackjlli merged pull request #11700: refactor code and doc occurrences of argmin/max -> exprmin/max

2023-09-28 Thread via GitHub
jackjlli merged PR #11700: URL: https://github.com/apache/pinot/pull/11700

[GitHub] [pinot] jackjlli commented on a diff in pull request #11574: Fix schema name in table config during controller startup

2023-09-28 Thread via GitHub
jackjlli commented on code in PR #11574: URL: https://github.com/apache/pinot/pull/11574#discussion_r1340691844 ## pinot-controller/src/main/java/org/apache/pinot/controller/BaseControllerStarter.java: ## @@ -549,6 +555,69 @@ protected void configure() { _serviceStatusCall

[GitHub] [pinot] Jackie-Jiang commented on pull request #11699: Make ResourceLimitPolicy functions public to be used from scheduler plugins.

2023-09-28 Thread via GitHub
Jackie-Jiang commented on PR #11699: URL: https://github.com/apache/pinot/pull/11699#issuecomment-1740035068 @vvivekiyer I don't know the exact race condition, but the test in `PrioritySchedulerTest` was disabled due to a race condition. Use this class with caution because it is not verified

[GitHub] [pinot] codecov-commenter commented on pull request #11707: correct default log4j2 config file path in helm chart

2023-09-28 Thread via GitHub
codecov-commenter commented on PR #11707: URL: https://github.com/apache/pinot/pull/11707#issuecomment-1740034768 ## [Codecov](https://app.codecov.io/gh/apache/pinot/pull/11707?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache) R

[GitHub] [pinot] tibrewalpratik17 opened a new pull request, #11708: Add more information in RequestContext class

2023-09-28 Thread via GitHub
tibrewalpratik17 opened a new pull request, #11708: URL: https://github.com/apache/pinot/pull/11708 As per discussion in https://github.com/apache/pinot/pull/11437#discussion_r1326698944 adding more information to `RequestContext` class. Most of the fields already existed in `BrokerRespons

[GitHub] [pinot] swaminathanmanish commented on a diff in pull request #11650: Support omitting time values in SimpleSegmentNameGenerator

2023-09-28 Thread via GitHub
swaminathanmanish commented on code in PR #11650: URL: https://github.com/apache/pinot/pull/11650#discussion_r1340683133 ## pinot-plugins/pinot-batch-ingestion/pinot-batch-ingestion-common/src/main/java/org/apache/pinot/plugin/ingestion/batch/common/SegmentGenerationTaskRunner.java: ###

[pinot] branch master updated (9ead3ff5bc -> d78682e291)

2023-09-28 Thread siddteotia
This is an automated email from the ASF dual-hosted git repository. siddteotia pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/pinot.git from 9ead3ff5bc Upgrade Avro dependency to 1.10.2 (#11698) add d78682e291 Make constructor and functions public to

[GitHub] [pinot] siddharthteotia merged pull request #11699: Make ResourceLimitPolicy functions public to be used from scheduler plugins.

2023-09-28 Thread via GitHub
siddharthteotia merged PR #11699: URL: https://github.com/apache/pinot/pull/11699

[pinot] branch master updated (cf8fd93759 -> 9ead3ff5bc)

2023-09-28 Thread sajjad
This is an automated email from the ASF dual-hosted git repository. sajjad pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/pinot.git from cf8fd93759 MultiValue VarByte V4 index writer and consolidate V4 reader for all types (#11674) add 9ead3ff5bc Upg

[GitHub] [pinot] sajjad-moradi merged pull request #11698: Upgrade Avro dependency to 1.10.2

2023-09-28 Thread via GitHub
sajjad-moradi merged PR #11698: URL: https://github.com/apache/pinot/pull/11698

[GitHub] [pinot] zhtaoxiang opened a new pull request, #11707: correct default log4j2 config file path in helm chart

2023-09-28 Thread via GitHub
zhtaoxiang opened a new pull request, #11707: URL: https://github.com/apache/pinot/pull/11707 label: `bugfix`

[GitHub] [pinot] walterddr commented on issue #10381: [multistage] Null Check Predicates Not Pushed Down

2023-09-28 Thread via GitHub
walterddr commented on issue #10381: URL: https://github.com/apache/pinot/issues/10381#issuecomment-1739978297 CC @gortiz who is also interested in null handling work

[GitHub] [pinot] walterddr closed issue #11691: [multistage] IS NULL/ IS NOT NULL getting skipped from the queries using multistage

2023-09-28 Thread via GitHub
walterddr closed issue #11691: [multistage] IS NULL/ IS NOT NULL getting skipped from the queries using multistage URL: https://github.com/apache/pinot/issues/11691

[GitHub] [pinot] walterddr commented on issue #11691: [multistage] IS NULL/ IS NOT NULL getting skipped from the queries using multistage

2023-09-28 Thread via GitHub
walterddr commented on issue #11691: URL: https://github.com/apache/pinot/issues/11691#issuecomment-1739977638 closing this as duplicate of #10381

[GitHub] [pinot] walterddr commented on pull request #10423: [multistage] proper support for column-level null handling

2023-09-28 Thread via GitHub
walterddr commented on PR #10423: URL: https://github.com/apache/pinot/pull/10423#issuecomment-1739975940 closing this one and creating a new one (design doc shared in #10381)

[GitHub] [pinot] walterddr closed pull request #10423: [multistage] proper support for column-level null handling

2023-09-28 Thread via GitHub
walterddr closed pull request #10423: [multistage] proper support for column-level null handling URL: https://github.com/apache/pinot/pull/10423

[GitHub] [pinot] walterddr commented on issue #10381: [multistage] Null Check Predicates Not Pushed Down

2023-09-28 Thread via GitHub
walterddr commented on issue #10381: URL: https://github.com/apache/pinot/issues/10381#issuecomment-1739977403 here is a quick design doc: https://docs.google.com/document/d/16slt8TEfhJjDCjbE7K6za9hrB5WagiuKscFzod2Oib8/edit TL;DR similar to the POC in #10423 and following the comme

[GitHub] [pinot] codecov-commenter commented on pull request #11705: Add more test for broker jersey bounded thread pool

2023-09-28 Thread via GitHub
codecov-commenter commented on PR #11705: URL: https://github.com/apache/pinot/pull/11705#issuecomment-1739940414 ## [Codecov](https://app.codecov.io/gh/apache/pinot/pull/11705?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache) R

[GitHub] [pinot] Jackie-Jiang opened a new issue, #11706: Group-by without ordering on aggregate

2023-09-28 Thread via GitHub
Jackie-Jiang opened a new issue, #11706: URL: https://github.com/apache/pinot/issues/11706 When a group-by query does not have order-by on the aggregate column, we don't need to keep more groups than the LIMIT because the order-by value won't change. We can maintain a heap (PriorityQueue) o
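One possible reading of the bounded-group idea in this issue, sketched under stated assumptions: the query orders by the group key ascending (not by the aggregate), and a max-heap of at most LIMIT keys decides which groups are kept. The class and method names are hypothetical and this is not Pinot's group-by implementation. Because a group's key never changes as rows are aggregated, a group that has been evicted or rejected can never re-enter the result.

```java
import java.util.Comparator;
import java.util.HashMap;
import java.util.Map;
import java.util.PriorityQueue;

public class BoundedGroupBy {
  // Sketch of: SELECT key, SUM(value) ... GROUP BY key ORDER BY key ASC LIMIT limit
  // Keeps at most `limit` groups in memory at any time.
  public static Map<String, Long> sumByKey(String[] keys, long[] values, int limit) {
    Map<String, Long> sums = new HashMap<>();
    if (limit <= 0) {
      return sums;
    }
    // Max-heap: the head is the largest key currently kept, i.e. the next to evict.
    PriorityQueue<String> kept = new PriorityQueue<>(Comparator.reverseOrder());
    for (int i = 0; i < keys.length; i++) {
      String key = keys[i];
      Long current = sums.get(key);
      if (current != null) {
        // Group already tracked: just update its aggregate.
        sums.put(key, current + values[i]);
      } else if (kept.size() < limit) {
        kept.add(key);
        sums.put(key, values[i]);
      } else if (key.compareTo(kept.peek()) < 0) {
        // New key sorts before the largest kept key: evict that group.
        sums.remove(kept.poll());
        kept.add(key);
        sums.put(key, values[i]);
      }
      // Otherwise the key sorts after every kept key and can be dropped.
    }
    return sums;
  }

  public static void main(String[] args) {
    String[] keys = {"d", "a", "c", "a", "b", "d"};
    long[] values = {1, 2, 3, 4, 5, 6};
    System.out.println(sumByKey(keys, values, 2));
  }
}
```

The same trimming would not be safe when ordering by the aggregate itself, since a group's aggregate can still change after it has been dropped.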

[GitHub] [pinot] tibrewalpratik17 opened a new pull request, #11705: Add more test for broker jersey bounded thread pool

2023-09-28 Thread via GitHub
tibrewalpratik17 opened a new pull request, #11705: URL: https://github.com/apache/pinot/pull/11705 label - `testing` This is a small patch to add more unit tests for broker managed async executor provider class.

[GitHub] [pinot] Jackie-Jiang closed issue #8697: Proper NULL value support in Pinot

2023-09-28 Thread via GitHub
Jackie-Jiang closed issue #8697: Proper NULL value support in Pinot URL: https://github.com/apache/pinot/issues/8697

[GitHub] [pinot] rohityadav1993 commented on issue #11704: Failed to query string column in noDictionaryColumns

2023-09-28 Thread via GitHub
rohityadav1993 commented on issue #11704: URL: https://github.com/apache/pinot/issues/11704#issuecomment-1739784705 I do have the raw string that is being persisted. Is there a way to figure out which special character could be causing this?

[GitHub] [pinot] vvivekiyer commented on pull request #11699: Make ResourceLimitPolicy functions public to be used from scheduler plugins.

2023-09-28 Thread via GitHub
vvivekiyer commented on PR #11699: URL: https://github.com/apache/pinot/pull/11699#issuecomment-1739782509 @Jackie-Jiang We are not using the PriorityBasedSchedulers. Just curious - can you point me to the race condition? About this change - We have a use case in LinkedIn where w

[GitHub] [pinot] Jackie-Jiang merged pull request #11674: MultiValue VarByte V4 index writer and consolidate V4 reader for all types

2023-09-28 Thread via GitHub
Jackie-Jiang merged PR #11674: URL: https://github.com/apache/pinot/pull/11674

[pinot] branch master updated: MultiValue VarByte V4 index writer and consolidate V4 reader for all types (#11674)

2023-09-28 Thread jackie
This is an automated email from the ASF dual-hosted git repository. jackie pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/pinot.git The following commit(s) were added to refs/heads/master by this push: new cf8fd93759 MultiValue VarByte V4 index writer and

[GitHub] [pinot] codecov-commenter commented on pull request #11703: [multistage][test] add ignored test capability

2023-09-28 Thread via GitHub
codecov-commenter commented on PR #11703: URL: https://github.com/apache/pinot/pull/11703#issuecomment-1739764974 ## [Codecov](https://app.codecov.io/gh/apache/pinot/pull/11703?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache) R

[GitHub] [pinot] Jackie-Jiang commented on issue #11704: Failed to query string column in noDictionaryColumns

2023-09-28 Thread via GitHub
Jackie-Jiang commented on issue #11704: URL: https://github.com/apache/pinot/issues/11704#issuecomment-1739758632 Do you have the raw data used to generate the segment? From what I can tell from the stack trace, it might hit a bug in the snappy library. Do you have special characters in your

[GitHub] [pinot] Jackie-Jiang commented on a diff in pull request #11698: Upgrade Avro dependency to 1.10.2

2023-09-28 Thread via GitHub
Jackie-Jiang commented on code in PR #11698: URL: https://github.com/apache/pinot/pull/11698#discussion_r1340481197 ## pinot-plugins/pinot-input-format/pinot-avro-base/src/main/java/org/apache/pinot/plugin/inputformat/avro/AvroRecordExtractor.java: ## @@ -73,10 +73,12 @@ public

[GitHub] [pinot] Jackie-Jiang commented on a diff in pull request #11698: Upgrade Avro dependency to 1.10.2

2023-09-28 Thread via GitHub
Jackie-Jiang commented on code in PR #11698: URL: https://github.com/apache/pinot/pull/11698#discussion_r1340479513 ## pinot-plugins/pinot-input-format/pinot-avro-base/src/main/java/org/apache/pinot/plugin/inputformat/avro/AvroRecordExtractor.java: ## @@ -74,7 +74,7 @@ public Ge

[GitHub] [pinot] Jackie-Jiang commented on pull request #11575: Add metrics to export netty direct memory used and max

2023-09-28 Thread via GitHub
Jackie-Jiang commented on PR #11575: URL: https://github.com/apache/pinot/pull/11575#issuecomment-1739742639 @siddharthteotia Did Gonzalo address your concern?

[GitHub] [pinot] Jackie-Jiang commented on a diff in pull request #11575: Add metrics to export netty direct memory used and max

2023-09-28 Thread via GitHub
Jackie-Jiang commented on code in PR #11575: URL: https://github.com/apache/pinot/pull/11575#discussion_r1340470598 ## pinot-core/src/test/java/org/apache/pinot/core/transport/QueryRoutingTest.java: ## @@ -77,9 +78,16 @@ public void setUp() { _requestCount = 0; } + @A

[GitHub] [pinot] sajjad-moradi commented on a diff in pull request #11698: Upgrade Avro dependency to 1.10.2

2023-09-28 Thread via GitHub
sajjad-moradi commented on code in PR #11698: URL: https://github.com/apache/pinot/pull/11698#discussion_r1340470864 ## pinot-plugins/pinot-input-format/pinot-avro-base/src/main/java/org/apache/pinot/plugin/inputformat/avro/AvroRecordExtractor.java: ## @@ -74,7 +74,7 @@ public G

[GitHub] [pinot] sajjad-moradi commented on a diff in pull request #11698: Upgrade Avro dependency to 1.10.2

2023-09-28 Thread via GitHub
sajjad-moradi commented on code in PR #11698: URL: https://github.com/apache/pinot/pull/11698#discussion_r1340461420 ## pinot-plugins/pinot-input-format/pinot-avro-base/src/main/java/org/apache/pinot/plugin/inputformat/avro/AvroRecordExtractor.java: ## @@ -74,7 +74,7 @@ public G

[GitHub] [pinot] Jackie-Jiang commented on a diff in pull request #11698: Upgrade Avro dependency to 1.10.2

2023-09-28 Thread via GitHub
Jackie-Jiang commented on code in PR #11698: URL: https://github.com/apache/pinot/pull/11698#discussion_r1340446845 ## pinot-plugins/pinot-input-format/pinot-avro-base/src/main/java/org/apache/pinot/plugin/inputformat/avro/AvroRecordExtractor.java: ## @@ -74,7 +74,7 @@ public Ge

[GitHub] [pinot] rohityadav1993 opened a new issue, #11704: Failed to query string column in noDictionaryColumns

2023-09-28 Thread via GitHub
rohityadav1993 opened a new issue, #11704: URL: https://github.com/apache/pinot/issues/11704 This happens only for a few rows and only in offline segments. The column is single value string with raw forward encoding ``` ... ... "noDictionaryColumns": [ "compresse

[GitHub] [pinot] Jackie-Jiang commented on a diff in pull request #11698: Upgrade Avro dependency to 1.10.2

2023-09-28 Thread via GitHub
Jackie-Jiang commented on code in PR #11698: URL: https://github.com/apache/pinot/pull/11698#discussion_r1340444844 ## pinot-plugins/pinot-input-format/pinot-avro-base/src/main/java/org/apache/pinot/plugin/inputformat/avro/AvroRecordExtractor.java: ## @@ -74,7 +74,7 @@ public Ge

[GitHub] [pinot] walterddr opened a new pull request, #11703: [multistage][test] add ignored test capability

2023-09-28 Thread via GitHub
walterddr opened a new pull request, #11703: URL: https://github.com/apache/pinot/pull/11703 - add flag to run ignored test cases - doesn't support running ignored tables b/c we can't load those tables during `setUp` - fix several ignored tests that we already support now.

[GitHub] [pinot] jugomezv commented on issue #11632: Reduce the number of threads for `IngestionDelayTimerThread`

2023-09-28 Thread via GitHub
jugomezv commented on issue #11632: URL: https://github.com/apache/pinot/issues/11632#issuecomment-1739650460 Yes, this should be an easy change. I had this as a follow-up but have other tasks at work now. @snleee can we let @cypherean take a stab? Keep me in the loop for reviews

[GitHub] [pinot] walterddr commented on a diff in pull request #11404: Multi stage int tests

2023-09-28 Thread via GitHub
walterddr commented on code in PR #11404: URL: https://github.com/apache/pinot/pull/11404#discussion_r1340340485 ## pinot-integration-test-base/src/test/java/org/apache/pinot/integration/tests/ClusterTest.java: ## @@ -572,4 +573,10 @@ public Object[][] useV2QueryEngine() {

[GitHub] [pinot] saurabhd336 commented on a diff in pull request #11674: MultiValue VarByte V4 index writer and consolidate V4 reader for all types

2023-09-28 Thread via GitHub
saurabhd336 commented on code in PR #11674: URL: https://github.com/apache/pinot/pull/11674#discussion_r1340339841 ## pinot-segment-local/src/main/java/org/apache/pinot/segment/local/segment/index/forward/ForwardIndexReaderFactory.java: ## @@ -81,13 +81,17 @@ public static Forwa

[GitHub] [pinot] saurabhd336 commented on a diff in pull request #11674: MultiValue VarByte V4 index writer and consolidate V4 reader for all types

2023-09-28 Thread via GitHub
saurabhd336 commented on code in PR #11674: URL: https://github.com/apache/pinot/pull/11674#discussion_r1340323638 ## pinot-segment-local/src/main/java/org/apache/pinot/segment/local/segment/index/readers/forward/VarByteChunkForwardIndexReaderV4.java: ## @@ -113,6 +116,156 @@ pu

[GitHub] [pinot] saurabhd336 commented on a diff in pull request #11674: MultiValue VarByte V4 index writer and consolidate V4 reader for all types

2023-09-28 Thread via GitHub
saurabhd336 commented on code in PR #11674: URL: https://github.com/apache/pinot/pull/11674#discussion_r1340325494 ## pinot-segment-local/src/main/java/org/apache/pinot/segment/local/segment/index/readers/forward/VarByteChunkForwardIndexReaderV4.java: ## @@ -113,6 +116,156 @@ pu

[GitHub] [pinot] saurabhd336 commented on a diff in pull request #11674: MultiValue VarByte V4 index writer and consolidate V4 reader for all types

2023-09-28 Thread via GitHub
saurabhd336 commented on code in PR #11674: URL: https://github.com/apache/pinot/pull/11674#discussion_r1340322049 ## pinot-segment-local/src/main/java/org/apache/pinot/segment/local/segment/index/readers/forward/VarByteChunkForwardIndexReaderV4.java: ## @@ -113,6 +116,156 @@ pu

[GitHub] [pinot] saurabhd336 commented on a diff in pull request #11674: MultiValue VarByte V4 index writer and consolidate V4 reader for all types

2023-09-28 Thread via GitHub
saurabhd336 commented on code in PR #11674: URL: https://github.com/apache/pinot/pull/11674#discussion_r1340320116 ## pinot-segment-local/src/main/java/org/apache/pinot/segment/local/io/writer/impl/VarByteChunkForwardIndexWriterV4.java: ## @@ -142,6 +144,51 @@ public void putByt

[GitHub] [pinot] saurabhd336 commented on a diff in pull request #11674: MultiValue VarByte V4 index writer and consolidate V4 reader for all types

2023-09-28 Thread via GitHub
saurabhd336 commented on code in PR #11674: URL: https://github.com/apache/pinot/pull/11674#discussion_r1340319747 ## pinot-segment-local/src/main/java/org/apache/pinot/segment/local/io/writer/impl/VarByteChunkForwardIndexWriterV4.java: ## @@ -142,6 +144,51 @@ public void putByt

[GitHub] [pinot] codecov-commenter commented on pull request #11702: [Draft] Upgrade spark from 3.2 to 3.5

2023-09-28 Thread via GitHub
codecov-commenter commented on PR #11702: URL: https://github.com/apache/pinot/pull/11702#issuecomment-1738831483 ## [Codecov](https://app.codecov.io/gh/apache/pinot/pull/11702?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache) R

[GitHub] [pinot] gortiz opened a new pull request, #11702: [Draft] Upgrade spark from 3.2 to 3.5

2023-09-28 Thread via GitHub
gortiz opened a new pull request, #11702: URL: https://github.com/apache/pinot/pull/11702 This PR tries to blindly update from Spark 3.2.4 to 3.5.0. See https://github.com/apache/pinot/issues/11701

[GitHub] [pinot] gortiz opened a new issue, #11701: Upgrade to Spark 3.5.0

2023-09-28 Thread via GitHub
gortiz opened a new issue, #11701: URL: https://github.com/apache/pinot/issues/11701 This issue is created to analyze whether we can upgrade our Spark 3.x dependencies to 3.5.0. Apache Pinot connects to Apache Spark in two different types of modules: - batch ingestion plugins (modu

[GitHub] [pinot] gortiz commented on pull request #11695: use SlidingTimeWindowArrayReservoir instead of default ExponentiallyDecayingReservoir in dropwizard metrics

2023-09-28 Thread via GitHub
gortiz commented on PR #11695: URL: https://github.com/apache/pinot/pull/11695#issuecomment-1738615579 > Thanks for raising this and providing the readings! Curious how you decide to configure the sliding window to be 15 minutes? What is the side effect of that? I've just decided to