mocobeta opened a new issue, #36:
URL: https://github.com/apache/lucene-jira-archive/issues/36
`jira2markdown_import.py` is single-threaded and it takes several hours to
convert all Jira issues.
I think it'd be easy to parallelize this with
[multiprocessing](https://docs.python.org/3/libr
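
A minimal sketch of what such a parallelization could look like, assuming each issue can be converted independently of the others. `convert_issue`, `convert_all`, and the issue-number range are hypothetical stand-ins for illustration, not functions from the actual script.

```python
from multiprocessing import Pool


def convert_issue(issue_number: int) -> str:
    # Hypothetical stand-in for the real per-issue conversion work.
    return f"LUCENE-{issue_number}: converted"


def convert_all(issue_numbers, workers: int = 4):
    # Each worker process converts a share of the issues;
    # Pool.map returns the results in input order.
    with Pool(processes=workers) as pool:
        return pool.map(convert_issue, issue_numbers, chunksize=16)


if __name__ == "__main__":
    results = convert_all(range(1, 101))
    print(len(results), results[0])
```

Since the work is CPU-bound parsing, processes rather than threads are the natural fit here; the `chunksize` value is an arbitrary choice that would need tuning.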
mocobeta commented on PR #33:
URL:
https://github.com/apache/lucene-jira-archive/pull/33#issuecomment-1181456586
I'm also converting the whole Jira issue myself; it looks like it takes
several hours... (recent changes to fix conversion errors could affect the
conversion speed I think). Thi
mocobeta commented on issue #36:
URL:
https://github.com/apache/lucene-jira-archive/issues/36#issuecomment-1181497062
I found https://pypi.org/project/multiprocessing-logging/, but this works
only on Linux.
--
This is an automated message from the Apache Git Service.
To respond to th
[
https://issues.apache.org/jira/browse/LUCENE-10480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565375#comment-17565375
]
Adrien Grand commented on LUCENE-10480:
---
+1 to explore this in a separate issue.
[
https://issues.apache.org/jira/browse/LUCENE-10600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Adrien Grand updated LUCENE-10600:
--
Fix Version/s: 9.3
> SortedSetDocValues#docValueCount should be an int, not long
> --
mocobeta commented on issue #38:
URL:
https://github.com/apache/lucene-jira-archive/issues/38#issuecomment-1181605666
Thank you for opening this.
While the stack overflow is rare, this recursion in parsing also causes a
significant slowdown in conversion.
I'm sure the root cause o
jpountz commented on code in PR #966:
URL: https://github.com/apache/lucene/pull/966#discussion_r918804129
##
lucene/core/src/java/org/apache/lucene/index/TermsHashPerField.java:
##
@@ -230,9 +230,29 @@ final void writeByte(int stream, byte b) {
}
final void writeBytes(i
jpountz commented on code in PR #1003:
URL: https://github.com/apache/lucene/pull/1003#discussion_r918758391
##
lucene/core/src/java/org/apache/lucene/codecs/compressing/Decompressor.java:
##
@@ -42,6 +44,13 @@ protected Decompressor() {}
public abstract void decompress(
mikemccand commented on PR #33:
URL:
https://github.com/apache/lucene-jira-archive/pull/33#issuecomment-1181821682
I'm closing this messed up PR -- I rebooted it into #40.
mikemccand opened a new pull request, #40:
URL: https://github.com/apache/lucene-jira-archive/pull/40
I "rebooted" my PR by downloading the diff off the messed up #33 PR, futzing
it locally, applying, resolving conflicts. Messy messy. I'll try to more
carefully manage the git merging step
mikemccand closed pull request #33: Polish wording of Legacy Jira details
header, and each comment footer
URL: https://github.com/apache/lucene-jira-archive/pull/33
mayya-sharipova commented on code in PR #992:
URL: https://github.com/apache/lucene/pull/992#discussion_r919288022
##
lucene/core/src/java/org/apache/lucene/codecs/KnnVectorsWriter.java:
##
@@ -24,28 +24,40 @@
import org.apache.lucene.index.DocIDMerger;
import org.apache.lucen
[
https://issues.apache.org/jira/browse/LUCENE-10603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565402#comment-17565402
]
Adrien Grand commented on LUCENE-10603:
---
+1
> Improve iteration of ords for Sort
mayya-sharipova commented on code in PR #992:
URL: https://github.com/apache/lucene/pull/992#discussion_r919288022
##
lucene/core/src/java/org/apache/lucene/codecs/KnnVectorsWriter.java:
##
@@ -24,28 +24,40 @@
import org.apache.lucene.index.DocIDMerger;
import org.apache.lucen
mocobeta commented on PR #33:
URL:
https://github.com/apache/lucene-jira-archive/pull/33#issuecomment-1181624324
> Thanks -- I was beginning to wonder if it was normal how long it was
taking ;)
Of course it's not normal; I remember it took two or three hours to convert
the whole Jir
[
https://issues.apache.org/jira/browse/LUCENE-10628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565882#comment-17565882
]
Ignacio Vera commented on LUCENE-10628:
---
I have mainly worked with two types of tr
mocobeta opened a new pull request, #39:
URL: https://github.com/apache/lucene-jira-archive/pull/39
Close #38
This ad-hoc patch fixes `'maximum recursion depth exceeded'` error, and also
makes the script a bit faster. (8h -> 5h)
[
https://issues.apache.org/jira/browse/LUCENE-10649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565885#comment-17565885
]
Adrien Grand commented on LUCENE-10649:
---
Good catch [~vigyas], it looks related i
mayya-sharipova commented on code in PR #992:
URL: https://github.com/apache/lucene/pull/992#discussion_r919332844
##
lucene/core/src/java/org/apache/lucene/codecs/perfield/PerFieldKnnVectorsFormat.java:
##
@@ -102,9 +104,22 @@ private class FieldsWriter extends KnnVectorsWriter
luyuncheng commented on PR #987:
URL: https://github.com/apache/lucene/pull/987#issuecomment-1181632413
> Would it be possible to remove all `CompressionMode#compress` variants
that take a `byte[]` now that you introduced a new method that takes a
`ByteBuffersDataInput`?
>
> Also may
[
https://issues.apache.org/jira/browse/LUCENE-10471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565468#comment-17565468
]
Michael Sokolov commented on LUCENE-10471:
--
We should not be imposing an arbit
luyuncheng commented on code in PR #987:
URL: https://github.com/apache/lucene/pull/987#discussion_r918848057
##
lucene/core/src/java/org/apache/lucene/codecs/compressing/CompressionMode.java:
##
@@ -257,9 +270,13 @@ private static class DeflateCompressor extends Compressor {
mayya-sharipova commented on code in PR #992:
URL: https://github.com/apache/lucene/pull/992#discussion_r919288022
##
lucene/core/src/java/org/apache/lucene/codecs/KnnVectorsWriter.java:
##
@@ -24,28 +24,40 @@
import org.apache.lucene.index.DocIDMerger;
import org.apache.lucen
mayya-sharipova commented on code in PR #992:
URL: https://github.com/apache/lucene/pull/992#discussion_r919343914
##
lucene/core/src/java/org/apache/lucene/codecs/lucene93/Lucene93HnswVectorsWriter.java:
##
@@ -266,65 +470,128 @@ private void writeMeta(
}
}
- private
mocobeta commented on issue #38:
URL:
https://github.com/apache/lucene-jira-archive/issues/38#issuecomment-1181776008
I opened #39. I cannot really explain _why the ad-hoc fix works_, but it
works. I think there should be a better way, though this would be sufficient for
the one-time batch.
mayya-sharipova commented on code in PR #992:
URL: https://github.com/apache/lucene/pull/992#discussion_r919349095
##
lucene/core/src/java/org/apache/lucene/index/VectorValuesWriter.java:
##
@@ -26,233 +26,153 @@
import org.apache.lucene.codecs.KnnVectorsWriter;
import org.apa
[
https://issues.apache.org/jira/browse/LUCENE-10653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565380#comment-17565380
]
Adrien Grand commented on LUCENE-10653:
---
+1 to doing a bulk heapify
The fact tha
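
For readers unfamiliar with the term, a minimal Python sketch of the general idea behind a bulk heapify, using the standard `heapq` module. This is only an illustration of the technique, not Lucene's Java code; the function names are hypothetical.

```python
import heapq


def build_by_pushes(values):
    # Insert one element at a time: O(n log n) in the worst case.
    heap = []
    for v in values:
        heapq.heappush(heap, v)
    return heap


def build_by_heapify(values):
    # Build the heap in one pass over the whole list: O(n).
    heap = list(values)
    heapq.heapify(heap)
    return heap


if __name__ == "__main__":
    vals = [5, 3, 8, 1, 9, 2]
    # Both approaches yield a valid min-heap with the same root.
    print(build_by_pushes(vals)[0], build_by_heapify(vals)[0])
```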
mayya-sharipova commented on PR #992:
URL: https://github.com/apache/lucene/pull/992#issuecomment-1182388563
@jtibshirani @jpountz Thanks for your review. I've tried to address your
comments, but it looks like we are still not clear how to organize `merge` and
`flush` methods. Would be nic
mayya-sharipova commented on code in PR #992:
URL: https://github.com/apache/lucene/pull/992#discussion_r919332844
##
lucene/core/src/java/org/apache/lucene/codecs/perfield/PerFieldKnnVectorsFormat.java:
##
@@ -102,9 +104,22 @@ private class FieldsWriter extends KnnVectorsWriter
mikemccand commented on issue #38:
URL:
https://github.com/apache/lucene-jira-archive/issues/38#issuecomment-1181644356
> I'm trying to find other ways that do not cause infinite recursion while
parsing lists correctly.
Awesome, thanks @mocobeta!
jpountz commented on PR #907:
URL: https://github.com/apache/lucene/pull/907#issuecomment-1181518177
@shahrs87 Can you look into removing all other instances of `terms ==
Terms.EMPTY` or `terms != Terms.EMPTY` as well? To do this while keeping tests
passing, I think you'll need to create em
mikemccand merged PR #40:
URL: https://github.com/apache/lucene-jira-archive/pull/40
[
https://issues.apache.org/jira/browse/LUCENE-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565914#comment-17565914
]
Michael Sokolov commented on LUCENE-10577:
--
It would be nice if we could make
mikemccand commented on PR #33:
URL:
https://github.com/apache/lucene-jira-archive/pull/33#issuecomment-1181587514
> I'm also converting the whole Jira issue myself; it looks like it takes
several hours... (recent changes to fix conversion errors could affect the
conversion speed I think).
mikemccand commented on PR #33:
URL:
https://github.com/apache/lucene-jira-archive/pull/33#issuecomment-1181586644
> Sorry there should have been a "catch all" try~except clause. I made a
quick fix in #35.
No worries at all! No need to apologize!
mocobeta commented on issue #36:
URL:
https://github.com/apache/lucene-jira-archive/issues/36#issuecomment-1181522090
https://docs.python.org/3/howto/logging-cookbook.html#logging-to-a-single-file-from-multiple-processes
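
A rough sketch of the queue-based approach that cookbook section describes, using only the standard library: workers send records to a shared queue and a single listener in the parent process writes the file. The logger name, `worker`, `run`, and the log path are hypothetical, chosen for illustration.

```python
import logging
import logging.handlers
import multiprocessing


def worker(queue, issue_number: int) -> None:
    # In each worker, route records to the shared queue instead of a file.
    logger = logging.getLogger("converter")
    logger.handlers.clear()
    logger.addHandler(logging.handlers.QueueHandler(queue))
    logger.setLevel(logging.INFO)
    logger.info("converted issue %d", issue_number)


def run(log_path: str, issue_numbers) -> None:
    queue = multiprocessing.Queue()
    # Only the listener, running in the parent process, touches the file.
    file_handler = logging.FileHandler(log_path)
    listener = logging.handlers.QueueListener(queue, file_handler)
    listener.start()
    procs = [
        multiprocessing.Process(target=worker, args=(queue, n))
        for n in issue_numbers
    ]
    for p in procs:
        p.start()
    for p in procs:
        p.join()
    listener.stop()
    file_handler.close()


if __name__ == "__main__":
    run("conversion.log", [1, 2, 3])
```

Because only one process writes the file, this avoids interleaved or lost lines without relying on a platform-specific package.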
mikemccand commented on PR #33:
URL:
https://github.com/apache/lucene-jira-archive/pull/33#issuecomment-1181589626
> It looks like a bug introduced in
[cfbc821](https://github.com/apache/lucene-jira-archive/commit/cfbc821390859a7053e43028325b6bc616ec2b5b).
(I have postponed testing it with
mikemccand commented on PR #33:
URL:
https://github.com/apache/lucene-jira-archive/pull/33#issuecomment-1181586767
And thank you for the quick fix!
mikemccand opened a new issue, #37:
URL: https://github.com/apache/lucene-jira-archive/issues/37
Spinoff from #33.
This is not a blocker for migration, more because I'm curious how Jira lost
issues and how pervasive this problem might be -- maybe other Apache projects
are affected?
jpountz commented on PR #987:
URL: https://github.com/apache/lucene/pull/987#issuecomment-1181718918
> if we only use the compress method variant that takes ByteBuffersDataInput in
LUCENE90, we cannot use the abstract method Compressor.compress when we want to
use another compression mode.
I
[
https://issues.apache.org/jira/browse/LUCENE-10650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565384#comment-17565384
]
Adrien Grand commented on LUCENE-10650:
---
{{query.boost}} is the {{query.getBoost(
[
https://issues.apache.org/jira/browse/LUCENE-10628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565944#comment-17565944
]
Marc D'Mello commented on LUCENE-10628:
---
Thanks for taking a look! As for the ans
[
https://issues.apache.org/jira/browse/LUCENE-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565919#comment-17565919
]
Julie Tibshirani commented on LUCENE-10577:
---
I wasn't suggesting making it en
[
https://issues.apache.org/jira/browse/LUCENE-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565919#comment-17565919
]
Julie Tibshirani edited comment on LUCENE-10577 at 7/12/22 4:23 PM:
-
[
https://issues.apache.org/jira/browse/LUCENE-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565871#comment-17565871
]
Julie Tibshirani commented on LUCENE-10577:
---
I checked out the latest PR chan
jpountz merged PR #966:
URL: https://github.com/apache/lucene/pull/966
mikemccand opened a new issue, #38:
URL: https://github.com/apache/lucene-jira-archive/issues/38
Spinoff from #33.
Some issues' text hit a stack overflow exception, e.g. one of the comments
on LUCENE-550:
```
(.venv) beast3:migration[polish_legacy_jira]$ python
src/jira2git
mikemccand commented on issue #38:
URL:
https://github.com/apache/lucene-jira-archive/issues/38#issuecomment-1181596940
Note that it is pretty rare -- when I ran the full conversion, I saw four
separate occurrences. Might not be so important to track down? We can just
carry over the raw
[
https://issues.apache.org/jira/browse/LUCENE-10619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565872#comment-17565872
]
ASF subversion and git services commented on LUCENE-10619:
--
Co
mikemccand commented on PR #33:
URL:
https://github.com/apache/lucene-jira-archive/pull/33#issuecomment-1181657754
I pushed a small change to make a best-effort when we hit exceptions from
the converter. Such comments look like this:
https://github.com/mikemccand/stargazers-migration-test
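
A minimal sketch of what a best-effort fallback of this kind might look like. The actual change is not shown in this message, so everything here is an assumption: `convert_text` is a hypothetical stand-in for the real converter, and wrapping the raw text in a code fence is just one plausible way to carry it over.

```python
FENCE = "`" * 3  # Markdown code fence, built here to keep the example readable


def convert_text(text: str) -> str:
    # Hypothetical stand-in for the Jira-markup-to-Markdown converter.
    if "{bad}" in text:
        raise RecursionError("maximum recursion depth exceeded")
    return text.replace("{{", "`").replace("}}", "`")


def convert_best_effort(text: str) -> str:
    try:
        return convert_text(text)
    except Exception:  # RecursionError is a subclass of Exception
        # Assumed fallback: keep the raw Jira text, fenced so it renders verbatim.
        return f"{FENCE}\n{text}\n{FENCE}"


if __name__ == "__main__":
    print(convert_best_effort("call {{foo()}} here"))
    print(convert_best_effort("a {bad} comment"))
```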
mikemccand commented on code in PR #39:
URL: https://github.com/apache/lucene-jira-archive/pull/39#discussion_r919015037
##
migration/src/markup/lists.py:
##
@@ -40,6 +40,11 @@ def action(self, tokens: ParseResults) -> str:
for line in tokens:
# print(rep
tang-hi commented on PR #966:
URL: https://github.com/apache/lucene/pull/966#issuecomment-1181886902
@jpountz thanks for the suggestion 😄 . I have changed testWriteBytes to
write small chunks each time
mocobeta commented on issue #38:
URL:
https://github.com/apache/lucene-jira-archive/issues/38#issuecomment-1181803770
I'll merge it once I've confirmed it parses all Jira issues without any errors. (I
think nobody can review the quick and dirty fix...)
mikemccand commented on PR #33:
URL:
https://github.com/apache/lucene-jira-archive/pull/33#issuecomment-1181662032
OK don't merge this -- I somehow messed up and slurped in unrelated (already
previously committed/pushed) changes. I have to drop off for now but will try
to fix this a bit l
mocobeta commented on PR #39:
URL:
https://github.com/apache/lucene-jira-archive/pull/39#issuecomment-1181804695
Thank you @mikemccand
[
https://issues.apache.org/jira/browse/LUCENE-10619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Adrien Grand resolved LUCENE-10619.
---
Fix Version/s: 9.3
Resolution: Fixed
> Optimize the writeBytes in TermsHashPerField
[
https://issues.apache.org/jira/browse/LUCENE-10619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565873#comment-17565873
]
ASF subversion and git services commented on LUCENE-10619:
--
Co
mikemccand commented on PR #33:
URL:
https://github.com/apache/lucene-jira-archive/pull/33#issuecomment-1181660019
Sorry -- not pushed to the PR yet -- struggling w/ git ;)
jpountz commented on code in PR #987:
URL: https://github.com/apache/lucene/pull/987#discussion_r918752313
##
lucene/core/src/java/org/apache/lucene/codecs/compressing/CompressionMode.java:
##
@@ -257,9 +270,13 @@ private static class DeflateCompressor extends Compressor {
[
https://issues.apache.org/jira/browse/LUCENE-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566015#comment-17566015
]
Michael Sokolov commented on LUCENE-10577:
--
OK, that makes sense to me – I'll
[
https://issues.apache.org/jira/browse/LUCENE-10471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566026#comment-17566026
]
Mayya Sharipova commented on LUCENE-10471:
--
[~sstolpovskiy] [~sokolov] Thanks
Nick Knize created LUCENE-10654:
---
Summary: New companion doc value format for LatLonShape and
XYShape field types
Key: LUCENE-10654
URL: https://issues.apache.org/jira/browse/LUCENE-10654
Project: Lucen
[
https://issues.apache.org/jira/browse/LUCENE-10654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Knize updated LUCENE-10654:
Description:
{{XYDocValuesField}} provides doc value support for {{XYPoint}}.
{{LatLonDocValuesF
Yuti-G commented on code in PR #1013:
URL: https://github.com/apache/lucene/pull/1013#discussion_r919502708
##
lucene/facet/src/test/org/apache/lucene/facet/FacetTestCase.java:
##
@@ -264,4 +264,24 @@ protected void assertFloatValuesEquals(FacetResult a, FacetResult b) {
nknize opened a new pull request, #1017:
URL: https://github.com/apache/lucene/pull/1017
Adds new doc value field to support LatLonShape and XYShape doc values. The
implementation is inspired by ComponentTree. A binary tree of tessellated
components (point, line, or triangle) is created
[
https://issues.apache.org/jira/browse/LUCENE-10649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566071#comment-17566071
]
Vigya Sharma commented on LUCENE-10649:
---
Great, thanks for confirming Adrien. I'l
[
https://issues.apache.org/jira/browse/LUCENE-10654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Knize updated LUCENE-10654:
Fix Version/s: 9.3
> New companion doc value format for LatLonShape and XYShape field types
> ---
gsmiller merged PR #1010:
URL: https://github.com/apache/lucene/pull/1010
mocobeta merged PR #39:
URL: https://github.com/apache/lucene-jira-archive/pull/39
mocobeta closed issue #38: StackOverflowException on certain issue descriptions
and comment text
URL: https://github.com/apache/lucene-jira-archive/issues/38
msokolov commented on PR #947:
URL: https://github.com/apache/lucene/pull/947#issuecomment-1182694202
OK, this last round of commits moves the new vector encoding parameter out
of IndexableField and FieldInfo into Codec constructor and internally to the
codec, in FieldEntry. It certainly ha
zacharymorn opened a new pull request, #1018:
URL: https://github.com/apache/lucene/pull/1018
### Description (or a Jira issue link if you have one)
Use BulkScorer to limit BMMScorer to only top-level disjunctions
Note: Tests update pending
[
https://issues.apache.org/jira/browse/LUCENE-10480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566149#comment-17566149
]
Zach Chen commented on LUCENE-10480:
{quote}I wouldn't say blocker, but maybe we co
[
https://issues.apache.org/jira/browse/LUCENE-10480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566149#comment-17566149
]
Zach Chen edited comment on LUCENE-10480 at 7/13/22 5:09 AM:
zacharymorn commented on PR #1018:
URL: https://github.com/apache/lucene/pull/1018#issuecomment-1182774748
Benchmark results with `wikinightly.tasks` boolean queries below:
```
Task    QPS baseline    StdDev    QPS my_modified_version    StdDev
navneet1v commented on code in PR #1017:
URL: https://github.com/apache/lucene/pull/1017#discussion_r919668826
##
lucene/core/src/java/org/apache/lucene/document/ShapeDocValuesField.java:
##
@@ -0,0 +1,844 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or
[
https://issues.apache.org/jira/browse/LUCENE-10650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566191#comment-17566191
]
Nathan Meisels commented on LUCENE-10650:
-
Thanks for the response!
I have ano