murblanc commented on issue #5341: URL: https://github.com/apache/lucene/issues/5341#issuecomment-2569618301
I was thinking along the same lines as this idea and was pointed here by @dsmiley. @mikemccand wrote: > We could quite efficiently add the already-inverted doc (term vectors) to the in-memory postings I'm not overly familiar with Lucene data structures, but I imagined that in the Term Vector we only need the terms, not their positions, these can be found from the segment's term dictionary + posting lists. This should limit the index size increase. Strictly speaking, terms and positions can be found without term vectors, but it would take too long to look at all the terms in the dictionary. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org