jpountz opened a new pull request, #12053:
URL: https://github.com/apache/lucene/pull/12053

   Today Lucene allows creating indexed binary fields, e.g. via 
`StringField(String, BytesRef, Field.Store)`, but not reusing them: calling 
`setBytesValue` on a `StringField` throws.
   
   This commit removes the check that prevents reusing fields with binary 
values. I considered an alternative that consisted of failing if calling 
`setBytesValue` on a field that is indexed and tokenized, but we currently 
don't have such checks e.g. on numeric values, so it did not feel consistent.
   
   Doing this change would help improve the [nightly benchmarks for the NYC 
taxis 
dataset](http://people.apache.org/~mikemccand/lucenebench/sparseResults.html) 
by doing the String -> UTF-8 conversion only once for keywords, instead of once 
for the `StringField` and one for the `SortedDocValuesField`, while still 
reusing fields.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to