kotman12 commented on PR #11955:
URL: https://github.com/apache/lucene/pull/11955#issuecomment-1321159282

   > See background on `java-user@`: 
https://lists.apache.org/thread/okm7c49r7rv53z7v392v2v4pbv6m1pnw
   > 
   > The `BaseTokenStreamTestCase.checkRandomData` has proven itself effective 
at finding thread-safety bugs (e.g. deep in ICU and commons-codec and other 3rd 
party libraries). I beasted the tests a few times with nightly and multiplier 
and didn't have issues.
   
   Does this library also check for race conditions that can arise between and 
ResourceLoaderAware::inform vs TokenStream creation and processing? I know it 
may be out of the scope of this change but I would be curious to know.. Open 
NLP has these heavy-weight global resources and I am worried as to how they get 
initialized. There is an intractable amount code paths to analyze once you take 
into account all of the configuration possibilities of OpenNLP which seems to 
be partially based on the binary model file you provide.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to