kotman12 commented on PR #11955: URL: https://github.com/apache/lucene/pull/11955#issuecomment-1321159282
> See background on `java-user@`: https://lists.apache.org/thread/okm7c49r7rv53z7v392v2v4pbv6m1pnw
>
> The `BaseTokenStreamTestCase.checkRandomData` has proven itself effective at finding thread-safety bugs (e.g. deep in ICU and commons-codec and other 3rd party libraries). I beasted the tests a few times with nightly and multiplier and didn't have issues.

Does this library also check for race conditions that can arise between `ResourceLoaderAware::inform` and TokenStream creation and processing? I know it may be out of the scope of this change, but I would be curious to know.

OpenNLP has these heavyweight global resources, and I am worried about how they get initialized. There is an intractable number of code paths to analyze once you take into account all of the configuration possibilities of OpenNLP, which seem to be partially based on the binary model file you provide.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at: us...@infra.apache.org