[ 
https://issues.apache.org/jira/browse/LUCENE-5572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17419592#comment-17419592
 ] 

Tomoko Uchida commented on LUCENE-5572:
---------------------------------------

On a side note: LUCENE-8971 allows users to load their own custom dictionary 
from a path (I only just remembered this). However, it is a very advanced 
feature, aimed at users who know how to train and build a kuromoji system 
dictionary, and I think of it as a transitional remedy until we have a way to 
decouple the analysis engine from the dictionary (data). Most users still 
rely on the off-the-shelf resource bundled with the kuromoji jar.

> JapaneseTokenizer is sensitive to interrupts
> --------------------------------------------
>
>                 Key: LUCENE-5572
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5572
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.6.2
>            Reporter: Anthony Rasmussen
>            Priority: Minor
>
> The constructor for JapaneseTokenizer gets the following singleton instances: 
> TokenInfoDictionary, UnknownDictionary, and ConnectionCosts. I am finding 
> that the associated getInstance methods are particularly sensitive to 
> IOExceptions.
> Perhaps, in the static initializers of these 3 singletons, there could be 
> some sort of retry effort before throwing a RuntimeException?
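
For illustration only, the retry idea in the description could be sketched as below. This is not Lucene code: RetryingLoader and loadWithRetry are hypothetical names, and the sketch assumes the dictionary load can be expressed as a Callable that may throw IOException.

```java
import java.io.IOException;
import java.util.concurrent.Callable;

/** Hypothetical helper for illustration; not part of Lucene. */
final class RetryingLoader {
  private RetryingLoader() {}

  /**
   * Calls the loader up to maxAttempts times. Only IOException triggers a
   * retry; once the attempts are used up, the last IOException is wrapped
   * in a RuntimeException, as the issue description suggests.
   */
  static <T> T loadWithRetry(Callable<T> loader, int maxAttempts) {
    if (maxAttempts < 1) {
      throw new IllegalArgumentException("maxAttempts must be >= 1");
    }
    IOException last = null;
    for (int attempt = 1; attempt <= maxAttempts; attempt++) {
      try {
        return loader.call();
      } catch (IOException e) {
        last = e; // remember the failure and try again
      } catch (RuntimeException e) {
        throw e; // unchecked failures are not retried
      } catch (Exception e) {
        throw new RuntimeException("Cannot load resource", e);
      }
    }
    throw new RuntimeException(
        "Cannot load resource after " + maxAttempts + " attempts", last);
  }
}
```

A static initializer could then wrap its load in a call such as loadWithRetry(() -> readResource(), 3), where readResource is again a placeholder. One caveat: if the IOException comes from a thread interrupt, the interrupt status normally stays set, so a plain retry on the same thread would probably fail the same way unless the caller deals with the interrupt first.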



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

