kuramitsu opened a new pull request, #12885:
URL: https://github.com/apache/lucene/pull/12885

   ### Description
   I found a bug in JapaneseReadingFormFilter: some hiragana are not 
converted to romaji.
   (For example, "ぐ" does not become "gu". I noticed this because searching 
for "ますきんぐ" returned no hits for "マスキング".)
   I believe this happens because the kuromoji dictionary does not explicitly 
define readings for some hiragana.
   
   ### Draft
   In the getRomanization function, how about converting hiragana to 
katakana whenever it is detected, so that it is romanized the same way?
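
   A minimal sketch of the hiragana-to-katakana step, for illustration only. It assumes the conversion can rely on the fixed Unicode offset between the hiragana letters (U+3041 to U+3096) and their katakana counterparts (U+30A1 to U+30F6). The class and method names are hypothetical and are not part of Lucene; how this would be wired into getRomanization is left open.

```java
public final class HiraganaToKatakanaSketch {
    // Hiragana letters U+3041..U+3096 sit exactly 0x60 below the
    // matching katakana letters U+30A1..U+30F6.
    private static final char HIRAGANA_START = '\u3041';
    private static final char HIRAGANA_END = '\u3096';
    private static final int KATAKANA_OFFSET = 0x60;

    private HiraganaToKatakanaSketch() {}

    /** Hypothetical helper: returns a copy of the input with hiragana letters replaced by katakana. */
    public static String toKatakana(CharSequence input) {
        StringBuilder out = new StringBuilder(input.length());
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            if (c >= HIRAGANA_START && c <= HIRAGANA_END) {
                // Shift the hiragana letter into the katakana block.
                out.append((char) (c + KATAKANA_OFFSET));
            } else {
                // Katakana, kanji, Latin letters, and so on pass through unchanged.
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(toKatakana("ますきんぐ")); // prints マスキング
    }
}
```

   With a step like this applied first, "ますきんぐ" and "マスキング" would reach the romanization logic as the same katakana string.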


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

