Re: [PR] [New Feature] Case-Sensitive Text Indexing [pinot]

via GitHub Sat, 17 May 2025 00:12:50 -0700


xiangfu0 commented on PR #15803:
URL: https://github.com/apache/pinot/pull/15803#issuecomment-2888166150


   > > What do you think about just reusing the existing Analyzer class config 
to enable this? I see case sensitivity eventually being necessary for multiple 
analyzers, but the config only enables a case sensitive StandardAnalyzer. MV 
column text search is another case where a different analyzer is useful (to 
prevent the value boundary being crossed for phrases). I see some value in 
Pinot including a number of Analyzers in the future for users to choose from, 
some of which should be case sensitive.
   > > Right now the logic can become more complex if we make a future 
adjustment to the StandardAnalyzer, since configs will generally be mutually 
exclusive or require some branching when adding more derivations of the 
StandardAnalyzer to cover all combinations of configs like this
   > 
   > I think the config here is mainly for simplifying the usage for normal 
users, so they don't have to specify the 
`org.apache.pinot.segment.local.segment.index.text.CaseSensitiveAnalyzer`.
   > 
   > If the concern is more about `StandardAnalyzer` diverging, I feel we 
could just use `CaseSensitiveAnalyzer` as the default and make it configurable 
to apply `LowerCaseFilter` for the current default use cases.
   
   I will merge this PR for now to unblock users, and will open another one 
to refactor the Analyzer; we can use that PR for the discussion.
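
To make the refactoring idea in the quoted reply concrete, here is a minimal sketch of "case-preserving by default, lowercasing as an optional step". It is a toy written against the Java standard library only: `AnalyzerSketch`, `analyze`, and the `lowerCase` flag are hypothetical names for illustration, not Lucene or Pinot APIs, and the regex split is far cruder than Lucene's `StandardTokenizer`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class AnalyzerSketch {
    /**
     * Hypothetical stand-in for an analyzer chain (not a Lucene or Pinot class).
     * Tokens keep their original case unless lowerCase is true, which mimics
     * adding a lowercase filter on top of a case-sensitive base.
     */
    static List<String> analyze(String text, boolean lowerCase) {
        List<String> tokens = new ArrayList<>();
        // Split on runs of characters that are neither letters nor digits.
        for (String token : text.split("[^\\p{L}\\p{N}]+")) {
            if (token.isEmpty()) {
                continue; // a leading delimiter yields an empty first element
            }
            tokens.add(lowerCase ? token.toLowerCase(Locale.ROOT) : token);
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(analyze("Apache Pinot", false)); // [Apache, Pinot]
        System.out.println(analyze("Apache Pinot", true));  // [apache, pinot]
    }
}
```

With this shape, case-insensitive matching is the base behavior plus one optional step, rather than a separate analyzer class per combination of options.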
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


