Well, WordDelimiterFilterFactory would split on the punctuation, so
you could add it to the analyzer chain along with StandardAnalyzer.

You could use one of the regex filters to break up tokens that make it
through the analyzer as you see fit.

But in general, this will be a bunch of compromises since programming
languages are, shall we say, not standard <G>

Best
Erick


On Thu, Jun 13, 2013 at 4:19 AM, Gian Maria Ricci
<alkamp...@nablasoft.com>wrote:

> I did a little search around and did not find anything interesting. Anyone
> know if some analyzers exists to better index source code (es C#, C++. Java
> etc)?****
>
> ** **
>
> Standard analyzer is quite good, but I wish to know if there are some more
> specific analyzers that can do a better indexing. Es I did a little try
> with C# and the full class name was indexed without splitting by dots. So
> MyLib.Helpers.Myclass becomes one token and when I search for MyClass I did
> not find matches. ****
>
> ** **
>
> Thanks in advance.****
>
> ** **
>
> --****
>
> Gian Maria Ricci****
>
> Mobile: +39 320 0136949****
>
> <http://mvp.microsoft.com/en-us/mvp/Gian%20Maria%20Ricci-4025635> [image:
> https://encrypted-tbn1.gstatic.com/images?q=tbn:ANd9GcQyg0wiW_QuTxl-rnuVR2P0jGuj4qO3I9attctCNarL--FC3vdPYg]<http://www.linkedin.com/in/gianmariaricci>
>  [image:
> https://encrypted-tbn2.gstatic.com/images?q=tbn:ANd9GcT8z0HpwpDSjDWw1I59Yx7HmF79u-NnP0NYeYYyEyWM1WtIbOl7]<https://twitter.com/alkampfer>
>  [image:
> https://encrypted-tbn1.gstatic.com/images?q=tbn:ANd9GcQQWMj687BGGypKMUTub_lkUrull1uU2LTx0K2tDBeu3mNUr7Oxlg]<http://feeds.feedburner.com/AlkampferEng>
>  [image:
> https://encrypted-tbn3.gstatic.com/images?q=tbn:ANd9GcSkTG_lPTPFe470xfDtiInUtseqKcuV_lvI5h_-8t_3PsY5ikg3]
> ****
>
> ** **
>
> ** **
>

Reply via email to