Le 13 mai 2011 à 17:32, Robert Muir a écrit : > On Fri, May 13, 2011 at 7:07 AM, Paul Libbrecht <p...@hoplahup.net> wrote: > >> I sure wish such a compound-analysis would be done with a lucene-powered >> dictionary! >> That would rock. >> > > me too, but its a chicken-and-egg problem (you would have to basically > index everything without decomposition to get the dictionary+freqs, > then use this as decomposition dictionary and index again) I think this is ok. It's a kind of a research project. The decompositions follow some language specific rules I think. And they should be reviewed by humans. Maybe a good GSoc project one day... paul
- Results with and without whitspace(soccer club and soccer... roySolr
- Re: Results with and without whitspace(soccer club a... Paul Libbrecht
- Re: Results with and without whitspace(soccer cl... Markus Jelsma
- Re: Results with and without whitespace(soccer club ... Grijesh
- Re: Results with and without whitespace(soccer c... roySolr
- Re: Results with and without whitespace(socc... Paul Libbrecht
- Re: Results with and without whitespace(... Robert Muir
- Re: Results with and without whites... Paul Libbrecht
- Re: Results with and without whitspace(soccer club a... lboutros
- Re: Results with and without whitspace(soccer cl... Luis Cappa Banda
- Re: Results with and without whitspace(socce... roySolr
- Re: Results with and without whitspace(s... roySolr
- Re: Results with and without whitsp... Sujit Pal
- Re: Results with and without wh... Erick Erickson
- Re: Results with and withou... roySolr
- Re: Results with and withou... Erick Erickson