Le 13 mai 2011 à 17:32, Robert Muir a écrit :

> On Fri, May 13, 2011 at 7:07 AM, Paul Libbrecht <p...@hoplahup.net> wrote:
> 
>> I sure wish such a compound-analysis would be done with a lucene-powered 
>> dictionary!
>> That would rock.
>> 
> 
> me too, but its a chicken-and-egg problem (you would have to basically
> index everything without decomposition to get the dictionary+freqs,
> then use this as decomposition dictionary and index again)

I think this is ok.
It's a kind of a research project.
The decompositions follow some language specific rules I think.
And they should be reviewed by humans.

Maybe a good GSoc project one day...

paul

Reply via email to