Hi

I've just hacked the hunspell sources I got from CVS so that I could get
rid of this hunspell bug:
https://sourceforge.net/tracker/?func=detail&aid=3288562&group_id=143754&atid=756395

The issue was that hunspell would accept words with one suffix+compound,
but not two suffixes+compound. Patch attached in the above link.

-------

Since most words in Quichua have stem+suffix_level1+suffix_level2
+lots_of_compounds, this was a showstopper for my spellchecker.

At first I wrote a generator script that would create all the different
inflexes and stick it into a dictionary, but it was a bad hack and very
slow with a dictionary file of 170000+entries for only some 3000
words...

But now also analysis benefits :D

hunspell -d qu_EC -m   
mikunakuykunatakapash  pa:mikunakuy st:mikuna # base plur #
Nounification "amongst us" pa:kuna st:kuna pa:ta st:ta pa:ka st:ka
pa:pash

[For the curious the word means something like "for to the big feasts,
also"]
--------

The patch is just two lines, one for each of two functions in
affixmgr.cxx, compound_check() and compound_check_morph().

For LO only the one in AffMgr::compound_check() is needed, I think,
haven't seen the morphology analysis being used anywhere in LO...

best
Arno Teigseth

Attachment: signature.asc
Description: This is a digitally signed message part

_______________________________________________
LibreOffice mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/libreoffice

Reply via email to