Hi, 31-05-2012, 11:45 (+0200); Agustin Martin escriu: > On Fri, Feb 24, 2012 at 05:30:29PM +0100, Agustin Martin wrote: > > On Sun, Feb 19, 2012 at 05:23:42PM +0100, Ernest Adrogué wrote: > > > It splits it into two words. I added the following line in the affix > > > file: > > > > > > WORDCHARS · > > > > > > and apparently now it works (the line above must go before the first > > > TRY): > > > myspell-ca puts middle dot in the TRY section. Not sure if this is supposed > > to imply that all chars above are declared as possible parts of a word, or > > for '·', ' (and in general for anything but '-', but better declare this > > too) we need to declare it explicitly at WORDCHARS. > > > > Anyway, we can easily add it to the aff file, although it will only work for > > standalone hunspell. > > For the records, I have uploaded a new Debian myspell-ca package containing an > explicit WORDCHARS line for the different wordchars accepted by Catalan, > > WORDCHARS ·-' > > Please check.
Thanks, I will check it out. > > Did you contact upstream (Joan Moratinos) about this or tried some of > > softcatala resources (forums, ...)? > > Do you know of any news about this at upstream side? Yes. I sent a couple of mails to them and they said they were aware of the problem and that they are working on it but don't expect a fix anytime soon. The thing is there is no simple solution because each spellchecking library/program uses its own algorithms to split words, so the problem has to be dealt with on a case-by-case basis. I suggested that they include a WORDCHARS field in their dictionary, along with other improvements, but never got a reply. That was about 5-6 months ago. It didn't look like they are very receptive to suggestions from "outsiders", that is my impression. Sorry for not getting back to you sooner, I had forgot all about it! Ernest -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org