Hi,

31-05-2012, 11:45 (+0200); Agustin Martin escriu:
> On Fri, Feb 24, 2012 at 05:30:29PM +0100, Agustin Martin wrote:
> > On Sun, Feb 19, 2012 at 05:23:42PM +0100, Ernest Adrogué wrote:
> > > It splits it into two words. I added the following line in the affix
> > > file:
> > > 
> > > WORDCHARS ·
> > > 
> > > and apparently now it works (the line above must go before the first
> > > TRY):
> 
> > myspell-ca puts middle dot in the TRY section. Not sure if this is supposed
> > to imply that all chars above are declared as possible parts of a word, or
> > for '·', '  (and in general for anything but '-', but better declare this
> > too) we need to declare it explicitly at WORDCHARS.
> > 
> > Anyway, we can easily add it to the aff file, although it will only work for
> > standalone hunspell.
> 
> For the records, I have uploaded a new Debian myspell-ca package containing an
> explicit WORDCHARS line for the different wordchars accepted by Catalan,
> 
> WORDCHARS ·-'
> 
> Please check.

Thanks, I will check it out.

> > Did you contact upstream (Joan Moratinos) about this or tried some of
> > softcatala resources (forums, ...)?
> 
> Do you know of any news about this at upstream side?

Yes. I sent a couple of mails to them and they said they were aware of
the problem and that they are working on it but don't expect a fix
anytime soon. The thing is there is no simple solution because each
spellchecking library/program uses its own algorithms to split words,
so the problem has to be dealt with on a case-by-case basis.

I suggested that they include a WORDCHARS field in their dictionary,
along with other improvements, but never got a reply. That was about
5-6 months ago. It didn't look like they are very receptive to
suggestions from "outsiders", that is my impression.

Sorry for not getting back to you sooner, I had forgot all about it!

Ernest



--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org

Reply via email to