A 2015-12-22 22:09, Francis Tyers escrigué: > A 2015-12-22 19:12, Joan escrigué: >> Francis,and all >> >> I've recently found out that pairs can be enabled without a real >> "release". FANTASTIC!! >> >> Can you give me a list of what improvements you expect to see ? Is it >> just the vocabulary? YES. BESIDES THE FILE _DOCUMENT.__TXT_ THAT >> I ATTACH YOU, THERE ARE SOME WORDS AS BIOGRAPHIE, FILMOGRAPHIE, >> NEVEU, PETIT-FILS... I didn't understand what you meant by "strange >> characters". THEY ARE MAINLY CREATED WHEN ORIGINALLY IT FIND "<REF >> NAME="XX">. THE “STRANGE CHARACTERS ARE: >SPAN, ABBR CLAS, >> CONTENDITABLE...YOU CAN SEE FOR EXEMPLE IN >> HTTPS://CA.WIKIPEDIA.ORG/W/INDEX.PHP?TITLE=BOB_RAFELSON&ACTION=EDIT >> [9] VERSUS >> HTTPS://FR.WIKIPEDIA.ORG/W/INDEX.PHP?TITLE=BOB_RAFELSON&ACTION=EDIT >> [10] > > This is a problem for the ContentTranslation team. It seems like the > problem > has to do with templates. > >> Would anyone be against me converting fr-ca to: >> >> * three-letter codes >> * monolingual language packages >> * adding lexical selection support >> On our part, NO PROBLEM >> ? > > Ok. Here are the problems I find: > > * Using the large French lexicon from either br-fr or fr-es causes a > substantial decrease > in translation quality. > * The tagger is really bad.
More problems encountered during the testvoc: There is no transfer rule for converting French <ifi> -> Catalan anar + inf, this means that for certain verbs in Catalan there is no <ifi> form available, e.g. for "encobrir", "escanejar" etc. because they are LR in the paradigm in the Catalan dictionary. I've done a simple rule for this, but it won't cover all the cases with pronouns etc. There are some things like "oh these verbs don't have these forms": http://paste2.org/0WbJmzP0 e.g. we can't do anything like "Jo fosquege." Perhaps the paradigm should be updated: "quan jo fosquejo, s'enlluerna fácilment omb els cotxes i, atropeilat per ells, es trobo mort per les correteres. i. Els ropinyoires diürns soler ser difíciis de veure i, ..." Updated output: http://paste2.org/0Ly3F12Z Fran ------------------------------------------------------------------------------ _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
