Dear Apertiumers,

I'm happy to announce the first beta release (0.1.0) of the arg-cat (Aragonese 
and Catalan) translator, using the arg and cat monolingual modules. Fran Tyers 
and I have been working on it during the last couple of weeks.
The main features are presented below.
* Initial release (0.1.0), beta
* Operative bidirectional release.
* Features:
   * Uses monolingual packages apertium-cat and apertium-arg (separated 
monolingual and bilingual data).
   * Trimming of the monolingual dictionary (in both directions).
   * Lexical selection support (not used yet).
   * Monolingual dictionaries:
        Aragonese (apertium-arg): > 546 paradigms, > 21073 lemmae (of which, 
8047 proper nouns), including 1921 multi-words.
        Catalan (apertium-cat): > 559 paradigms, > 41143 lemmae (of which, 8341 
proper nouns), including 4115 multi-words.
   * Bilingual dictionary: 24110 entries. Open categories crossed from 
apertium-spa-arg (ver 0.4.1) and apertium-es-ca. Some closed categories 
incomplete.
   * Naïve coverage:
        Aragonese: From 87.7% in an.wikipedia to 89.2% in a corpus of narrative 
texts.
        Catalan: From 87.6% in Catalan corpus at 
trunk/apertium-eo-ca/tekstaro/ca.crp.txt to 93.2% at 
trunk/apertium-es-ca/ca-tagger-data/ca.tagged.txt
   * Translation performance: Catalan to Aragonese (50 first sentences of 
ca.crp.txt): WER = 19.37% (PER=17.85%);  WER (unknown removed) = 15.48% 
(PER=13.96%). Unknown words: 11.86%. Free rides: 32.69%.
   * Dialectal support: Analysis of Aragonese dialectal forms supported. 
Valencian forms are analyzed, but not generated.
* Main pending Issues:
   * Improve on coverage.
   * Work on transfer.
   * Develop lexical selection rules

Best,
Juan Pablo

<http://sourceforge.net/about>

------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=267308311&iu=/4140
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to