Hi, See below. Yours, Per Tunedal On Sun, Dec 16, 2012, at 23:07, Francis Tyers wrote: > El dg 16 de 12 de 2012 a les 14:21 +0100, en/na Per Tunedal va escriure: > > Hi, > > I consider info on primarily domain and secondly style useful for > > disambiguation. As a first step it would be very nice to be able to add > > a domain-tag to words. Adding info on style would make it possible to > > further improve translation results. > > > > The translation would improve considerably if the user could choose the > > appropriate domain when demanding a translation. Consider e.g. the > > example of translating the English word "key", or similarly the French > > word "clé/clef", to Swedish. If the domain is e.g. > > Tourism/accommodation/real estate or similar, the word would most likely > > translate to "nyckel" (to lock/unlock the door of a house). On the other > > hand if the domain is e.g. information technology (or even music) the > > word would most likely translate to "tangent" (on your keyboard or > > piano). Obviously, a lexical selector/disambiguator could be trained on > > a corpus from a specific domain as well, further improving to the > > translation. > > I did this in my thesis. It's quite effective. It's possible to tune the > vocabulary to a domain with either parallel or monolingual corpora using > apertium-lex-tools.[1] You won't be interested in it though as it > doesn't work with the Java version.
What will work with the java-version? And what will not? What's the problem? What I would like to do is: - adding info about domain in the dictionaries - do some training on an appropriate corpus - let the user choose a suitable domain (if any), as an alternative to the "general" domain - let Apertium use info from the dictionaries and the training to solve ambiguities. BTW Would it do any difference if you trained the tagger on a domain corpus? > > > On more reason to take domains into account is that it's fairly easy to > > get a high coverage and accuracy when concentrating on translating a > > domain, rather than trying to make a system that makes a general > > translation. Further, it's a fruitful way to build a new language pair: > > adding a domain at a time. Thus, the system quickly gets useful. > > ---snip-- > > Fran > --snip-- ------------------------------------------------------------------------------ LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial Remotely access PCs and mobile devices and provide instant support Improve your efficiency, and focus on delivering more value-add services Discover what IT Professionals Know. Rescue delivers http://p.sf.net/sfu/logmein_12329d2d _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
