Hi,
See below.
Yours,
Per Tunedal

On Sun, Dec 16, 2012, at 23:07, Francis Tyers wrote:
> El dg 16 de 12 de 2012 a les 14:21 +0100, en/na Per Tunedal va escriure:
> > Hi,
> > I consider info on primarily domain and secondly style useful for
> > disambiguation. As a first step it would be very nice to be able to add
> > a domain-tag to words. Adding info on style would make it possible to
> > further improve translation results.
> > 
> > The translation would improve considerably if the user could choose the
> > appropriate domain when demanding a translation. Consider e.g. the
> > example of translating the English word "key", or similarly the French
> > word "clé/clef", to Swedish. If the domain is e.g.
> > Tourism/accommodation/real estate or similar, the word would most likely
> > translate to "nyckel" (to lock/unlock the door of a house). On the other
> > hand if the domain is e.g. information technology (or even music) the
> > word would most likely translate to "tangent" (on your keyboard or
> > piano). Obviously, a lexical selector/disambiguator could be trained on
> > a corpus from a specific domain as well, further improving to the
> > translation.
> 
> I did this in my thesis. It's quite effective. It's possible to tune the
> vocabulary to a domain with either parallel or monolingual corpora using
> apertium-lex-tools.[1] You won't be interested in it though as it
> doesn't work with the Java version.

What will work with the java-version? And what will not? What's the
problem?

What I would like to do is:
- adding info about domain in the dictionaries
- do some training on an appropriate corpus
- let the user choose a suitable domain (if any), as an alternative to
the "general" domain
- let Apertium use info from the dictionaries and the training to solve
ambiguities.

BTW Would it do any difference if you trained the tagger on a domain
corpus?

> 
> > On more reason to take domains into account is that it's fairly easy to
> > get a high coverage and accuracy when concentrating on translating a
> > domain, rather than trying to make a system that makes a general
> > translation. Further, it's a fruitful way to build a new language pair:
> > adding a domain at a time. Thus, the system quickly gets useful.
> > 
---snip--
> 
> Fran
> 
--snip--

------------------------------------------------------------------------------
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to