Dear fellow Apertiumers,
I think that this would be an interesting venue to present recent
work on Apertium. The last paper we published on Apertium was
in 2009/11 (quite a long time ago) in the journal Machine Translation.
NLE is a scopus Q2/Q3 tier journal, so a decent place to publish
if you need those academic point things.
Here are the "Limitations and work ahead" from the last paper:
1) The performance of the part-of-speech tagging module may be below the
state of
the art for many languages.
2) Polysemous SL words may have more than one TL equivalent. Apertium
bilingual
dictionaries currently provide only one TL LF per SL LF.
3) The structural transfer component does not rely on a full parse tree
of the whole
sentence, but rather on one or more levels of chunking.
4) The structural transfer module is by far the most time-consuming one:
5) Complex and discontiguous MWLUs are not well covered by the system.
Of these,
(1) was taken care of by the perceptron tagger (2016)
(2) is taken care of by the lexical selection module (2013)
(5) is taken care of by apertium-separable (2017)
In addition we probably have double the number of language pairs that we
had
in 2009/11, including many "similar languages and varieties", and there
are new
evaluation results and methods for inferring transfer rules.
If we choose to write a paper, it could focus on the work we've done on
those
languages. e.g. how we deal with variation (ca_valencia, ca_gva,
oci_aran, hbs_HR etc.)
and the new things in the platform.
Please feel free to reply to this email on the list if you are
interested, or
reply to me personally.
Regards,
Fran
-------- Mensaje Original --------
Asunto: [Corpora-List] 1st Call for Papers: NLE Journal Special Issue on
NLP for Similar Languages, Varieties and Dialects
Fecha: 2018-06-04 21:24
De: "Zampieri, Marcos" <[email protected]>
Destinatario: "[email protected]" <[email protected]>
1st Call for Papers
Natural Language Engineering Journal - Cambridge University Press
Special Issue on NLP for Similar Languages, Varieties and Dialects
URL: https://sites.google.com/view/nledialects
Guest Editors
Marcos Zampieri (University of Wolverhampton, United Kingdom)
Preslav Nakov (Qatar Computing Research Institute, HBKU, Qatar)
Topics
Recent initiatives in language technology have led to the development of
at least minimal language processing toolkits for all EU-official
languages, as well as for languages with a large number of speakers
worldwide such as Chinese and Arabic. Apart from those official
languages, a large number of dialects or closely-related language
varieties are in daily use, not only as spoken colloquial languages but
also in written media and social networks. Building language resources
and tools from scratch is expensive, but the efforts can often be
reduced by making use of pre-existing resources and tools for related,
resource-richer languages.
The interest in language resources and computational models for the
study of similar languages, language varieties and dialects has been
growing substantially in recent years. This is evidenced by a number of
publications on this topic in NLP journals and conferences and the
organization of the now well-established VarDial workshop series
co-located yearly with top-tier NLP conferences.
We welcome papers dealing with one or more of the following topics:
- Language resources and tools for similar languages, varieties and
dialects;
- Adaptation of tools (taggers, parsers) for similar languages,
varieties and dialects;
- Evaluation of language resources and tools when applied to language
varieties;
- Reusability of language resources in NLP applications (e.g., for
machine translation, POS tagging, syntactic parsing, etc.);
- Corpus-driven studies in dialectology and language variation;
- Computational approaches to the study of mutual intelligibility
between dialects and similar languages;
- Automatic identification of lexical variation;
- Automatic classification of language varieties;
- Text similarity and adaptation between language varieties;
- Linguistic issues in the adaptation of language resources and tools
(e.g., semantic discrepancies, lexical gaps, false friends);
- Machine translation between closely related languages, language
varieties and dialects.
Important Dates
- Deadline for submissions: 15 October 2018
- First-round author notification: 15 December 2018
- Submission of revised versions: 1 February 2019
- Second-round author notification (final): 1 April 2019
- Camera-ready versions: 15 April 2019
Contact: m.zampieri(at)wlv.ac.uk
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff