OK! Thanks a lot for your reply and recommendation.

I configured the apertium core and litoolbox environment and downloaded
several dictionaries on my computer the other day. Recently I've been
getting familiar with their usage and the meaning of each of the options.

I have a certain understanding of the composition of Unicode code, and now
I am also studying the grammar of ICU and making some progress.

As for IRC, I will always keep an eye on the communication on the channel.

Best regards,

--Weizhe

On Tue, Mar 3, 2020 at 9:10 PM Flammie A Pirinen <[email protected]> wrote:

> Hi,
>
> I am this week on hliday with low internet availability so only few
> quick points. Firstly I strogly recommend joining #apertium IRC channel,
> I think even non-mentors will have useful clues. For the tokenisation
> problem I think the main resource is to understand various unicode
> technical reports that describe tokenisations and a C++ library like
> ICU, and then how apertium currently does tokenisations and how this
> projects code will interact, especially for the last point many other
> people in IRC know it better  than me.
>
> Regards,
>
> On Thu, Feb 27, 2020 at 01:45:09PM +0800, 杨伟哲 wrote:
> > Hi Francis and Flammie,
> >
> > I’m interested in the “Robust tokenisation in lttoolbox”[1] GSoC project.
> > And
> > currently I’m writing the proposal.
> >
> > I have completed the code challenge listed in the project, which has been
> > put
> > on Pastebin[2]. However, I’m not quite clear where this project starting
> > with.
> > And I will be much appreciate if you could list somewhere (e.g. GitHub
> repo
> > related to this project) for me to get started with. I will also try to
> > learn
> > and solve issues there if possible.
> >
> > Bio: I’m Chinese undergraduate in Software Engineering. In my freshman
> > year, I
> > joined the high-performance computing center[3] of the university as a
> > research
> > assistant. Through research and learning during the period, I have a deep
> > understanding of software architecture and open source projects.
> >
> >
> > [1]
> >
> http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Robust_tokenisation
> >
> > [2] https://github.com/GavinWz/Apertium
> >
> > [3] http://cs.wfu.edu.cn/2014/0603/c1227a33048/page.htm
> >
> >
> > Regards,
> >
> > Weizhe Yang
>
>
> > _______________________________________________
> > Apertium-stuff mailing list
> > [email protected]
> > https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
>
> --
> Regards, Flammie <https://flammie.github.io>
> (Please note, that I will often include my replies inline instead of
> top or bottom of the mail)
> _______________________________________________
> Apertium-stuff mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to