I tagged the entire file and we have the same amount of lines as in the source file:
bash-4.4$ wc -l tatcorpus3.sentences tatcorpus3.sentences.apertium.tagged.2018-11-10 38909475 tatcorpus3.sentences 38909475 tatcorpus3.sentences.apertium.tagged.2018-11-10 Moving to the newer version of Apertium solved the problem. But I cannot compare it line by line using 'diff'. Can we untag the file using apertium to get the text close to the original? Am Sa., 10. Nov. 2018 um 09:52 Uhr schrieb Kevin Brubeck Unhammer < [email protected]>: > mansur <[email protected]> čálii: > > > Should I run 'cg-comp dev/mansur.rlx dev/mansur.bin' before 'autogen.sh > && > > make' or after? > > If it's not in the makefiles anyway, it doesn't matter. > _______________________________________________ > Apertium-stuff mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/apertium-stuff >
_______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
