tags 175064 patch thanks Hi,
Current build use LANG=C which should work since LaTeX is forced to use C locale. I think older debiandoc2* used to use character conversion to high bit latin-1 character for © using latin-1 if locale is not specified as -l option. Under non latin-1, it shows up in funny character. HEX 1A or Decimal 169 is © = (C) under latin-1,7,8,9,13,14,15 and Š = S with v on top under latin-2,4 . So Josip's reply makes sense. Also (C) encoded values are UTF-8: 0xC2 0xA9 So it can A9 only does not work under UTF-8 So attached patch should work to build proper UTF-8 (Instead of ASCII only) pages. I am not pushing this hard for squeeze since we are deep freeze but if someone wants it, please test it and use it. Osamu
diff --git a/Makefile b/Makefile index 9ab6801..8767276 100644 --- a/Makefile +++ b/Makefile @@ -18,10 +18,10 @@ perl-policy.sgml: version.ent nsgmls -wall -gues $< %.html/index.html: %.sgml - LANG=C debiandoc2html $< + debiandoc2html -l en.UTF-8 $< %-1.html: %.sgml - LANG=C debiandoc2html -1 -b $*-1d $< && \ + debiandoc2html -l en.UTF-8 -1 -b $*-1d $< && \ mv $*-1d.html/index.html $*-1.html && \ rmdir $*-1d.html @@ -29,19 +29,19 @@ perl-policy.sgml: version.ent tar -czf $(<:/index.html=.tar.gz) $(<:/index.html=) %.txt: %.sgml - LANG=C debiandoc2text $< + debiandoc2text -l en.UTF-8 $< %.txt.gz: %.txt gzip -cf9 $< > $@ %.ps: %.sgml - LANG=C debiandoc2latexps $< + debiandoc2latexps -l en.UTF-8 $< %.ps.gz: %.ps gzip -cf9 $< > $@ %.pdf: %.sgml - LANG=C debiandoc2latexpdf $< + debiandoc2latexpdf -l en.UTF-8 $< %.pdf.gz: %.pdf gzip -cf9 $< > $@