On Wed, Jul 31, 2024 at 11:51 PM Werner LEMBERG <w...@gnu.org> wrote: > > However, in the meantime, meaning for groff 1.24, I propose to move > > `hcode` definitions to where they make more sense: the character set > > macro files "koi8-r.tmac", "latin1.tmac", "latin2.tmac", and > > "latin9.tmac". (If/when I do that, I'll need to update the > > "tmac/LOCALIZATION" file accordingly.) > > Probably a good idea. The few cases where this has to be changed > (classical example: Turkish needs 'İ' mapped to 'i' and 'I' mapped to > 'ı') can be overridden in a language-specific hyphenation setup.
This change was made in commit 0629380a9 (http://git.savannah.gnu.org/cgit/groff.git/commit/?id=0629380a9). But it doesn't seem to be the right solution, because now hyphenation varies depending on output format. This surely can't be the intent. $ cat test.59397 .hy 4 .ll 1u resume r\['e]sum\['e] $ groff --version | head -1 GNU groff version 1.23.0.1624-4d251-dirty $ groff -Wbreak -Tlatin1 test.59397 | cat -s re- sume ré- sumé $ groff -Wbreak -a test.59397 <beginning of page> re<hy> sume r<'e>sum<'e>