Re: Latin-1 files in groff

flexibeast Thu, 16 Apr 2026 17:32:30 -0700

Dave Kemper <[email protected]> writes:

I consider it a sign of respect for volunteer
contributors to spell their names correctly, even names that canbe
transliterated to an ASCII approximation.

Agreed. Transliteration is often inexact, which is why there aree.g. multiple transliterations of the Hebrew word חֲנֻכָּה. EnglishWikipedia transliterates it as 'hanukkah' for the purposes of thetitle of the relevant page, but it's also transliterated as'chanukah', 'chanuka', 'hanukah' and several more. Wikipedianotes:

[T]he letter ḥeth (ח‎), which is the first letter in the Hebrewspelling, is pronounced differently in modern Hebrew (voicelessuvular fricative) from in classical Hebrew (voiceless pharyngealfricative [ħ]), and neither of those sounds is unambiguouslyrepresentable in English spelling.

-- https://en.wikipedia.org/wiki/Hanukkah#Alternative_spellings

So there's no straightforward transliteration for people withHebrew names that contain 'ח'.

The Wikipedia page for Latin-1 ("ISO/IEC 8859-1") notes a numberof European linguistic communities that have to use what itdescribes as "typographical approximation" in the context of thatencoding:


 https://en.wikipedia.org/wiki/ISO/IEC_8859-1#Languages_with_incomplete_coverage

This is probably a good point to bring up the "FalsehoodsProgrammers Believe About Names" page, which amongst otherfalsehoods, lists:

People’s names are written in ASCII.
People’s names are written in any single character set.
People’s names are all mapped in Unicode code points.

--
  
https://www.kalzumeus.com/2010/06/17/falsehoods-programmers-believe-about-names/

Barriers to reading UTF-8 text files are few in 2026.


Indeed, in 2026, >99% of Web pages are encoded in UTF-8:

 https://w3techs.com/technologies/cross/character_encoding/ranking


Alexis (flexibeast).

Re: Latin-1 files in groff

Reply via email to