Package: tex4ht
Version: 20070904-1
Severity: normal

Japanese text usually does not use spaces inside sentences (the space is
not a word delimiter). tex4ht seems to insert spaces after small kana
(the yoon and sokuon characters: "ゃゅょっャュョッ").

The original TeX source is as follows:

「ところで,Debian Weekly News (DWN)は読んでいますか?Debian 界隈でおきていることについて書いているDebian Weekly News. 毎回読んでいるといろいろと分かって来ますが,一人で読んでいても,解説が少ないので,意味がわからないところもあるかも知れません.みんなでDWNを読んでみましょう.」

But it is rendered in HTML as:

「ところで, Debian Weekly News (DWN) は読んでいますか? Debian 界隈でおきていることについて書いて いる Debian Weekly News. 毎回読んでいるといろいろと分かっ て来ますが, 一人で読んでいても, 解説が少ないので, 意味がわからないところもあるかも知れません. みんなで DWN を読んでみま しょ う.」

An HTML snippet looks like this (there is an extra space in
"</span> <span" after "っ"):

class="dmjhira-10x-x-101">か</span><span class="dmjhira-10x-x-101">っ</span> <span class="dmjhira-10x-x-101">て</span><span

To generate the HTML I'm using a modified htlatex command, called
htplatex, which uses ptex-bin to process Japanese. The script is as
follows:

#!/bin/sh
#htplatex debianmeetingresume200708.tex jp,2,sections+
#apt-get install dvi2ps-fontdata-a2n dvi2dvi dvipng
if command -v xhtex > /dev/null 2>&1 ; then
    true
else
    export PATH=/usr/share/tex4ht:$PATH
fi
platex $5 '\makeatletter\def\HCode{\futurelet\HCode\HChar}\def\HChar{\ifx"\HCode\def\HCode"##1"[EMAIL PROTECTED]@[EMAIL PROTECTED],html]{tex4ht}}\let\HCode\documentstyle\def\documentstyle{\let\documentstyle\HCode\expandafter\def\csname tex4ht\endcsname{#1,html}\def\HCode####1{\documentstyle[tex4ht,[EMAIL PROTECTED] '$2'.a.b.c.\input ' $1
platex $5 '\makeatletter\def\HCode{\futurelet\HCode\HChar}\def\HChar{\ifx"\HCode\def\HCode"##1"[EMAIL PROTECTED]@[EMAIL PROTECTED],html]{tex4ht}}\let\HCode\documentstyle\def\documentstyle{\let\documentstyle\HCode\expandafter\def\csname tex4ht\endcsname{#1,html}\def\HCode####1{\documentstyle[tex4ht,[EMAIL PROTECTED] '$2'.a.b.c.\input ' $1
platex $5 '\makeatletter\def\HCode{\futurelet\HCode\HChar}\def\HChar{\ifx"\HCode\def\HCode"##1"[EMAIL PROTECTED]@[EMAIL PROTECTED],html]{tex4ht}}\let\HCode\documentstyle\def\documentstyle{\let\documentstyle\HCode\expandafter\def\csname tex4ht\endcsname{#1,html}\def\HCode####1{\documentstyle[tex4ht,[EMAIL PROTECTED] '$2'.a.b.c.\input ' $1
dvi2dvi -F a2n ${1/.tex/.dvi} > ${1/.tex/.dvi.tmp}
mv ${1/.tex/.dvi.tmp} ${1/.tex/.dvi}
tex4ht -f/$1 -itex4ht/ht-fonts/mozilla/nttsub/$3
t4ht -f/$1 $4 -dhtml/

-- System Information:
Debian Release: lenny/sid
  APT prefers unstable
  APT policy: (500, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.23-rc5dancer (PREEMPT)
Locale: LANG=ja_JP.EUC-JP, LC_CTYPE=ja_JP.EUC-JP (charmap=EUC-JP) (ignored: LC_ALL set to ja_JP.eucJP)
Shell: /bin/sh linked to /bin/bash

Versions of packages tex4ht depends on:
ii  libc6             2.6.1-2     GNU C Library: Shared libraries
ii  libkpathsea4      2007-14     TeX Live: path search library for
ii  tex4ht-common     20070904-1  LaTeX and TeX for Hypertext (HTML)
ii  texlive-base-bin  2007-14     TeX Live: Essential binaries

tex4ht recommends no packages.

-- no debconf information