Package: tex4ht
Version: 20030426-3
Severity: normal

Incorrect encoding of \oe{} in htlatex OpenOffice export when the LaTeX package "manuscript" is used.

The following example generates an incorrect encoding for \oe{}, which prevents OO.o from loading the resulting "content.xml" file.

The manuscript.sty used here was posted on comp.text.tex on 23 March 1998; I was unable to find a more recent version. (The one on CTAN failed to generate the proper fixed-font, double-spaced output with regular tex-to-dvi processing.)

>>> manuscript.sty header
%%% test version for a style producing manuscript-like output.
%%% Stefan Ulrich <[EMAIL PROTECTED]>, with
%%% suggestions by Rowland <[EMAIL PROTECTED], no animal>
%%% and modifications by Steve Kilbane <[EMAIL PROTECTED]>
>>> end manuscript.sty header

Simple test case tex document:

>>> testcase.tex
\documentclass{book}
\usepackage[submit]{manuscript}
\title{Test case for oe ligature}
\usepackage[T1]{fontenc}
\usepackage[latin1]{inputenc}
\usepackage[frenchb]{babel}
\FrenchLayout
\begin{document}
Rodrigue, as-tu du c\oe{}ur ?\\
Tout autre que mon père l'éprouverait sur l'heure
\end{document}
>>> end testcase.tex

Command issued:

htlatex testcase.tex "xhtml,ooffice" "ooffice/! -cmozhtf" "-coo"

With the manuscript package used as above, the \oe{} is incorrectly encoded as a single char (line 24):

>>> resulting content.xml subdocument, line numbers added
01:<?xml version="1.0" encoding="UTF-8"?>
02:<!DOCTYPE office:document-content PUBLIC "-//OpenOffice.org//DTD OfficeDocument 1.0//EN" "office.dtd">
03:<office:document-content
04:xmlns:chart="http://openoffice.org/2000/chart"
05:xmlns:dr3d="http://openoffice.org/2000/dr3d"
06:xmlns:draw="http://openoffice.org/2000/drawing"
07:xmlns:form="http://openoffice.org/2000/form"
08:xmlns:number="http://openoffice.org/2000/datastyle"
09:xmlns:office="http://openoffice.org/2000/office"
10:xmlns:script="http://openoffice.org/2000/script"
11:xmlns:style="http://openoffice.org/2000/style"
12:xmlns:table="http://openoffice.org/2000/table"
13:xmlns:text="http://openoffice.org/2000/text"
14:xmlns:fo="http://www.w3.org/1999/XSL/Format"
15:xmlns:math="http://www.w3.org/1998/Math/MathML"
16:xmlns:svg="http://www.w3.org/2000/svg"
17:xmlns:xlink="http://www.w3.org/1999/xlink"
18:office:class="text" office:version="1.0" >
19:<office:automatic-styles> <style:style style:name="tabular" style:family="table">
20:<style:properties style:rel-width="76%" fo:margin-left="1cm" fo:margin-right="1cm" table:align="margins"/>
21:</style:style>
22:
23:</office:automatic-styles> <office:body>
24:<!--l. 13--><text:p text:style-name="p-noindent">Rodrigue, as-tu du c÷ur?
25:<text:line-break/>Tout autre que mon père l'éprouverait sur l'heure
26: </text:p>
27:</office:body>
28:</office:document-content>
>>> end content.xml subdocument

Document fails to load in OO.o 1.1.4 with the following error:

"read error - format error in subdocument content.xml, position 24,69"

The encoding is correct when the manuscript package is not used:

>>> resulting content.xml subdocument, excerpt
24:<!--l. 13--><text:p text:style-name="p-noindent">Rodrigue, as-tu du cœur ?
>>> end content.xml subdocument excerpt

-- System Information:
Debian Release: testing/unstable
  APT prefers unstable
  APT policy: (500, 'unstable'), (1, 'experimental')
Architecture: i386 (i686)
Kernel: Linux 2.6.6
Locale: [EMAIL PROTECTED], [EMAIL PROTECTED] (charmap=ISO-8859-15) (ignored: LC_ALL set to [EMAIL PROTECTED])

Versions of packages tex4ht depends on:
ii  libc6         2.3.2.ds1-20  GNU C Library: Shared libraries an
ii  libkpathsea3  2.0.2-26      path search library for teTeX (run
ii  tetex-bin     2.0.2-26      The teTeX binary files

-- no debconf information
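
A possible way to confirm where the file breaks, offered as a rough diagnostic sketch rather than a fix: the Python helper below reports the first byte in a file that is not valid UTF-8, with its line number and the number of characters before it on that line. The function names are hypothetical and not part of tex4ht. It assumes that the load failure comes from a raw, non-UTF-8 byte in a file declared as UTF-8, which is unconfirmed. One reading consistent with the listing: in the T1 font encoding, œ occupies slot 0xF7, and byte 0xF7 displays as ÷ in ISO-8859-1/15. In the listing above, 69 characters precede the ÷ on line 24, which would match the reported position 24,69 if OO.o counts columns that way.

```python
def first_invalid_utf8(data: bytes):
    """Return (line, column, byte) for the first byte that breaks UTF-8
    decoding, or None if the data is valid UTF-8.

    line is 1-based; column is the number of characters that precede the
    offending byte on its line (a 0-based character offset).
    """
    try:
        data.decode("utf-8")
        return None
    except UnicodeDecodeError as err:
        start = err.start
        line = data.count(b"\n", 0, start) + 1
        line_start = data.rfind(b"\n", 0, start) + 1
        # Everything before err.start decoded cleanly, so this slice is valid.
        column = len(data[line_start:start].decode("utf-8"))
        return line, column, data[start]


def check_file(path: str):
    """Hypothetical convenience wrapper: run the check on a file on disk."""
    with open(path, "rb") as handle:
        return first_invalid_utf8(handle.read())
```

Calling `check_file("content.xml")` on the generated subdocument should return `None` for a well-formed file; a result such as `(24, 69, 247)` would point at a raw 0xF7 byte at the position OO.o complains about.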