I just got this bug report in the Debian BTS.
--
Madarasz Gergely [EMAIL PROTECTED] [EMAIL PROTECTED]
It's practically impossible to look at a penguin and feel angry.
Egy pingvinre gyakorlatilag lehetetlen haragosan nezni.
HuLUG: http://mlf.linux.rulez.org/
---------- Forwarded message ----------
Date: Mon, 31 Jan 2000 17:13:54 +0100
From: Florian Hars <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Subject: Bug#56721: htdig and locale de_DE peculiarities.
Resent-Date: Mon, 31 Jan 2000 16:18:02 +0000 (GMT)
Resent-From: Florian Hars <[EMAIL PROTECTED]>
Resent-To: [EMAIL PROTECTED]
Resent-cc: Gergely Madarasz <[EMAIL PROTECTED]>
Package: htdig
Version: 3.1.4-1
This is probably for upstream.
I use htdig with a locale: de_DE setting. It seems unable to find
occurrences of words containing non-ASCII characters that are part of
titles, <Hn> or emphasis elements. For example, if I search for "bég" in my
data, it finds an index.html document that contains the line
<a href="beg-islamabad-1990.html">Bég 1991: From the Quark
Model to the Stand...</a>
but not the document beg-islamabad-1990.html itself, which starts with:
<html><head><title>Bég 1991: From the Quark Model to the
Stand...</title>
<body>
<h1>Mirza Abdul Baqi <strong>Bég</strong>: From the Quark Model
to the Standard Model: Ten Fateful Years in Particle Physics (1964--74
C.\,E.).</h1>
<p>Mirza Abdul Baqi <strong>Bég</strong> (1991): <em>From the
Quark Model to the Standard Model: Ten Fateful Years in Particle
Physics (1964--74 C.\,E.).</em>
It also doesn't find another document containing
<p><a href="beg-islamabad-1990.html">Mirza Abdul Baqi
<strong>Bég</strong>: <em>From the Quark Model to the
Stand...</em> 221-284</a></p>
although it finds both documents if I look for "Mirza".
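In case it helps with reproducing this, here is a rough Python sketch that writes a two-page fixture modelled on the excerpts above and computes which pages a correct search for "bég" should return. The ISO-8859-1 encoding, the shortened page text, and the helper names write_fixture and files_containing are assumptions made for illustration; they are not taken from htdig or from the actual site.

```python
import tempfile
from pathlib import Path

# Assumption: the pages are encoded as ISO-8859-1 (a common choice for de_DE).
ENCODING = "iso-8859-1"
WORD = "B\u00e9g"  # "Beg" with an e-acute, as in the report

# Shortened stand-ins for the pages quoted in the report (not the real files).
PAGES = {
    "index.html": (
        "<html><head><title>Index</title></head><body>\n"
        f'<a href="beg-islamabad-1990.html">{WORD} 1991: From the Quark Model</a>\n'
        "</body></html>\n"
    ),
    "beg-islamabad-1990.html": (
        f"<html><head><title>{WORD} 1991: From the Quark Model</title></head>\n"
        "<body>\n"
        f"<h1>Mirza Abdul Baqi <strong>{WORD}</strong>: From the Quark Model</h1>\n"
        "</body></html>\n"
    ),
}


def write_fixture(directory):
    """Write the test pages into `directory` and return their paths."""
    paths = []
    for name, text in PAGES.items():
        path = Path(directory) / name
        path.write_bytes(text.encode(ENCODING))
        paths.append(path)
    return paths


def files_containing(word, paths):
    """Naive case-insensitive substring search, used as the expected-hits baseline."""
    needle = word.lower()
    return sorted(
        p.name for p in paths
        if needle in p.read_bytes().decode(ENCODING).lower()
    )


if __name__ == "__main__":
    fixture_dir = tempfile.mkdtemp()
    pages = write_fixture(fixture_dir)
    print("fixture written to", fixture_dir)
    print("pages that should match:", files_containing(WORD, pages))
```

Indexing the generated directory with the locale: de_DE setting and comparing the search results against this baseline should show whether the page with the word in its title and <h1> is still missing.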
Yours, Florian.
--
+ when hideous hordes of web designers will leave ripped bloodless bodies of
hosts they parasited upon and convulsively start tearing limbs of each other
in agony illuminated by artificial light [...], then we know that time has
come for [...] -- www.absurd.org