Package: lifelines
Version: 3.0.50-2

Hi,

I'm trying to export my database in GEDCOM format.  My internal codeset
is set to UTF-8, but the CHAR line in the GEDCOM file always seems to be
ASCII.  The character set actually used for the text seems to be the
same as the internal one.

I've tried various things, like setting GedcomCodeset to UNICODE or
even UTF-8, but the CHAR line always says ASCII, even when the file
contains, for instance, UTF-8 text.
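
To show what I mean by the mismatch, here is a small Python sketch that
compares the CHAR value declared in a GEDCOM file with what the bytes
actually look like.  It is not part of lifelines; the function name
declared_and_actual and the sample record are made up for illustration,
and it only distinguishes ASCII, valid UTF-8, and "anything else".

```python
import re

def declared_and_actual(data: bytes):
    """Return (declared CHAR value or None, 'ascii' | 'utf-8' | 'other')."""
    # Search the raw bytes for the header line "1 CHAR <value>" so that no
    # encoding has to be guessed before the declaration is found.
    m = re.search(rb"^\s*1\s+CHAR\s+(\S+)", data, re.MULTILINE)
    declared = m.group(1).decode("ascii", "replace") if m else None

    # Classify the actual content: pure ASCII, else valid UTF-8, else other.
    try:
        data.decode("ascii")
        actual = "ascii"
    except UnicodeDecodeError:
        try:
            data.decode("utf-8")
            actual = "utf-8"
        except UnicodeDecodeError:
            actual = "other"
    return declared, actual

# Hypothetical sample resembling the export described above: the header
# claims ASCII while the name contains UTF-8 encoded characters.
sample = (
    "0 HEAD\n1 CHAR ASCII\n0 @I1@ INDI\n1 NAME Zoë /Müller/\n0 TRLR\n"
).encode("utf-8")

print(declared_and_actual(sample))
```

A result where the first element is ASCII and the second is utf-8 is
the situation I am seeing with my exported file.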

When reading the same file back in without setting GedcomCodeset, it
seems to be read as latin1; setting GedcomCodeset to UTF-8 seems to
read it back in properly.

I've been reading the standard, which says CHAR could be set to ASCII,
ANSEL or UNICODE, but it is rather unclear about how UNICODE should be
encoded.  From the explanation it looks like it should be UTF-16,
and could be either UTF-16BE or UTF-16LE.
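
If UNICODE does mean UTF-16, a reader would have to work out the byte
order itself.  A rough Python sketch of one way to do that follows.  It
assumes the file either starts with a byte order mark or, failing that,
with the digit "0" of the "0 HEAD" line; the function name
guess_utf16_order is hypothetical and this is not how lifelines does it.

```python
def guess_utf16_order(data: bytes):
    """Guess the UTF-16 byte order of a GEDCOM file, or return None."""
    # A byte order mark, if present, settles the question directly.
    if data.startswith(b"\xff\xfe"):
        return "utf-16-le"
    if data.startswith(b"\xfe\xff"):
        return "utf-16-be"
    # Without a BOM, rely on the file starting with the ASCII digit "0"
    # (0x30): in UTF-16 its zero byte comes second for LE, first for BE.
    if data[:2] == b"0\x00":
        return "utf-16-le"
    if data[:2] == b"\x000":
        return "utf-16-be"
    # Neither pattern matched, so this is probably not UTF-16 at all.
    return None

print(guess_utf16_order("0 HEAD\n".encode("utf-16-le")))
print(guess_utf16_order("0 HEAD\n".encode("utf-16-be")))
print(guess_utf16_order(b"0 HEAD\n"))
```

The last call is a plain single-byte file, for which the sketch returns
None rather than guessing.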

The draft 5.5.1, on the other hand, also has UTF-8 as an option.

It would be nice if either UNICODE or UTF-8 was properly supported in
the gedcom format.


Kurt


