On 2/16/21, Dominik Pantůček <[email protected]> wrote:
>
> On 16. 02. 21 22:27, Matthew Flatt wrote:
>> At Tue, 16 Feb 2021 16:03:29 -0500, Ben Greenman wrote:
>>> Sadly, I've already compressed a few files using
>>> `call-with-output-string` ... is there an easy way to decompress those
>>> / undo the UTF-8 encoding?
>>
>> Unfortunately, the underlying `get-output-string` conversion is lossy,
>> because bytes that don't form a UTF-8 encoding are converted to U+FFFD.
>>
>> (I see that the docs say #\? instead of #\uFFFD, and I'll fix the docs.)
>>
>
> #\uFFFD is #\� (bytes EF BF BD in UTF-8)
>
> For those who do not see it (I suspect encoding issues) it is a white
> question mark on black vertically elongated hexagon.
>
> And actually Racket REPL in my terminal displays it like this (7.9 BC,
> 8.0 CS, both on Ubuntu 20.04 in GNOME terminal).

Alas, I see lots of those question mark hexagons in my data. Good to know.

-- 
You received this message because you are subscribed to the Google Groups 
"Racket Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/racket-users/CAFUu9R5b5Y1kFQ0JuTtrTO-L14xFndABn-cSm44VB5s6H9wDRA%40mail.gmail.com.

Reply via email to