Re: [tor-dev] Proposal 285: Directory documents should be standardized as UTF-8

2018-01-09 Thread Alex Xu
Quoting teor (2018-01-10 00:19:54)
> These are called "Unicode Scalar Values".
> https://www.unicode.org/glossary/#unicode_scalar_value
>
> Let's reference that.

"Unicode Scalar Value" includes U+0, which I think we probably want to exclude.

> >   * each encoded with the shortest possible e

Re: [tor-dev] Proposal 285: Directory documents should be standardized as UTF-8

2018-01-09 Thread teor
> On 10 Jan 2018, at 04:34, Nick Mathewson wrote:
>
> On Mon, Nov 13, 2017 at 5:28 PM, teor wrote:
>> On 14 Nov 2017, at 05:51, Nick Mathewson wrote:
>>
>> Filename: 285-utf-8.txt
>> Title: Directory documents should be standardized as UTF-8
>> Author: Nick Mathewson
>> Created: 13 November

Re: [tor-dev] Proposal 285: Directory documents should be standardized as UTF-8

2018-01-09 Thread Nick Mathewson
Hi, Teor, and sorry for the long delay!

You had a lot of good questions on this proposal, and I didn't know how to answer them all. So in hopes of making progress here, I'm taking wild guesses and asking for help in making the wild guesses better :)

On Mon, Nov 13, 2017 at 5:28 PM, teor wrote:

Re: [tor-dev] Proposal 285: Directory documents should be standardized as UTF-8

2018-01-09 Thread Nick Mathewson
On Fri, Nov 24, 2017 at 4:05 PM, chelsea komlo wrote:
> It is great that we are identifying places to improve support for Rust in
> Tor.
>
> Along this same line of thinking, are there other places in Tor where we
> will need to move to supporting UTF-8? For example, should the statefile be
> UTF-

Re: [tor-dev] Proposal 285: Directory documents should be standardized as UTF-8

2017-11-24 Thread chelsea komlo
It is great that we are identifying places to improve support for Rust in Tor.

Along this same line of thinking, are there other places in Tor where we will need to move to supporting UTF-8? For example, should the statefile be UTF-8 also?

On 11/13/2017 01:51 PM, Nick Mathewson wrote:
> Filename:

Re: [tor-dev] Proposal 285: Directory documents should be standardized as UTF-8

2017-11-13 Thread teor
> On 14 Nov 2017, at 05:51, Nick Mathewson wrote:
>
> Filename: 285-utf-8.txt
> Title: Directory documents should be standardized as UTF-8
> Author: Nick Mathewson
> Created: 13 November 2017
> Status: Open
>
> 1. Summary and motivation
>
>   People frequently want to include non-ASCII text in

[tor-dev] Proposal 285: Directory documents should be standardized as UTF-8

2017-11-13 Thread Nick Mathewson
Filename: 285-utf-8.txt
Title: Directory documents should be standardized as UTF-8
Author: Nick Mathewson
Created: 13 November 2017
Status: Open

1. Summary and motivation

People frequently want to include non-ASCII text in their router descriptors. The Contact line is a favorite place to d