On Wed, 29 Dec 2010, Kevin R. Coombes wrote:
Hi,
I have a data frame that includes several names that (if typeset correctly)
require accented characters not available in the ASCII character set.
I'd like to include this data frame as example data in an R package. I'd
also like the R CMD check warning about the use of non-ASCII characters to go
away, in part so I could submit the package somewhere that wouldn't balk at
the presence of the warning. (I gather from older posts that there are
environment variables to skip this check. Those will work for me personally
but will not necessarily appease the maintainers of sites like CRAN where I
might want to submit the package.)
Is there any way to use the correctly accented characters by setting a
different character encoding or equivalent for the data frame? Or am I forced
to remove the offending accents in order to be ASCII-pure and thus leave
people and places with an incorrect representation of their names?
The latter is inevitable. There is no encoding that will work
correctly for everyone (see 'Writing R Extensions' ยง1.7.1): e.g.
Chinese Windows users have only ASCII and Chinese characters (and only
one of two sets of Chinese characters). Again, good practice and
compromises are discussed in 'Writing R Extensions' -- these days
using UTF-8 will do a good job for most R users.
--
Brian D. Ripley, rip...@stats.ox.ac.uk
Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel: +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UK Fax: +44 1865 272595
______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel