Re: [R-pkg-devel] Check Error Due to Unicode in Documentation

Duncan Murdoch Thu, 23 Jul 2020 13:59:19 -0700

On 23/07/2020 4:14 p.m., b...@denney.ws wrote:

Hello,

I have a personal package that I�d eventually like to clean up and either
find other packages to be homes for the functions or perhaps eventually
release it on CRAN.  To that end, I try to keep package checks working.

One of the functions that I use is to try to simplify Unicode text to ASCII.
With that, I tend to receive data that is scientifically-focused to the mu
character should be converted to a �u� instead of the standard conversion to
�m�.  On top of that, there are at least two Unicode characters that are
visually the mu character, one is the micro character and the other is an
actual lowercase mu.  This function converts both of those to �u� as
desired.

I generate the documentation using roxygen2, but the text in the
documentation aligns with the expected Unicode character, so I think the
issue is not with roxygen.

The issue is that Codoc gives the following error:

* checking for code/documentation mismatches ... WARNING

Codoc mismatches from documentation object 'unicode_to_ascii':

unicode_to_ascii.character

   Code: function(x, verbose = FALSE, pattern = c("μ", "µ"), replacement

                  = c("u", "u"), general_

But, the code and documentation appear to be the same.  I think that the
issue relates to something with Unicode support in Codoc, but I�m not sure
how to test for that.  The code is here:

https://github.com/billdenney/bsd.report/blob/454caf217c5b333af1d65c7e63bbad
4194320e07/R/unicode_to_ascii.R#L28-L31

And the documentation is here:

https://github.com/billdenney/bsd.report/blob/454caf217c5b333af1d65c7e63bbad
4194320e07/man/unicode_to_ascii.Rd#L17-L24

Do you have any suggestions on how to make this code/documentation work with
Codoc?

If you change the source to include the explicit characters (i.e. usepattern = c("μ", "µ") instead of pattern=c("\u03bc", "\u00b5")), doesthat help?

It may cause other issues: WRE recommends against including UTF-8 charsin source code.

If that doesn't solve the problem, then it looks like an issue withRoxygen2. I don't know if there's a way to tell it not to convert \uescapes into the corresponding character. If there isn't, it seems likethat's something they should add. As a workaround, is there a way tosay that this one particular .Rd file should be edited by hand, insteadof auto-generated?


Duncan Murdoch

______________________________________________
R-package-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-package-devel

Re: [R-pkg-devel] Check Error Due to Unicode in Documentation

Reply via email to