Re: [R] R-code in html help pages: syntax highlighting

Romain Francois Mon, 16 Mar 2009 23:58:38 -0700

Duncan Murdoch wrote:

On 16/03/2009 5:06 PM, Romain Francois wrote:
hadley wickham wrote:
It would be pretty easy to use the output from the R parser (whichis neverwrong, is it?), and dump some markup out of it. For example theshowTreefunction in codetools dumps an R expression as Lisp, this is nottoo far
from generating html, or any other markup.
As this sounds like fun, I'll volunteer to do something about this.Anotheradvantage is that we can imagine to plug hyperlinks in R code thatlives in
html help pages.
This also sounds like a good idea for a google summer of code project
- that way you might be able to get a student to give you a hand as
well.

Hadley
That did cross my mind earlier this evening, it just seems a bit tooeasy to last all summer, but maybe I am missing something difficult.I will start to play with this over the next few days, and make up mymind.
It depends on your standards. You said you want R to parse the codein the Rd file. That's going to be hard, because Rd files containsomething that is only "R-like", as far as the parser is concerned.You'll need to convert it into R code before you can pass it to the Rparser.

I would assume this would be outsourced to the experimental parse_Rdfunction

And then there's the question of scoping, which gets into theevaluator, not just the parser. (The parser only recognizes "mean" asan identifier; it's the evaluator that decides whether it's thefunction in the base package or a local variable.)

That is an issue. I guess I will fall back on what the parser says andinfer on the scoping. Within the lines below, mean would be differenteach time


mean( 1:10 )
lapply( 1:10, mean)
mean <- (1+4) / 2
lapply( list( mean, median), function( f ) f( 1:10) )
{ mean <- median; mean( 1:10 ) }

So if you've got high standards, it's probably quite hard. On theother hand, if you're willing to accept the usual sort of errors thatsyntax highlighters make, it's not so bad, but not trivial.

There is probably some middle ground between the job an highlighterwould do, and the way the R evaluator would think the expressioneventually. Given that this is more a nice to have feature, I guess wecan accept some errors. checkUsage is wrong sometimes, but it is still agood tool.

One of the problem I might run into is performance, if we want thisto treat all Rd files, we are going to want something very efficient,and it might not be enough to build on top of codetools (which usesrecursion at the R level) , but could make sense to provide a C levelimplementation.
Remember what Knuth said about premature optimization. Write it firstin R, and only optimize it if it's not fast enough.


Deal

(I'd guess it'll be fast enough: Brian Ripley reported that all the Rcode he wrote for conversions in R-devel was faster than the Perl codeit was replacing.)


That is good news

This could lead to interesting things as:
- syntax highlighting in sweave (or decumar)
- pretty printing in the console (using ansi characters)
- syntax highlighting in R help files, potentially with hyperlinks

I have requested creation of a project on r-forge. Anyone else wantto play with this ?


I'll sign up once it's going.

Duncan Murdoch



--
Romain Francois
Independent R Consultant
+33(0) 6 28 91 30 30
http://romainfrancois.blog.free.fr

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] R-code in html help pages: syntax highlighting

Reply via email to