he attributes system in R less useful for documenting data IMO.
Best,
Ista
>
> Kind regards
>
> Georg
>
>> Gesendet: Donnerstag, 30. Juni 2016 um 20:06 Uhr
>> Von: "Bert Gunter"
>> An: g.maub...@gmx.de
>> Cc: "Pito Salas" , "R Hel
"Pito Salas" , "R Help"
> Betreff: Re: [R] Documenting data
>
> I believe Georg's pronouncements are wrong. See inline below.
>
> -- Bert
>
> Bert Gunter
>
> "The trouble with having an open mind is that people keep coming along
> a
I believe Georg's pronouncements are wrong. See inline below.
-- Bert
Bert Gunter
"The trouble with having an open mind is that people keep coming along
and sticking things into it."
-- Opus (aka Berkeley Breathed in his "Bloom County" comic strip )
"...
> Within R there are some limitations
Hi Pito,
Dear Readers,
as other have already mentioned, there are good practices for documenting code
and data. I would like to summarize them and add a few not mentioned earlier:
1. You should have always two things: your raw data and your R script/s. The
raw data is immutable whereas the R sc
Private, since this is a trivial comment. Also, just my opinion, so
feel free to ignore.
Capture it, yes, but not necessarily as a function; just as a script
might do, and the tools mentioned can do this. As others have said,
your instincts are good, and you should just choose the methods that
wor
Thanks to you both. I think you’re saying/implying that once I “test drive” a
particular bit of cleaning I should capture it in a function which does it
reproducibly against the raw data, and that becomes the best documentation for
it. That makes sense.
Pito Salas
Brandeis Computer Science
Feld
In addition to what others have suggested, see ?history.
Cheers,
Bert
Bert Gunter
"The trouble with having an open mind is that people keep coming along
and sticking things into it."
-- Opus (aka Berkeley Breathed in his "Bloom County" comic strip )
On Thu, Jun 30, 2016 at 7:51 AM, Pito Salas
Vince Buffalo has covers this nicely in his book "Bioinformatics Data
Skills". The original data should stay the original data is immutable and
Vince then suggests that you have a text file in your data directory where
you explain where the data came from and which scripts you used to create a
modi
You might look at:
http://stackoverflow.com/questions/7979609/automatic-documentation-of-datasets
You might also, try the FIle | Compile Notebook from within R-Studio
(https://www.rstudio.com/) on your well-documented R-scripts to get a
nice reproducible recording/report of data analysis wor
Pito--
You describe excellent practices.
The R code itself, saved as a script, provides some documentation of how you
got from original data to wherever you are.
Use # comments liberally.
Whenever possible, save your raw data, however it was when you got it--avoid
changing it--make all the c
10 matches
Mail list logo