Re: [R] Documenting data

2016-06-30 Thread Ista Zahn
he attributes system in R less useful for documenting data IMO. Best, Ista > > Kind regards > > Georg > >> Sent: Thursday, 30 June 2016 at 20:06 >> From: "Bert Gunter" >> To: g.maub...@gmx.de >> Cc: "Pito Salas" , "R Hel
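
The message above is truncated, so the reason Ista gives is not shown. One behaviour that fits the remark, offered here as an assumption rather than as Ista's argument, is that base R subsetting with `[` drops custom attributes from a plain vector. The attribute name `description` and the values are made up for illustration.

```r
# Attach a free-text description to a vector (hypothetical example data).
x <- c(12.1, 15.3, 9.8)
attr(x, "description") <- "Hypothetical measurement in cm"

# The description is attached to the original object.
attributes(x)

# Subsetting with `[` returns a plain vector: the custom attribute is gone.
y <- x[1:2]
attributes(y)
```

If documentation lives only in attributes, it has to be re-attached after steps like this, which is one argument for keeping it in a script or a text file instead.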

Re: [R] Documenting data

2016-06-30 Thread G . Maubach
"Pito Salas" , "R Help" > Subject: Re: [R] Documenting data > > I believe Georg's pronouncements are wrong. See inline below. > > -- Bert > > Bert Gunter > > "The trouble with having an open mind is that people keep coming along > a

Re: [R] Documenting data

2016-06-30 Thread Bert Gunter
I believe Georg's pronouncements are wrong. See inline below. -- Bert Bert Gunter "The trouble with having an open mind is that people keep coming along and sticking things into it." -- Opus (aka Berkeley Breathed in his "Bloom County" comic strip ) "... > Within R there are some limitations

Re: [R] Documenting data

2016-06-30 Thread G . Maubach
Hi Pito, Dear Readers, as others have already mentioned, there are good practices for documenting code and data. I would like to summarize them and add a few not mentioned earlier: 1. You should always have two things: your raw data and your R script/s. The raw data is immutable whereas the R sc

Re: [R] Documenting data

2016-06-30 Thread Bert Gunter
Private, since this is a trivial comment. Also, just my opinion, so feel free to ignore. Capture it, yes, but not necessarily as a function; just as a script might do, and the tools mentioned can do this. As others have said, your instincts are good, and you should just choose the methods that wor

Re: [R] Documenting data

2016-06-30 Thread Pito Salas
Thanks to you both. I think you’re saying/implying that once I “test drive” a particular bit of cleaning I should capture it in a function which does it reproducibly against the raw data, and that becomes the best documentation for it. That makes sense. Pito Salas Brandeis Computer Science Feld
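
A minimal sketch of the approach Pito summarizes: a cleaning step captured as a function that runs against the untouched raw file. The function name `clean_raw()`, the columns `id` and `weight_kg`, and both cleaning rules are assumptions made up for illustration; none of them come from the thread.

```r
# Hypothetical cleaning step captured as a function. It reads the raw file,
# never writes to it, and returns a cleaned data frame.
clean_raw <- function(raw_path) {
  raw <- read.csv(raw_path, stringsAsFactors = FALSE)
  # Assumed rule 1: drop rows with a missing weight.
  cleaned <- raw[!is.na(raw$weight_kg), , drop = FALSE]
  # Assumed rule 2: drop non-positive weights.
  cleaned <- cleaned[cleaned$weight_kg > 0, , drop = FALSE]
  rownames(cleaned) <- NULL
  cleaned
}

# Usage with a throwaway file standing in for the real raw data.
raw_path <- tempfile(fileext = ".csv")
writeLines(c("id,weight_kg", "1,70.5", "2,NA", "3,-4", "4,82"), raw_path)
cleaned <- clean_raw(raw_path)
print(cleaned)
```

Because the function only reads `raw_path`, rerunning it always starts from the same raw data, and the comments record why each row was dropped. The same steps would work equally well as a plain script.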

Re: [R] Documenting data

2016-06-30 Thread Bert Gunter
In addition to what others have suggested, see ?history. Cheers, Bert Bert Gunter "The trouble with having an open mind is that people keep coming along and sticking things into it." -- Opus (aka Berkeley Breathed in his "Bloom County" comic strip ) On Thu, Jun 30, 2016 at 7:51 AM, Pito Salas

Re: [R] Documenting data

2016-06-30 Thread Ulrik Stervbo
Vince Buffalo covers this nicely in his book "Bioinformatics Data Skills". The original data should stay the original data, i.e. it is immutable, and Vince then suggests that you have a text file in your data directory where you explain where the data came from and which scripts you used to create a modi
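
A rough sketch of such a text file, written from R. The directory layout, file names, and README wording are all hypothetical placeholders and are not taken from the book or the thread.

```r
# Hypothetical data directory (a temporary one here, so the sketch is safe to run).
data_dir <- file.path(tempdir(), "data")
dir.create(data_dir, showWarnings = FALSE)

# Plain-text notes: where each file came from and which script produced it.
readme <- c(
  "raw/survey.csv",
  "  source: (fill in where the file was obtained)",
  paste("  downloaded:", format(Sys.Date())),
  "derived/survey_clean.csv",
  "  created by: scripts/clean_survey.R from raw/survey.csv"
)
readme_path <- file.path(data_dir, "README.txt")
writeLines(readme, readme_path)

# Show the notes as they would appear in the file.
cat(readLines(readme_path), sep = "\n")
```

Keeping the notes as plain text next to the data means they can be read without R and versioned together with the scripts.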

Re: [R] Documenting data

2016-06-30 Thread Robert Baer
You might look at: http://stackoverflow.com/questions/7979609/automatic-documentation-of-datasets You might also try File | Compile Notebook from within RStudio (https://www.rstudio.com/) on your well-documented R scripts to get a nice reproducible recording/report of data analysis wor

Re: [R] Documenting data

2016-06-30 Thread Christopher W Ryan
Pito-- You describe excellent practices. The R code itself, saved as a script, provides some documentation of how you got from original data to wherever you are. Use # comments liberally. Whenever possible, save your raw data, however it was when you got it--avoid changing it--make all the c