Thanks to you both. I think you’re saying/implying that once I “test drive” a
particular bit of cleaning I should capture it in a function which does it
reproducibly against the raw data, and that becomes the best documentation for
it. That makes sense.
Pito Salas
Brandeis Computer Science
calculations, or whatever of the data is. After all it is now
derived from my raw data (which may have been well documented) but it is “new.”
Is this a real problem? Is there a “best practice” to address this?
Thanks!
Pito Salas
Brandeis Computer Science
Feldberg 131
2 matches
Mail list logo