Re: [R] md5sum issues

2021-02-03 Thread Jeff Newmiller
Well, you can use binary input files like RDS, qs, or parquet. But you already have your code and data in Git, so checking your input is redundant... just put in a binary output reference file and a test that verifies it. On February 3, 2021 8:25:33 AM PST, Ivan Calandra wrote: >Dear Jeff, > >I
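The "binary output reference file and a test that verifies it" idea can be sketched as follows. This is only an illustration under stated assumptions: `analyse()` and the file name are hypothetical stand-ins for the real analysis step and the reference file committed to Git, not anything named in the thread.

```r
# Hypothetical analysis step; stands in for whatever the real script computes.
analyse <- function(x) data.frame(n = length(x), mean = mean(x))

# Done once: store the accepted result as a binary reference file.
# (Assumption: in a real project this .rds file would be committed to Git.)
ref_file <- file.path(tempdir(), "reference_output.rds")
saveRDS(analyse(c(1, 2, 3, 4)), ref_file)

# Done on every run: recompute and compare against the stored reference.
result    <- analyse(c(1, 2, 3, 4))
reference <- readRDS(ref_file)
stopifnot(isTRUE(all.equal(result, reference)))
```

Because the comparison is made on the R objects rather than on file bytes, it does not depend on how text files are stored on a given platform.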

Re: [R] md5sum issues

2021-02-03 Thread Duncan Murdoch
On 03/02/2021 11:15 a.m., Jeff Newmiller wrote: This CR vs LF vs CRLF newline discrepancy has been around since the 70s and the CP/M operating system. And it remains an issue in over-the-wire internet text protocols today, which actually use the CRLF version like Windows. Sorry, UNIX... world

Re: [R] md5sum issues

2021-02-03 Thread Ivan Calandra
Dear Jeff, If I understood you correctly, it makes sense that I explain more about my goal here: I am trying to find ways to have analyses that are as reproducible as possible (knowing that it is not going to be perfect). One part is to show which file(s) I use as input and what output was c

Re: [R] md5sum issues

2021-02-03 Thread Jeff Newmiller
This CR vs LF vs CRLF newline discrepancy has been around since the 70s and the CP/M operating system. And it remains an issue in over-the-wire internet text protocols today, which actually use the CRLF version like Windows. Sorry, UNIX... world domination of LF encoding failed. The problem wit

Re: [R] md5sum issues

2021-02-03 Thread Ivan Calandra
Thank you very much Duncan for your help. I'll try that. Best, Ivan -- Dr. Ivan Calandra TraCEr, laboratory for Traceology and Controlled Experiments MONREPOS Archaeological Research Centre and Museum for Human Behavioural Evolution Schloss Monrepos 56567 Neuwied, Germany +49 (0) 2631 9772-243 h

Re: [R] md5sum issues

2021-02-03 Thread Duncan Murdoch
On 03/02/2021 4:42 a.m., Ivan Calandra wrote: Thank you Ivan and Duncan for your help. I understand your point Duncan, but the thing is that I do have an issue here. Is it then due to RStudio or even Windows? If it is, I can forget about a solution on that end, so I would focus on what I can do,

Re: [R] md5sum issues

2021-02-03 Thread Ivan Calandra
Thank you Ivan and Duncan for your help. I understand your point Duncan, but the thing is that I do have an issue here. Is it then due to RStudio or even Windows? If it is, I can forget about a solution on that end, so I would focus on what I can do, and this Git setting seems to be the best p

Re: [R] Alternative to mapply to select samples

2021-02-03 Thread PIKAL Petr
Hi

I am not sure if I understand your function but simple mapply gives you probably the same result and may be quicker.

> set.seed(111)
> blf <- bl_func()
> set.seed(111)
> blm <- mapply(sample, bl, kn, replace=TRUE)
> all.equal(blf, blm)
[1] TRUE
>

Cheers
Petr

> -----Original Message-----
>
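A self-contained sketch of the `mapply(sample, bl, kn, replace=TRUE)` call, since `bl`, `kn` and `bl_func()` are not shown in this preview. The values of `bl` (a list of vectors to sample from) and `kn` (one sample size per element) are assumptions made up for illustration, and the explicit loop is only a stand-in for comparison, not the original poster's function.

```r
# Assumed inputs: a list of vectors and one sample size per list element.
bl <- list(a = 1:10, b = 101:120, c = 1001:1005)
kn <- c(3, 5, 2)

# mapply() calls sample(bl[[i]], kn[i], replace = TRUE) for each i.
set.seed(111)
via_mapply <- mapply(sample, bl, kn, replace = TRUE)

# The same draws written as an explicit loop, for comparison.
set.seed(111)
via_loop <- vector("list", length(bl))
for (i in seq_along(bl)) {
  via_loop[[i]] <- sample(bl[[i]], kn[i], replace = TRUE)
}
names(via_loop) <- names(bl)

all.equal(via_mapply, via_loop)
```

Note that `mapply()` simplifies its result to a matrix when all sample sizes are equal; with unequal sizes, as here, it returns a list.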

Re: [R] md5sum issues

2021-02-03 Thread Duncan Murdoch
On 03/02/2021 2:14 a.m., Ivan Krylov wrote: On Tue, 2 Feb 2021 17:01:05 +0100 Ivan Calandra wrote: This happens to all text-based files (Rmd, MD, CSV...) but not to non-editable files (PDF, XLSX...). This is probably caused by Git helpfully converting text files from LF (0x0A) line endings t
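A minimal sketch of why a line-ending conversion changes the checksum of a text file even though its visible content is the same. The file contents are made up for illustration, and `md5_normalized()` is a hypothetical helper (not part of the `tools` package) showing one possible way to hash the content independently of the line-ending style.

```r
# Write the same two lines once with LF and once with CRLF line endings.
lf_file   <- tempfile(fileext = ".csv")
crlf_file <- tempfile(fileext = ".csv")
writeBin(charToRaw("a,b\n1,2\n"), lf_file)
writeBin(charToRaw("a,b\r\n1,2\r\n"), crlf_file)

# The byte content differs, so the MD5 sums differ.
sums <- unname(tools::md5sum(c(lf_file, crlf_file)))
sums[1] == sums[2]

# readLines() accepts LF, CRLF or CR as end-of-line, so the lines are identical.
identical(readLines(lf_file), readLines(crlf_file))

# Hypothetical helper: rewrite the lines with LF only, then hash that copy.
md5_normalized <- function(path) {
  tmp <- tempfile()
  on.exit(unlink(tmp))
  con <- file(tmp, open = "wb")   # binary mode: no newline translation on write
  writeLines(readLines(path, warn = FALSE), con, sep = "\n")
  close(con)
  unname(tools::md5sum(tmp))
}

md5_normalized(lf_file) == md5_normalized(crlf_file)
```

This normalisation ignores whether the file ends with a final newline and is only suitable for text files; binary files such as PDF or XLSX should be hashed as they are.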