Thank you! I greatly appreciate your consideration, though of course it is up 
to you. I think many people switch to stringr/stringi simply because functions 
in those packages have some consistent design choices, for example, they do not 
drop empty/missing matches, which facilitates array-based programming. For 
example, in the cases where one needs to make a new column in a data.frame 
(data.table, tibble, etc.) of regex extractions. Or in any other case where 
there needs to be an element-wise correspondence between input and output. I 
think insertion of NA_character_ to prevent dropping indices seems like the 
natural choice for an array language (which, I think, motivated the creation of 
stringr/stringi). While those are great packages and this behavior can be 
easily replicated with simple wrappers, string operations are normally easy to 
accomplish in base languages, so this seems like something that would be 
appropriate to have in base. For example, MATLAB and Pandas regex both allow
  non-dropping empty matches (though of course I acknowledge Pandas is not a 
base language).

Best,
CG

______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel

Reply via email to