Hey,

I'm using R as a pre-processor for a large dataset whose IDs are
numeric (but have no numeric meaning, so they can be treated as factors).
I do some data formatting and then write the result out to a csv file.

The problem is that the IDs are very long, 18-22 characters to be
precise. R keeps rewriting these IDs in the abbreviated +eX (scientific)
notation, which stops me from exporting the data to the csv, since the IDs
are no longer intact.
I've tried telling R that the ID column is a factor, but this results in two
problems: 1) Since I have millions of rows and R is slower at handling factors
than numbers, my computer can't run the process in any reasonable time,
and 2) some IDs STILL seem to be rewritten somehow. The second point made me
wonder whether R is rewriting the IDs on import?
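
A minimal sketch of the suspected import behaviour, using two made-up IDs
in place of the real file (the column names `id` and `value` are assumptions
for illustration). It also shows one possible way to keep the digits intact,
by reading the ID column as character via `colClasses`:

```r
# Made-up two-row input standing in for the real file (assumption)
csv_text <- "id,value\n1234567890123456789012,1\n1234567890123456789013,2"

# Default import: the 22-digit IDs are parsed as doubles, which carry
# only about 15-17 significant digits, so the trailing digits are lost
as_numeric <- read.csv(text = csv_text)
is.numeric(as_numeric$id)
as_numeric$id[1] == as_numeric$id[2]   # two different IDs now compare equal

# Reading the ID column as character keeps every digit
as_text <- read.csv(text = csv_text, colClasses = c(id = "character"))

# Write back out; character IDs are written exactly as they were read
out_file <- tempfile(fileext = ".csv")
write.csv(as_text, out_file, row.names = FALSE, quote = FALSE)
readLines(out_file)
```

Character columns avoid the factor conversion altogether, which may also
help with the speed issue, though that would need checking on the full data.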

Does anyone have any tips on how to solve this problem?

Thanks,
Mike


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
