On 10/27/2009 1:05 PM, Kenneth Roy Cabrera Torres wrote:
Thank you very much for your interest.

I make this:
x <- as.character(alumnos$AL_NUME_ID)
x <- x[-seq_len(length(x)/2)]
save(x, file="x.RData")

I exit form R, and then restart R and I make this:

load("x.RData")
y <- gsub("(^ +)|( +$)","",x)

It shows me:

Error en gsub("(^ +)|( +$)", "", x) : input string 66644 is invalid in this locale

I'm working on Windows, so I don't have the locale problem, but I do get the segfault. I'll see if I can track down what is going wrong.

Duncan Murdoch

I delete that string (it is a string with a non usual character (Ñ))

So, I retype without that observation.

y <- gsub("(^ +)|( +$)","",x[-c(66644)])

I got this:
Error en gsub("(^ +)|( +$)", "", x[-c(66644)]) : input string 160689 is invalid in this locale

I retype again with this invalid string this way (I use the
  160690 position, because the lag of the x vector)

y <- gsub("(^ +)|( +$)","",x[-c(66644,160690)])
Error: C produce desborde de pila en 'segfault'

And it fails.

I also repeat all the process with this conversion first.

x <- iconv(as.character(alumnos$AL_NUME_ID),"latin1","UTF-8")
x <- x[-seq_len(length(x)/2)]
save(x, file="x.RData")

And I exit, and restart R, and then I type

load("x.RData")
y <- gsub("(^ +)|( +$)","",x)

And it fails again without showing me the "invalid string" errors.

I then make this:

 load("x.RData")
 y <- gsub("(^ +)|( +$)","",x[1:160690])

and it works, then I type

 y <- gsub("(^ +)|( +$)","",x[1:200000]) #(x length is 454035)

and it works...

But I start to make a manual binary search,
I found something that stills puzzle me.

y <- gsub("(^ +)|( +$)","",x[1:261570])

works, but sometimes fails (after I restart R),
it always fails with index greather than 262000.

I see that there are not something inusual arround 261570.

x[261560:261580]
 [1] "21444777             " "1147585              " "255202522
"
 [4] "25852100             " "24258550             " "A8D0251207
"
 [7] "34681811             " "19121345             " "16921329
"
[10] "20442195             " "14506482             " "44332211
"
[13] "35049122             " "34326340             " "35182366
"
[16] "33288742             " "34958795             " "1017147202
"
[19] "3306985              " "33048501             " "33295073
"

I am sending you the x.Rdata file to see if you can
reproduce my problem.

This infomation may be useful:

 sessionInfo()

R version 2.10.0 (2009-10-26) x86_64-unknown-linux-gnu
locale:
[1] LC_CTYPE=es_CO.UTF-8 LC_NUMERIC=C [3] LC_TIME=es_CO.UTF-8 LC_COLLATE=es_CO.UTF-8 [5] LC_MONETARY=C LC_MESSAGES=es_CO.UTF-8 [7] LC_PAPER=es_CO.UTF-8 LC_NAME=C [9] LC_ADDRESS=C LC_TELEPHONE=C [11] LC_MEASUREMENT=es_CO.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
 R.Version()

$platform
[1] "x86_64-unknown-linux-gnu"
$arch
[1] "x86_64"
$os
[1] "linux-gnu"
$system
[1] "x86_64, linux-gnu"
$status
[1] ""
$major
[1] "2"
$minor
[1] "10.0"
$year
[1] "2009"
$month
[1] "10"
$day
[1] "26"
$`svn rev`
[1] "50208"
$language
[1] "R"
$version.string
[1] "R version 2.10.0 (2009-10-26)"

gcc --version and g++ --verision shows me:

gcc (Ubuntu 4.3.3-5ubuntu4) 4.3.3
Copyright (C) 2008 Free Software Foundation, Inc.
Esto es software libre; vea el código para las condiciones de copia.  NO
hay
garantía; ni siquiera para MERCANTIBILIDAD o IDONEIDAD PARA UN PROPÓSITO
EN
PARTICULAR

When I compile R I use this option in configuration (nothing more)

./configure --enable-R-shlib
make sudo make install

At the moment I have 22Gb of swap partition (keeping monitor tracking
the systems is not using it) and 4GB of RAM.

Again, thank you very much for your help.

Kenneth






______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to