Dear R-community,
Rwiki is broken (well, some of the syntax is anyway). Anyone have some
insight as to what is needed to fix it?
Sincerely,
KeithC.
__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the pos
Dear Sarah,
[snip...]
"I know that samples within each facility cannot be treated as independent,
so I need an approach that accounts for (1) clustering within facilities
and"
You could just use lm() & some planning. The data from within a specific
facility can be fit with a model to generate par
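A minimal sketch of the per-facility idea, assuming simulated data: the data frame `dat`, its columns `facility`, `x`, `y`, and the three facility labels are all hypothetical and not from the original post.

```r
# Hypothetical data: 3 facilities with 10 samples each
set.seed(1)
dat <- data.frame(facility = rep(c("f1", "f2", "f3"), each = 10),
                  x = rnorm(30))
dat$y <- 1 + 2 * dat$x + rnorm(30, sd = 0.5)

# Fit a separate lm() within each facility
fits <- lapply(split(dat, dat$facility), function(d) lm(y ~ x, data = d))

# One row of coefficients per facility
coefs <- t(sapply(fits, coef))
coefs
```

The per-facility coefficients in `coefs` could then be carried into a second-stage analysis, which is one way to read the "planning" part of the suggestion.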
Dear Sandra,
R^2 is just the proportion of the error in one model that a second model
accounts for.
PRE (proportional reduction in error) = R^2 = (SSE model C - SSE model
A)/SSE model C.
This is sometimes expressed as (SSEc-SSEa)/SSEc = SSR/SSEc, where SSR is the
sum of squares reduced.
Given your example with some exten
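The PRE formula above can be checked numerically. This is a sketch under assumptions: model C is taken to be an intercept-only model and model A adds one predictor; the simulated data and the names `model_c` and `model_a` are illustrative only.

```r
# Simulated data (hypothetical)
set.seed(1)
x <- rnorm(50)
y <- 2 + 0.5 * x + rnorm(50)

model_c <- lm(y ~ 1)   # model C: intercept only
model_a <- lm(y ~ x)   # model A: adds the predictor x

sse_c <- sum(resid(model_c)^2)
sse_a <- sum(resid(model_a)^2)

# Proportional reduction in error
pre <- (sse_c - sse_a) / sse_c

# Agrees with the R^2 reported by summary()
all.equal(pre, summary(model_a)$r.squared)
```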
Wow! This definitely contributed to my evening.
If you could indulge, I would like some clarification on this matter of
binning and distortion, particularly wrt time series (perhaps related to
long-memory processes?). I had thought binning was standard practice in
spectral analysis and ANPOW.
..
?paste
See also,
http://rwiki.sciviews.org/doku.php?id=large_scale_data:lsdioi_sqchunk for
additional examples.
Assuming stations[1:2] are tables in your database:
# should read the whole table when invoked
simpleQuery <- paste("SELECT * FROM", stations[1])
dbGetQuery(con_stations, simpleQuery)
Is t
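Because paste() is vectorised, one call can build a query per table. A small sketch, assuming two hypothetical table names; no database connection is opened here.

```r
# Hypothetical table names standing in for stations[1:2]
stations <- c("station_a", "station_b")

# paste() recycles the fixed string across the vector of names
queries <- paste("SELECT * FROM", stations)
queries
```

Each element of `queries` could then be passed to dbGetQuery() in turn, for example inside lapply().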
Dear n.vialma,
Good question! Your columns are of type factor(). Watch out for strange
things with coercion (and so much for the 3 minute reply)! In this solution,
you need a pre-allocated vector to store the results, and your approach is
different depending on the data type you want the resulting
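The coercion pitfall with factors can be sketched as follows. The factor `f` is made up for illustration; the point is that as.numeric() on a factor returns the internal level codes, not the labels.

```r
# Hypothetical factor column holding numbers stored as labels
f <- factor(c("10", "20", "10", "30"))

codes  <- as.numeric(f)                 # internal level codes
values <- as.numeric(as.character(f))   # the numbers actually intended

# Pre-allocate the result vector, then fill it in a loop
out <- numeric(length(f))
for (i in seq_along(f)) {
  out[i] <- as.numeric(as.character(f[i]))
}
```

Here `codes` holds 1, 2, 1, 3 while `values` and `out` hold 10, 20, 10, 30.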
Dear Alex,
Has manual garbage collection had any effect?
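For reference, a minimal sketch of manual garbage collection; the object `big` is a hypothetical stand-in for a large intermediate result.

```r
# Hypothetical large object
big <- numeric(1e7)
size_mb <- as.numeric(object.size(big)) / 1024^2

# Drop the reference, then ask R to collect
rm(big)
mem <- gc()   # returns a matrix of memory usage statistics
mem
```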
Sincerely,
KeithC.
-Original Message-
From: Alex van der Spek [mailto:do...@xs4all.nl]
Sent: Wednesday, May 05, 2010 3:48 AM
To: r-help@r-project.org
Subject: [R] Memory issue
Reading a flat text file 138 Mbyte large into R with
One line at a time! That must have taken forever!
Do you happen to know how many lines you could read in at once & still have
enough room to work with?
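Reading in blocks rather than one line at a time might look like this. It is a sketch only: the temporary file and the chunk size of 4 are arbitrary assumptions chosen to keep the example small.

```r
# Hypothetical 10-line file
tmp <- tempfile()
writeLines(paste("line", 1:10), tmp)

con <- file(tmp, open = "r")
chunk_sizes <- integer(0)
repeat {
  chunk <- readLines(con, n = 4)   # read up to 4 lines per pass
  if (length(chunk) == 0) break    # nothing left to read
  # ... process 'chunk' here ...
  chunk_sizes <- c(chunk_sizes, length(chunk))
}
close(con)
chunk_sizes
```

Keeping the connection open between calls is what lets readLines() resume where the previous chunk ended.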
Sincerely,
KeithC.
-Original Message-
From: Seth [mailto:sjmy...@syr.edu]
Sent: Tuesday, May 04, 2010 11:05 PM
To: r-help@r-project.org
Dear Seth,
If this were my project, I would likely use something besides readLines().
Have you looked into read.table() or scan()? They'll separate based on your
delimiter on input so you do not need to do post processing.
#example file
txt2<-cbind(c("A","cat","ran","over","the"),c("brown","fox."
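Separately from the truncated example above, here is a self-contained sketch of splitting on a delimiter at read time. The tab-delimited temporary file is an assumption made for illustration.

```r
# Hypothetical tab-delimited file with two fields per line
tmp <- tempfile(fileext = ".txt")
writeLines(c("A\tbrown", "cat\tfox"), tmp)

# read.table() returns a data frame, one column per field
tab <- read.table(tmp, sep = "\t", stringsAsFactors = FALSE)

# scan() returns a flat character vector of fields
words <- scan(tmp, what = character(), sep = "\t", quiet = TRUE)
```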
Dear someone,
Jorge's solution is excellent, assuming it is what you had in mind. Please
note that the help page for unique() has duplicated() listed in its "See
Also" section. Thus, when you studied ?unique(), it would have made sense to
read about duplicated() as well. Or perhaps you did look in
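How unique() and duplicated() relate can be sketched with a made-up vector:

```r
x <- c(1, 2, 2, 3, 1)   # hypothetical data

u   <- unique(x)        # distinct values, in order of first appearance
dup <- duplicated(x)    # TRUE where a value has been seen before

# Dropping the duplicated positions reproduces unique()
identical(x[!dup], u)
```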
KeithC.
-Original Message-
From: Kyeong Soo (Joseph) Kim [mailto:kyeongsoo@gmail.com]
Sent: Friday, April 30, 2010 4:10 AM
To: kMan
Cc: r-help@r-project.org
Subject: Re: [R] Curve Fitting/Regression with Multiple Observations
Dear Keith,
Thanks for the suggestion and taking your time to re
Dear Kay,
It is not an issue of whether it is "allowed". The issue is understanding
what comparisons are taking place between the models. Your advisor should
help with that.
Perhaps you had a more focused question related to R?
Sincerely,
KeithC.
-Original Message-
From: Kay Cichini [ma
Dear Joseph,
If you do not need to make any inferences, that is, you just want it to look
pretty, then drawing a curve by hand is as good a solution as any. Plus, there
is no reason for expert testimony to say that the curve does not mean anything.
Sincerely,
KeithC.
-Original Message-
Dear list,
There is a new rwiki (http://rwiki.sciviews.org) section for 'large scale
data' at http://rwiki.sciviews.org/doku.php?id=large_scale_data. This is a
massive topic of interest to many. I could use some help filling in the
content so that this can develop into a powerful resource for the
You set records to NULL perhaps (delete, shift up). Perhaps your system is
susceptible to butterflies on the other side of the world.
Your code may have 'worked' on a small section of data, but the data used
did not include all of the cases needed to fully test your code. So... test
your code!
sc
dered too far along?
Sincerely,
KeithC.
-Original Message-
From: b.rowling...@googlemail.com [mailto:b.rowling...@googlemail.com] On
Behalf Of Barry Rowlingson
Sent: Monday, April 19, 2010 12:33 AM
To: kMan
Cc: r-help@r-project.org
Subject: Re: [R] Serverless databases in R
On Sun, Apr 18, 201
with them.
Sincerely,
KeithC.
-Original Message-
From: b.rowling...@googlemail.com [mailto:b.rowling...@googlemail.com] On
Behalf Of Barry Rowlingson
Sent: Sunday, April 18, 2010 12:25 PM
To: Gabor Grothendieck
Cc: kMan; r-help@r-project.org
Subject: Re: [R] Serverless databases in R
On Sun, Apr
Dear list,
What are the cross-platform, serverless databases in R?
Sincerely,
KeithC.
strating why the 'bad practice' part of working with raw
text files is something more than mere prescription.
Sincerely,
KeithC.
-Original Message-
From: Thomas Lumley [mailto:tlum...@u.washington.edu]
Sent: Monday, March 29, 2010 2:56 PM
To: Gabor Grothendieck
Cc: kMan; r-help;
>This was *very* useful for me when I dealt with a 1.5Gb text file
>http://www.csc.fi/sivut/atcsc/arkisto/atcsc3_2007/ohjelmistot_html/R_and_large_data/
Two hours is a *very* long time to transfer a csv file to a db. The author
of the linked article has not documented how to use scan() arguments
Dear Dhruv,
Your clarification helps, and I'm stumped. Sorry I cannot be of more help.
Sincerely,
KeithC.
-Original Message-
From: Sharma, Dhruv [mailto:dhruv.sha...@penfed.org]
Sent: Monday, March 08, 2010 7:51 AM
To: kMan; r-help@r-project.org
Subject: RE: [R] r code to gen
Dear Dhruv,
You could create interaction variables manually (assuming A is your
dependent variable). Just multiply the variables together.
cd.int<-C*D
ce.int<-C*E
cde.int<-C*D*E # what about D*E, or interactions with B?
Include those in your model, such as A~B+C+D+E+cd.int+ce.int+cde.int.
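A sketch showing that a hand-built product term gives the same fit as the formula operator. The data frame `d` and its simulated columns are hypothetical.

```r
# Simulated data (hypothetical)
set.seed(42)
d <- data.frame(C = rnorm(30), D = rnorm(30))
d$A <- 1 + d$C * d$D + rnorm(30)

# Manual interaction variable: just multiply the columns
d$cd.int <- d$C * d$D
fit_manual <- lm(A ~ C + D + cd.int, data = d)

# The same model written with the formula interaction operator
fit_formula <- lm(A ~ C * D, data = d)

all.equal(unname(coef(fit_manual)), unname(coef(fit_formula)))
```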
c(x <- 1:5, x[4:1]) # assign x first; arguments are evaluated left to right
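Two other ways to build the same up-and-down sequence, sketched for a general peak value `n` (the name `n` is an assumption for illustration):

```r
n <- 5L

# Count up to n, then back down from n - 1
a <- c(seq_len(n), rev(seq_len(n - 1L)))

# Or subtract the distance from the midpoint
b <- n - abs(seq(-(n - 1L), n - 1L))
```

Neither form depends on an object having been assigned earlier in the session.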
-Original Message-
From: baptiste auguie [mailto:baptiste.aug...@googlemail.com]
Sent: Friday, March 05, 2010 1:08 AM
To: kensuguro
Cc: r-help@r-project.org
Subject: Re: [R] how to make this sequence: 1,2,3,4,5,4,3,2,1
c(x <- 1:5, rev(x[-length(x)]))
On 5 March 2010
(1) - nlme, lattice, stats
(2) - a usable large-file/out of memory regression package that abstracts
"all" the details of connections, etc., from the user, except perhaps the
initial function call, so I don't have to actually know anything about the
file I'm opening, how big it is, how many lines of
John,
I felt a short, somewhat strong reply was in order. One of the inherent
aspects of the language is that R demands more of an understanding from
users about what is taking place. Model formulae, for example, are close to
what one would use if they were to write the model on paper. I consider
JD,
I would recommend 64 bit, and win 7 on a quad core system has been
surprisingly stable for me. Allan's points are also spot on. I would suggest
reading the references Allan provided.
Increased memory will increase your ability to handle n vectors of 2^31-1
elements, a limit that was a design
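The 2^31-1 figure is the largest value of R's integer type, which can be checked directly. The memory estimate below is a rough sketch that assumes 8 bytes per double and ignores object overhead.

```r
max_len <- .Machine$integer.max
max_len == 2^31 - 1

# Approximate memory for one double vector of that length, in GB
bytes_needed_gb <- max_len * 8 / 1024^3
bytes_needed_gb
```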
I would first abandon time-domain techniques altogether for irregularly spaced
series. The Lomb-Scargle periodogram is available for R, here:
http://research.stowers-institute.org/efg/2005/LombScargle/R/index.htm.
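For orientation, here is a bare-bones sketch of the classical Lomb-Scargle periodogram in base R. It is not the code from the linked page; the function name `lomb_scargle`, the simulated series, and the frequency grid are all assumptions made for illustration.

```r
# Classical Lomb-Scargle power at each frequency in 'freqs' (cycles per unit time)
lomb_scargle <- function(t, y, freqs) {
  y  <- y - mean(y)
  s2 <- var(y)
  sapply(freqs, function(f) {
    w   <- 2 * pi * f
    # Time offset that makes the sine and cosine terms orthogonal
    tau <- atan2(sum(sin(2 * w * t)), sum(cos(2 * w * t))) / (2 * w)
    cs  <- cos(w * (t - tau))
    sn  <- sin(w * (t - tau))
    (sum(y * cs)^2 / sum(cs^2) + sum(y * sn)^2 / sum(sn^2)) / (2 * s2)
  })
}

# Irregularly spaced samples of a noisy sinusoid with frequency 0.1
set.seed(1)
t <- sort(runif(200, 0, 100))
y <- sin(2 * pi * 0.1 * t) + rnorm(200, sd = 0.3)

freqs <- seq(0.01, 0.5, by = 0.005)
power <- lomb_scargle(t, y, freqs)
peak  <- freqs[which.max(power)]
peak
```

No interpolation onto a regular grid is needed, which is the appeal of this approach for irregular series.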
KeithC.
-Original Message-
From: ravi [mailto:rv...@yahoo.se]
Sent: Fr
Then use strsplit() instead.
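A sketch of what that might look like, assuming the goal is to pull out the text following "New York"; the input string is made up.

```r
x <- "New York12"   # hypothetical input

# Split on the literal text; the piece after the match is the second element
parts <- strsplit(x, "New York", fixed = TRUE)[[1]]
after <- parts[2]
after
```

Because the match is at the start of the string, the first element of `parts` is an empty string.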
KeithC.
-Original Message-
From: kayj [mailto:kjaj...@yahoo.com]
Sent: Friday, February 26, 2010 1:02 PM
To: r-help@r-project.org
Subject: Re: [R] using grep
Hi ,
I have tried
gsub(".*York(\\d+).*", "\\1", grep("New York", x, value = TRUE))
and outputs
Dear Peter,
What data types does your list contain? Have you tried treating the list as
a data frame or matrix?
KeithC.
-Original Message-
From: Heym, Peter-Paul [mailto:ph...@ipb-halle.de]
Sent: Thursday, February 25, 2010 2:11 AM
To: r-help@R-project.org
Subject: [R] how to fast extr
PM
To: kMan
Subject: Re: [R] lm function in R
Thank you so much, kMan. That makes sense. Only one question, how can I
see the value of 'error'? Here's what I see:
Call:
lm(formula = Y ~ X1 * X2 * X3, na.action = na.exclude)
Coefficients:
(Intercept)
example. I have not decided how I feel about that yet, but I do have a
splitting headache.
Sincerely,
KeithC.
-Original Message-
From: Peter Ehlers [mailto:ehl...@ucalgary.ca]
Sent: Sunday, February 14, 2010 10:04 PM
To: kMan
Cc: r-help@r-project.org
Subject: Re: [R] Plot differe
Peter wrote:
>You like to live dangerously.
Clue me in, Professor.
Sincerely,
KeithC.
-Original Message-
From: Peter Ehlers [mailto:ehl...@ucalgary.ca]
Sent: Sunday, February 14, 2010 6:49 PM
To: kMan
Cc: 'David Winsemius'; 'Rhonda Reidy'; r-help@r-project.or
Dear Rhonda,
Consider curve().
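A minimal sketch of overlaying two fitted models with curve(); the simulated data and the choice of a linear and a logarithmic model are assumptions for illustration.

```r
# Simulated data (hypothetical)
set.seed(1)
x <- runif(40, 1, 10)
y <- 3 * log(x) + rnorm(40, sd = 0.3)

fit_lin <- lm(y ~ x)
fit_log <- lm(y ~ log(x))

plot(x, y)
# add = TRUE draws on the existing plot; 'x' inside curve() is the plotting variable
curve(coef(fit_lin)[1] + coef(fit_lin)[2] * x, add = TRUE, lty = 1)
curve(coef(fit_log)[1] + coef(fit_log)[2] * log(x), add = TRUE, lty = 2)
legend("bottomright", legend = c("linear", "log"), lty = 1:2)
```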
KeithC.
-Original Message-
From: Rhonda Reidy [mailto:rre...@gmail.com]
Sent: Saturday, February 13, 2010 11:36 AM
To: r-help@r-project.org
Subject: [R] Plot different regression models on one graph
The following variables have the following significant r
I would use all of the data. If you want to "drop" one, control for it in
the model & sacrifice a degree of freedom.
Why the call to poly() by the way?
KeithC.
-Original Message-
From: Peter Ehlers [mailto:ehl...@ucalgary.ca]
Sent: Saturday, February 13, 2010 1:35 PM
To: David Winsemius
Dear Anton,
4Mb is not a lot of data. A Gb still wouldn't be that troublesome in a flat
file. Your data can be migrated to a relational database at a future point.
Sincerely,
KeithC.
-Original Message-
From: Anton du Toit [mailto:atdutoitrh...@gmail.com]
Sent: Saturday, February 13, 20
Dear Kara,
Did you bother to test your code? You say your code actually did some
summing, but you didn't include any working example of that code. Did you
bother to read the posting guide?
(1) TRY to reference Col A (including space, as you indicated) in df.
>names(df)<-c("Col A", "Col B") #space
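For column names that contain a space, a sketch of two ways to reference them; the data frame below is made up.

```r
df <- data.frame(a = 1:3, b = 4:6)      # hypothetical data
names(df) <- c("Col A", "Col B")        # names with a space

total_a <- sum(df[["Col A"]])           # quoted name inside [[ ]]
total_b <- sum(df$`Col B`)              # backticks with $
```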
It depends on the type of file and your system. 'count.fields()' is
impractical for large files because it returns a vector with one element
per line of the file. It would be easier to use scan() with the sep
argument set up to read to the end of line marker, "\n" I believe,
and th
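A sketch of reading whole lines with scan() by splitting only on the newline character. The small temporary file is an assumption; note that scan() skips blank lines by default.

```r
# Hypothetical 3-line file with spaces inside the lines
tmp <- tempfile()
writeLines(c("a b c", "d e", "f"), tmp)

# With sep = "\n", each line comes back as a single field
lines_read <- scan(tmp, what = character(), sep = "\n", quiet = TRUE)
n_lines <- length(lines_read)
n_lines
```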
Use "\\^" instead.
For example:
df<-data.frame(c(1,2,4),c(4,3,2))
names(df)<-c("Amt","Resp")
df.form<-formula(Resp~0+Amt+I(Amt^2), data=df)
strsplit(as.character(df.form)[3], "\\^", perl=TRUE)
should work just fine.
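The reason for the escape is that an unescaped ^ is a regular-expression anchor. A short sketch comparing the escaped pattern with a literal match via fixed = TRUE (the latter is an alternative not mentioned in the original reply):

```r
rhs <- as.character(Resp ~ 0 + Amt + I(Amt^2))[3]   # right-hand side as text

escaped <- strsplit(rhs, "\\^")[[1]]                # regex with the caret escaped
literal <- strsplit(rhs, "^", fixed = TRUE)[[1]]    # treat the pattern as plain text

identical(escaped, literal)
```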
Sincerely,
KeithC.
-Original Message-
From: Gabor Grothendieck [mailto:ggr