[R] What happened to Rwiki?

2011-01-10 Thread kMan
Dear R-community, Rwiki is broken (well, some of the syntax is anyway). Anyone have some insight as to what is needed to fix it? Sincerely, KeithC. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the pos

Re: [R] Question about food sampling analysis

2010-07-14 Thread kMan
Dear Sarah, [snip...] "I know that samples within each facility cannot be treated as independent, so I need an approach that accounts for (1) clustering within facilities and" You could just use lm() & some planning. The data from within a specific facility can be fit with a model to generate par

Re: [R] Calculation of r squared from a linear regression

2010-06-15 Thread kMan
Dear Sandra, R^2 is just a ratio between the amount of error explained between two models. PRE (proportional reduction in error) = R^2 = (SSE model C - SSE model A)/SSE model C. This is sometimes expressed as (SSEc-SSEa)/SSEc = SSR/SSEc |SSR=sum squared reduced Given your example with some exten

Re: [R] Simple question on binning data

2010-05-14 Thread kMan
Wow! This definitely contributed to my evening. If you could indulge, I would like some clarification on this matter of binning and distortion, particularly wrt time series (perhaps related to long-memory processes?). I had thought binning was standard practice in spectral analysis and ANPOW. ..

Re: [R] dbSendQuery with R variables

2010-05-11 Thread kMan
?paste See also, http://rwiki.sciviews.org/doku.php?id=large_scale_data:lsdioi_sqchunk for additional examples. Assuming stations[1:2] are tables your database: simpleQuery<- paste("SELECT * FROM", stations[1]) # should read the whole table when invoked dbGetQuery(con_stations, simpleQuery) Is t

Re: [R] concatenate values of two columns

2010-05-05 Thread kMan
Dear n.vialma, Good question! Your columns are of type factor(). Watch out for strange things with coercion (and so much for the 3 minute reply)! In this solution, you need a pre-allocated vector to store the results, and your approach is different depending on the data type you want the resulting

Re: [R] Memory issue

2010-05-05 Thread kMan
Dear Alex, Has manual garbage collection had any effect? Sincerely, KeithC. -Original Message- From: Alex van der Spek [mailto:do...@xs4all.nl] Sent: Wednesday, May 05, 2010 3:48 AM To: r-help@r-project.org Subject: [R] Memory issue Reading a flat text file 138 Mbyte large into R with

Re: [R] readLines with space-delimiter?

2010-05-05 Thread kMan
One line at a time! That must have taken forever! Do you happen to know how many lines you could read in at once & still have enough room to work with? Sincerely, KeithC. -Original Message- From: Seth [mailto:sjmy...@syr.edu] Sent: Tuesday, May 04, 2010 11:05 PM To: r-help@r-project.org

Re: [R] readLines with space-delimiter?

2010-05-05 Thread kMan
Dear Seth, If this were my project, I would likely use something besides readLines(). Have you looked into read.table() or scan()? They'll separate based on your delimiter on input so you do not need to do post processing. #example file txt2<-cbind(c("A","cat","ran","over","the"),c("brown","fox."

Re: [R] Delete rows with duplicate field...

2010-05-04 Thread kMan
Dear someone, Jorge's solution is excellent, assuming it is what you had in mind. Please note that the help page for unique() has duplicated() listed in its "See Also" section. Thus, when you studied ?unique(), it would have made sense to read about duplicated() as well. Or perhaps you did look in

Re: [R] Curve Fitting/Regression with Multiple Observations

2010-04-30 Thread kMan
eithC. -Original Message- From: Kyeong Soo (Joseph) Kim [mailto:kyeongsoo@gmail.com] Sent: Friday, April 30, 2010 4:10 AM To: kMan Cc: r-help@r-project.org Subject: Re: [R] Curve Fitting/Regression with Multiple Observations Dear Keith, Thanks for the suggestion and taking your time to re

Re: [R] model selection, lme

2010-04-29 Thread kMan
Dear Kay, It is not an issue of whether it is "allowed". The issue is understanding what comparisons are taking place between the models. Your advisor should help with that. Perhaps you had a more focused question related to R? Sincerely, KeithC. -Original Message- From: Kay Cichini [ma

Re: [R] Curve Fitting/Regression with Multiple Observations

2010-04-29 Thread kMan
Dear Joseph, If you do not need to make any inferences, that is, you just want it to look pretty, then drawing a curve by hand is as good a solution as any. Plus, there is no reason for expert testimony to say that the curve does not mean anything. Sincerely, KeithC. -Original Message-

[R] New rwiki page for "large scale data"

2010-04-26 Thread kMan
Dear list, There is a new rwiki (http://rwiki.sciviews.org) section for 'large scale data' at http://rwiki.sciviews.org/doku.php?id=large_scale_data. This is a massive topic of interest to many. I could use some help filling in the content so that this can develop into a powerful resource for the

Re: [R] Huge data sets and RAM problems

2010-04-21 Thread kMan
You set records to NULL perhaps (delete, shift up). Perhaps your system is susceptible to butterflies on the other side of the world. Your code may have 'worked' on a small section of data, but the data used did not include all of the cases needed to fully test your code. So... test your code! sc

Re: [R] Serverless databases in R

2010-04-19 Thread kMan
dered too far along? Sincerely, KeithC. -Original Message- From: b.rowling...@googlemail.com [mailto:b.rowling...@googlemail.com] On Behalf Of Barry Rowlingson Sent: Monday, April 19, 2010 12:33 AM To: kMan Cc: r-help@r-project.org Subject: Re: [R] Serverless databases in R On Sun, Apr 18, 201

Re: [R] Serverless databases in R

2010-04-18 Thread kMan
with them. Sincerely, KeithC. -Original Message- From: b.rowling...@googlemail.com [mailto:b.rowling...@googlemail.com] On Behalf Of Barry Rowlingson Sent: Sunday, April 18, 2010 12:25 PM To: Gabor Grothendieck Cc: kMan; r-help@r-project.org Subject: Re: [R] Serverless databases in R On Sun, Apr

[R] Serverless databases in R

2010-04-17 Thread kMan
Dear list, What are the cross-platform, serverless databases in R? Sincerely, KeithC. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and pro

Re: [R] large dataset

2010-03-29 Thread kMan
strating why the 'bad practice' part of working with raw text files is something more than mere prescription. Sincerely, KeithC. -Original Message- From: Thomas Lumley [mailto:tlum...@u.washington.edu] Sent: Monday, March 29, 2010 2:56 PM To: Gabor Grothendieck Cc: kMan; r-help;

Re: [R] large dataset

2010-03-28 Thread kMan
>This was *very* useful for me when I dealt with a 1.5Gb text file >http://www.csc.fi/sivut/atcsc/arkisto/atcsc3_2007/ohjelmistot_html/R_and_la rge_data/ Two hours is a *very* long time to transfer a csv file to a db. The author of the linked article has not documented how to use scan() arguments

Re: [R] r code to generate interaction columns

2010-03-09 Thread kMan
Dear Dhruv, Your clarification helps, and I'm stumped. Sorry I cannot be of more help. Sincerely, KeithC. -Original Message- From: Sharma, Dhruv [mailto:dhruv.sha...@penfed.org] Sent: Monday, March 08, 2010 7:51 AM To: kMan; r-help@r-project.org Subject: RE: [R] r code to gen

Re: [R] r code to generate interaction columns

2010-03-07 Thread kMan
Dear Dhruv, You could create interaction variables manually (assuming A is your dependent variable). Just multiply the variables together. cd.int<-C*D ce.int<-C*E cde.int<-C*D*E # what about D*E, or interactions with B? Include those in your model, such as A~B+C+D+E+cd.int+cd.int+ce.int+cde.int.

Re: [R] how to make this sequence: 1,2,3,4,5,4,3,2,1

2010-03-05 Thread kMan
c(x,(x<-1:5)[4:1]) -Original Message- From: baptiste auguie [mailto:baptiste.aug...@googlemail.com] Sent: Friday, March 05, 2010 1:08 AM To: kensuguro Cc: r-help@r-project.org Subject: Re: [R] how to make this sequence: 1,2,3,4,5,4,3,2,1 c(x <- 1:5, rev(x[-length(x)])) On 5 March 2010

Re: [R] Three most useful R package

2010-03-05 Thread kMan
(1) - nlme, lattice, stats (2) - a usable large-file/out of memory regression package that abstracts "all" the details of connections & etc from the user, accept perhaps the initial function call, so I don't have to actually know anything about the file I'm opening, how big it is, how many lines of

Re: [R] two questions for R beginners

2010-03-03 Thread kMan
John, I felt a short, somewhat strong reply was in order. One of the inherent aspects of the language is that R demands more of an understanding from users about what is taking place. Model formulae, for example, are close to what one would use if they were to write the model on paper. I consider

Re: [R] Best Hardware & OS For Large Data Sets

2010-02-28 Thread kMan
JD, I would recommend 64 bit, and win 7 on a quad core system has been surprisingly stable for me. Allan's points are also spot on. I would suggest reading the references Allan provided. Increased memory will increase your ability to handle n vectors of 2^31-1 elements, a limit that was a design

Re: [R] decomposing an irregularly spaced time series

2010-02-28 Thread kMan
I would first abandon time-domain techniques altogether for irregularly spaced series. The Lomb Scargle periodogram is available for R, here: http://research.stowers-institute.org/efg/2005/LombScargle/R/index.htm. KeithC. -Original Message- From: ravi [mailto:rv...@yahoo.se] Sent: Fr

Re: [R] using grep

2010-02-28 Thread kMan
Then use strsplit() instead. KeithC. -Original Message- From: kayj [mailto:kjaj...@yahoo.com] Sent: Friday, February 26, 2010 1:02 PM To: r-help@r-project.org Subject: Re: [R] using grep Hi , I have tried gsub(".*York(\\d+).*", "\\1", grep("New York", x, value = TRUE)) and outputs

Re: [R] how to fast extract values from different list elements

2010-02-26 Thread kMan
Dear Peter, What data types does your list contain? Have you tried treating the list as a data frame or matrix? KeithC. -Original Message- From: Heym, Peter-Paul [mailto:ph...@ipb-halle.de] Sent: Thursday, February 25, 2010 2:11 AM To: r-help@R-project.org Subject: [R] how to fast extr

Re: [R] lm function in R

2010-02-14 Thread kMan
PM To: kMan Subject: Re: [R] lm function in R Thank you so much, kMan. That makes sense. Only one question, how can I see the value of 'error'? Here's what I see: Call: lm(formula = Y ~ X1 * X2 * X3, na.action = na.exclude) Coefficients: (Intercept)

Re: [R] Plot different regression models on one graph

2010-02-14 Thread kMan
example. I have not decided how I feel about that yet, but I do have a splitting headache. Sincerely, KeithC. -Original Message- From: Peter Ehlers [mailto:ehl...@ucalgary.ca] Sent: Sunday, February 14, 2010 10:04 PM To: kMan Cc: r-help@r-project.org Subject: Re: [R] Plot differe

Re: [R] Plot different regression models on one graph

2010-02-14 Thread kMan
Peter wrote: >You like to live dangerously. Clue me in, Professor. Sincerely, KeithC. -Original Message- From: Peter Ehlers [mailto:ehl...@ucalgary.ca] Sent: Sunday, February 14, 2010 6:49 PM To: kMan Cc: 'David Winsemius'; 'Rhonda Reidy'; r-help@r-project.or

Re: [R] Plot different regression models on one graph

2010-02-14 Thread kMan
Dear Rhonda, Consider curve(). KeithC. -Original Message- From: Rhonda Reidy [mailto:rre...@gmail.com] Sent: Saturday, February 13, 2010 11:36 AM To: r-help@r-project.org Subject: [R] Plot different regression models on one graph The following variables have the following significant r

Re: [R] Plot different regression models on one graph

2010-02-14 Thread kMan
I would use all of the data. If you want to "drop" one, control for it in the model & sacrifice a degree of freedom. Why the call to poly() by the way? KeithC. -Original Message- From: Peter Ehlers [mailto:ehl...@ucalgary.ca] Sent: Saturday, February 13, 2010 1:35 PM To: David Winsemius

Re: [R] Hierarchical data sets: which software to use?

2010-02-13 Thread kMan
Dear Anton, 4Mb is not a lot of data. A Gb still wouldn't be that troublesome in a flat file. Your data can be migrated to a relational database at a future point. Sincerely, KeithC. -Original Message- From: Anton du Toit [mailto:atdutoitrh...@gmail.com] Sent: Saturday, February 13, 20

Re: [R] sum sections of data of different lengths from within a data frame

2010-02-10 Thread kMan
Dear Kara, Did you bother to test your code? You say your code actually did some summing, but you didn't include any working example of that code. Did you bother to read the posting guide? (1) TRY to reference Col A (including space, as you indicated) in df. >names(df)<-c("Col A", "Col B") #space

Re: [R] Fast way to determine number of lines in a file

2010-02-09 Thread kMan
It depends on the type of file and your system. 'count.fields()' is impractical for large files because it generates a matrix with the same number of dimensions as the file. It would be easier to use scan() with the delimiter argument set up to read to the end of line marker, "\n" I believe, and th

Re: [R] the hat ^ in regular expression

2010-02-09 Thread kMan
Use "\\^" instead. For example: df<-data.frame(c(1,2,4),c(4,3,2)) names(df)<-c("Amt","Resp") df.form<-formula(Resp~0+Amt+I(Amt^2), data=df) strsplit(as.character(df.form)[3], "\\^", perl=T) should work just fine. Sincerely, KeithC. -Original Message- From: Gabor Grothendieck [mailto:ggr