Re: [R] Principle components analysis on a large dataset

2009-08-21 Thread misha680
x. > > Good luck! > > Moshe. > > P.S. I am curious to see what other people think. > > --- On Fri, 21/8/09, misha680 wrote: > >> From: misha680 >> Subject: [R] Principle components analysis on a large dataset >> To: r-help@r-project.org >>

[R] Principle components analysis on a large dataset

2009-08-21 Thread Prof. John C Nash
Thu, 20 Aug 2009 17:45:00 -0700 (PDT) From: misha680 Subject: [R] Principle components analysis on a large dataset To: r-help@r-project.org Message-ID: <25072510.p...@talk.nabble.com> Content-Type: text/plain; charset=us-ascii Dear Sirs: Please pardon me I am very new to R. I have been us

Re: [R] Principle components analysis on a large dataset

2009-08-20 Thread Jari Oksanen
Moshe Olshansky yahoo.com> writes: > > Hi Misha, > > Since PCA is a linear procedure and you have only 6000 observations, you do not need 68000 variables. Using > any 6000 of your variables so that the resulting 6000x6000 matrix is non-singular will do. You can choose > these 6000 variables (co

[R] Principle components analysis on a large dataset

2009-08-20 Thread misha680
Dear Sirs: Please pardon me I am very new to R. I have been using MATLAB. I was wondering if R would allow me to do principal components analysis on a very large dataset. Specifically, our dataset has 68800 variables and around 6000 observations. Matlab gives "out of memory" errors. I have trie

Re: [R] Principle components analysis on a large dataset

2009-08-20 Thread Moshe Olshansky
000 matrix, i.e. 36,000,000 entries, which can fit into a memory and you can perform the usual PCA on this matrix. Good luck! Moshe. P.S. I am curious to see what other people think. --- On Fri, 21/8/09, misha680 wrote: > From: misha680 > Subject: [R] Principle components analysis on

[R] Principle components analysis on a large dataset

2009-08-20 Thread misha680
Dear Sirs: Please pardon me I am very new to R. I have been using MATLAB. I was wondering if R would allow me to do principal components analysis on a very large dataset. Specifically, our dataset has 68800 variables and around 6000 observations. Matlab gives "out of memory" errors. I have trie