Re: [Rd] application to mentor syrfr package development for Google Summer of Code 2010

2010-03-11 Thread Michael Schmidt
I think it's a great idea worth trying out. We have always done significance tests just on the final frontier of models as a post-processing step. Moving this into the algorithm could focus the search more on significant, higher-quality solutions. One thing to beware of, though, is that using parsimon

Re: [Rd] application to mentor syrfr package development for Google Summer of Code 2010

2010-03-10 Thread James Salsman
Michael, Thanks for your reply with the information about the Eureqa API -- I am forwarding it to the r-devel list below. Dirk, Will you please agree to referring to the syrfr package as symbolic genetic algorithm regression of functions but not (yet) general relations? It would be best to refe

Re: [Rd] application to mentor syrfr package development for Google Summer of Code 2010

2010-03-08 Thread James Salsman
Michael, Thanks for your reply:

On Mon, Mar 8, 2010 at 12:41 AM, Michael Schmidt wrote:
> Thanks for contacting me. Eureqa takes into account the total size of an
> equation when comparing different candidate models. It attempts to find the
> set of possible equations that are non-dominated in

Re: [Rd] application to mentor syrfr package development for Google Summer of Code 2010

2010-03-08 Thread Michael Schmidt
Hi James, Thanks for contacting me. Eureqa takes into account the total size of an equation when comparing different candidate models. It attempts to find the set of possible equations that are non-dominated in both error and size. The final result is a short list consisting of the most accurate
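As an illustration of what "non-dominated in both error and size" means, here is a minimal Python sketch of a Pareto filter over candidate equations. It is not Eureqa's implementation; the function name `pareto_front`, the `(expression, error, size)` tuple layout, and all candidate values are hypothetical and chosen only for the example.

```python
from typing import List, Tuple

# Hypothetical record layout: (expression, error, size).
Model = Tuple[str, float, int]


def pareto_front(models: List[Model]) -> List[Model]:
    """Return the models that no other model beats on both error and size."""
    front = []
    for name, err, size in models:
        # A model is dominated if another one is no worse on both criteria
        # and strictly better on at least one of them.
        dominated = any(
            (e2 <= err and s2 <= size) and (e2 < err or s2 < size)
            for _, e2, s2 in models
        )
        if not dominated:
            front.append((name, err, size))
    # Order the surviving models from simplest to most complex.
    return sorted(front, key=lambda m: m[2])


# Made-up candidates; errors and sizes are illustrative only.
candidates = [
    ("a*x", 0.90, 3),
    ("a*x + b", 0.40, 5),
    ("a*x*x + b", 0.45, 7),                   # larger and less accurate than a*x + b
    ("a*sin(x) + b*x + c", 0.10, 11),
    ("a*sin(x) + b*x + c*x + d", 0.10, 15),   # same error, larger size
]

front = pareto_front(candidates)
for name, err, size in front:
    print(f"size={size:2d}  error={err:.2f}  {name}")
```

Each surviving model is the most accurate one found at or below its size, so the output is a short accuracy-versus-complexity list of the kind described above.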

Re: [Rd] application to mentor syrfr package development for Google Summer of Code 2010

2010-03-08 Thread Chidambaram Annamalai
Oh oops. I clearly embarrassed myself. :D I believe you are suggesting that, besides the evaluation functions proposed in the paper, you want to test the model produced by SR using statistical tests to prove its validity? I haven't really given much thought to using statistical tests in model eva

Re: [Rd] application to mentor syrfr package development for Google Summer of Code 2010

2010-03-07 Thread James Salsman
Chillu, I meant that development of a syrfr R package capable of using either F statistics or parametric derivatives should proceed in parallel with your work on such a derivatives package. You are right that genetic algorithm search (and general best-first search -- http://en.wikipedia.org/wi

Re: [Rd] application to mentor syrfr package development for Google Summer of Code 2010

2010-03-07 Thread Chidambaram Annamalai
> If I understand your concern, you want to lay the foundation for
> derivatives so that you can implement the search strategies described
> in Schmidt and Lipson (2010) --
> http://www.springerlink.com/content/l79v2183725413w0/ -- is that
> right?

Yes. Basically traditional "naive" error estimat

Re: [Rd] application to mentor syrfr package development for Google Summer of Code 2010

2010-03-07 Thread James Salsman
Chillu, If I understand your concern, you want to lay the foundation for derivatives so that you can implement the search strategies described in Schmidt and Lipson (2010) -- http://www.springerlink.com/content/l79v2183725413w0/ -- is that right? It is not clear to me how well this generalized app

Re: [Rd] application to mentor syrfr package development for Google Summer of Code 2010

2010-03-07 Thread Chidambaram Annamalai
It's been a while since I proposed syrfr. I have been in constant contact with many people in the R community, but I wasn't able to find a mentor for the project. I later got interested in the Automatic Differentiation proposal (adinr) and, on consulting with a few others within the R commu

[Rd] application to mentor syrfr package development for Google Summer of Code 2010

2010-03-07 Thread James Salsman
Per http://rwiki.sciviews.org/doku.php?id=developers:projects:gsoc2010 -- and http://rwiki.sciviews.org/doku.php?id=developers:projects:gsoc2010:syrfr -- I am applying to mentor the "Symbolic Regression for R" (syrfr) package for the Google Summer of Code 2010. I propose the following test which a