Mark, The paper published by Bentor is part of a published collection. I don't know if it is available online. (I can, if you like, scan the relevant 2-3 pages and email them to you.)
Computer Based Horse Race Handicapping and Wagering Systems: A Report William Bentor Thanks! -Noah On 8/25/09 5:25 PM, markle...@verizon.net wrote: > Hi Noah: Do you have a referene or the paper to the horse racing paper > that you referred > to previously ? I can't help you with below because I haven't mastered > the difference yet > between the multinomial logit and the conditional logit. Chuck's > reference didn't help me much > with that so if you know of others, please let me know. Thanks. > > > > Mark > > > On Aug 25, 2009, *Noah Silverman* <n...@smartmediacorp.com> wrote: > > Hello > > I believe that I'm getting very close in my modeling application. > > I've come across a challenge that I am unable to solve and would > really > appreciate the group's opinion. > > I've been using the val.prob function from the Design library (Thanks > Frank!!) to both evaluate and visualize my model. > > From the scores and graph, it appears as my model is very accurate in > predicting probabilities correctly. Please see attachment "graph1.pdf" > > Since I'm scoring horse races, I assume that I need to "normalize" > the > predicted probabilities by race. (Described in Bentor.) > I am calculating a conditional logit manually since there is a bug in > the Survival library for this function. > > A val.prob function applied to my conditional logit score shows an > interesting result. The line is almost perfectly parallel to the > "ideal" mark on the graph, but is offset by a significant amount. My > first thought is that this indicates an error in my calculation > somewhere. Please see attachment "graph2.pdf" > > Below is the two step process that I used for the conditional logit. > -------------------------------------------------- > 1) First a standard logistic regression is calculated on two > variables: > model <- lrm(label ~ val1 + val2, data = traindata ) > > This gives me the following results: > Coef S.E. Wald Z P > Intercept 1.8065 0.05137 35.16 0 > val1 0.8105 0.02567 31.57 0 > val2 0.5218 0.04308 12.11 0 > > 2) I then calculate a conditional logit: > > testdata$log_int <- exp( model$coefficients[2] * model$val1 + > model$coefficients[3] * model$val2) > for(race in testdata$races){ > testlogdata$c_prob[testdata$code== race] <- > testdata$log_int[testdata$race== race] / > sum(testdata$log_int[testlogdata$race == race]) > } > --------------------------------------------------- > > Do you have any idea why this might be happening? Did I miss > something > in my calculation? > > Additionally, please notice the "Logistic Calibration" line on > graph1. > It appears almost perfect. My thought is that whatever transformation > the val.prob is doing to my predictions is helping. How would I > store/access those values? > > Once I can finalize the prediction of probabilities, then I can > focus on > the application to a betting model. Having a high level of confidence > in my models predictions is obviously the first step. > > I really appreciate it. > > Thanks! > > -Noah > > > > > ------------------------------------------------------------------------ > > ______________________________________________ > R-help@r-project.org <mailto:R-help@r-project.org> mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.