Re: [R] Regression model for predicting ranks of the dependent variable

Greg Snow Mon, 16 Sep 2013 16:56:29 -0700

You might consider Projection Pursuit Regression (ppr function).  Since the
ranking is just a monotone transformation of the underlying score, ppr can
estimate the transformation and the contribution of the terms into the
score.



On Mon, Sep 16, 2013 at 11:53 AM, Saumya Gupta <saumya.gu...@outlook.com>wrote:

> I have a training dataset which contains statistics of football players
> for the year 2009, and their ranks for the year 2010. For example:
>
>    Player No. of goals No. of matches Age Rank (in 2010)  A 5 1 35 1  B 2
> 4 23 2  C 1 7 26 3  D 6 4 52 5  E 3 2 45 4
>
> The above ranks have been calculated on the basis of a 'score' (which is
> unknown) given to each player, which is a function of the 3 variables. It
> could be something like:
> Score = (No. of goals/No. of matches) - Age^(1/2)
> After you arrange the scores in descending order, you get the ranks. The
> scores are not known and neither is the formula for calculating it. Only
> the ranks are given.
>
> Now, I have statistics for football players for the year 2013, and I have
> to predict their ranks for the year 2014, which should give the same result
> if the formula used in 2010 were used. Calculating their scores is not
> necessary and even finding out the formula is not the objective. The
> objective is just to predict their ranks. But, finding the exact formula
> for calculating scores will be a bonus.
>
> ------------------------------
> Date: Mon, 16 Sep 2013 10:20:08 -0600
> Subject: Re: [R] Regression model for predicting ranks of the dependent
> variable
> From: 538...@gmail.com
> To: saumya.gu...@outlook.com
> CC: r-help@r-project.org
>
>
> What question (or questions) are you trying to answer?  Any advice we may
> give will depend on what you are trying to accomplish.
>
>
> On Sat, Sep 14, 2013 at 2:12 PM, Saumya Gupta <saumya.gu...@outlook.com>wrote:
>
> I have a dataset which has several predictor variables and a dependent
> variable, "score" (which is numeric). The score for each row is calculated
> using a formula which uses some of the predictor variables. But, the
> "score" figures are not explicitly given in the dataset. The scores are
> only arranged in ascending order, and the ranks of the numbers are given
> (like 1, 2, 3, 4, etc.; rank 1 means that the particular row had the
> highest score, 2 means it had the second highest score and so on). So, if
> the data has 100 rows, the output has ranks from 1 to 100.
> I don't think it would be proper to treat the output column as a numeric
> one, since it is an ordinal variable, and the distance (difference in
> scores) between ranks 1 and 2 may not be the same as that between ranks 2
> and 3. However, most R regression models for ordinal regression are made
> for output such as (high, medium, low), where each level of the output does
> not necessarily correspond to a unique row. In my case, each output (rank)
> corresponds to a unique row.
> So please suggest me what models I could use for this problem. Will
> treating the output as numeric instead of ordinal be a reasonable
> approximation? Or will the usual models for ordinal regression work on this
> dataset as well?
>         [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
>
>
>
> --
> Gregory (Greg) L. Snow Ph.D.
> 538...@gmail.com
>



-- 
Gregory (Greg) L. Snow Ph.D.
538...@gmail.com

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Regression model for predicting ranks of the dependent variable

Reply via email to