> -----Original Message-----
> From: [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] On Behalf Of Tanya Yatsunenko
> Sent: Thursday, April 03, 2008 1:55 PM
> To: r-help@r-project.org
> Subject: [R] coding for categorical variables with unequal 
> observations
> 
> Hi,
> I am doing multiple regression, and have several X variables that are 
> categorical.
> I read that I can use dummy or contrast codes for that, but are there 
> any special rules when there're unequal #observations in each 
> groups (4 
> females vs 7 males in a "gender" variable)?
> Also, can R generate these codes for me?
> THanks.
> 

You don't need to do anything special, and yes you can just let SAS do it for 
you.  For most of the regression PROCs you can put your categorical variables 
in a CLASS statement.  Depending on which procedure you are using, you may be 
able to specify whether you want effects or dummy coding, and which level of 
the categorical variable should be the "comparison" level.  It is also possible 
to use PROC GLMMOD to create your design variables to be fed into other PROCs.  
Other approaches are possible as well.

If you provide more detail on what analyses you plan to undertake, someone may 
be able to provide more specific advice.

Hope this is helpful,

Dan

Daniel J. Nordlund
Research and Data Analysis
Washington State Department of Social and Health Services
Olympia, WA  98504-5204
 
 

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to