[R] Help with simulation of unbalanced clustered data

Chao Liu Wed, 16 Dec 2020 01:08:39 -0800

Dear R experts,

I want to simulate some unbalanced clustered data. The number of clusters
is 20 and the average number of observations is 30. However, I would like
to create an unbalanced clustered data per cluster where there are 10% more
observations than specified (i.e., 33 rather than 30). I then want to
randomly exclude an appropriate number of observations (i.e., 60) to arrive
at the specified average number of observations per cluster (i.e., 30). The
probability of excluding an observation within each cluster was not uniform
(i.e., some clusters had no cases removed and others had more excluded).
Therefore in the end I still have 600 observations in total. How to realize
that in R? Thank you for your help!


Best,

Liu

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Help with simulation of unbalanced clustered data

Reply via email to