This is probably an easy question for some, but it's giving me a
headache.
I have N white bags and N black bags filled with 1000 marbles each.
Marbles comes in 10 colors. There is bag to bag variation in the
proportion of each color marble in a bag, and an unknown white and
black bag treatment effect on the distribution of proportions of
marbles. I only get to sample X marbles per bag.
We have a biological problem that is analogous to this. Scientists have
analyzed these data by ignoring bag variation and just focusing on
numbers of marbles sampled from white versus black.
I think this is not a good idea, but what is an appropriate Stata
estimator that accounts for the bag to bag random effect, the white and
black fixed effect, and models the count/categorical/proportion of
marbles as dependent variable?