st: Imputation vs. Reducing Dataset?
Hello Statarians,
I have a very large dataset of population counts generated by a
computer simulation. To speed processing, populations that grew beyond
15,000 within the 100-generation limit were pulled from the
simulation. As a result, numerous populations now have missing data,
making my panels unbalanced.
I am curious how best to fit a model to these data given what is
missing. In particular, I have two worries:
1. That unless I do something, the missing values will cause any
procedure to misrepresent the actual situation, since the smaller
values that remain towards the end of the time period will skew the
mean downward. I am also curious whether this is a problem for
populations that died off early (do I need to carry the 0 through all
the remaining generations?).
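For concreteness, here is a minimal sketch of what I mean by carrying
the 0 through. The variable names popid (panel id), gen (generation),
and pop (count) are just placeholders I am using for this message, not
anything in my actual dataset:

    tsset popid gen
    tsfill, full                     // add rows for every popid-gen pair
    // carry a terminal 0 forward; -replace- runs in order within each
    // panel, so the zeros cascade down the remaining generations
    bysort popid (gen): replace pop = 0 if missing(pop) & pop[_n-1] == 0

As I understand it, populations that were pulled for exceeding 15,000
would keep their missing values under this sketch, since their last
observed count is positive.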
2. I am unsure whether imputation (with -ice-?), chopping the dataset
down, or both is the best way to proceed. I know that -ice- assumes
values are missing at random, but is there some way to impute the
missing values when I know exactly how the missingness is structured
(censoring at 15,000 and at extinction)?
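By "chopping" I mean something like the following sketch (same
placeholder variable names as above), which keeps only the generations
at which every population is still observed:

    // allseen = 1 for a generation only if no population is missing there
    bysort gen: egen byte allseen = min(!missing(pop))
    keep if allseen
    drop allseen

Since the missingness is monotone (once a population is pulled it
never returns), this amounts to truncating the panel at the first
generation where any population was removed.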
Thank you.
-John
John Simpson
Department of Philosophy
University of Alberta