|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
st: Panel Data with Truncation and Gaps
Dear Stata and statistics experts,
I am looking for a strategy to handle a large amount of panel data
that features both truncation and gaps. In particular I would like to
know how I might go about fitting a model to the data I have on hand.
Important features of the data are as follows:
1. It is population data generated from agent-based evolutionary
simulations . Each trial population has a series of observations
associated with it over the length of time that it was being run.
2. To conserve memory and processing time two data collection
shortcuts were used.
2a. Summary statistics from the population were collected on the
initial creation of the population, after running it for one
generation, and again after the second generation. Following this the
same statistics are collected every five generation until generation
100 at which point the simulation of the population ends. If the
population drops below two members then no more information is
collected either (There is no single-agent reproduction).
2b. If the population grew over 15000 members then summary statistics
were collected in the generation in which this occurred and then the
population was dropped.
3. There are a collection of variables that need to be taken into
account.
3a. Some of these are fixed throughout the trial (These include things
like the initial population size, the cost to live from generation to
generation, and the cost to spawn with another agent).
3b. Others change throughout the course of each simulation and are
randomly distributed at the beginning (These are the behaviours that
the agents exhibit under certain conditions. Over time as
opportunities to express these behaviours present themselves agents
with more good/useful beahviours get to spawn more, increasing the
likelihood that these useful behaviours will become more prevalent in
the population).
In particular I have two worries. First, that as successful
populations are truncated out, those that remain will bring down the
mean. Second, that a combination of successful population truncating
out and unsuccessful populations having few members with highly
similar behaviour sets will skew any investigation into which
behaviours are successful.
Any suggestions regarding possible models or methods for handling this
dataset or directions to possibly useful resources would be appreciated.
John Simpson
Department of Philosophy
University of Alberta, Canada
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/