<>
You should turn the string into a numeric variable via -encode-. Then -egen-
can go to work. Also provide an excerpt of your data and show what you want
to happen to them...
HTH
Martin
-----Ursprüngliche Nachricht-----
Von: [email protected]
[mailto:[email protected]] Im Auftrag von joe j
Gesendet: Donnerstag, 8. Oktober 2009 11:57
An: [email protected]
Betreff: st: egen(mean or suchlike) for a string variable
In my data I have a string variable "University", which lists
university names. In some years the names are missing. Two other
variables I've are "Professor" and "Year". The same "Professor" and
"University" can occur multiple times in a year.
The problem I have is that there are quite a few University names that
are missing. What I want to do is to replace as many missing
University names as possible, by assuming that: when a professor is
linked to a university at least once in a year, she is linked to the
same university during that year - so the missing university name when
her name occurs again in the same year can be replaced (why there are
missing university names is a complicated story:)).
Any suggestion would be appreciated.
Best,
JJ
I tried the following in Stata (it's foolish, I know):
bysort year professor: egen University_all=mean(University)
But I get the warning "type mismatch".
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/