SAS, SPSS. and S-Plus allow value labels for string variables. Also they
allow the development of value labels independent of the database being
value labeled. (Proc Format)
STATA does not, at least not without some (considerable) rigamorrole.
Maybe Stata people will fix this.
At present the soltuion (that is quickest) seems to be developing a new
variable using the valkue label as a variable value. This is not
database-wise efficient. And these labels are not easily reduced to short
strings; subtle disease distinctions are difficult to reduce to a few
characters.
I don't see the issue here. Say that you have a million records, and one
string variable recorded therein is str2 state, AK..WY [DC PR]. You do not
want to store the 'long name' of the state in the database, so you set up a
new dataset with 50, 51 or 52 cases, containing str2 state. You encode
state into int statename, and you define a value label containing 50, 51 or
52 values for that integer variable, containing the long names of states.