Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
st: RE: String Variable With Probable Extra Space, Problems with Trim
From 
 
"Nick Cox" <[email protected]> 
To 
 
<[email protected]> 
Subject 
 
st: RE: String Variable With Probable Extra Space, Problems with Trim 
Date 
 
Wed, 2 Jun 2010 18:58:36 +0100 
You could report to us the variable type and string format of -state-
and -state1-. 
You could also check for tabs, e.g. 
assert strpos(state1, "`= char(9)'") == 0 
Nick 
[email protected] 
J Taylor
I have a variable of US state names that include what I think are some
extra spaces at the front and at the end of the variable value.
Instead of "California", I think the variable is "  California  ".  I
received the dataset in Excel and then saved it as a .csv file with
Excel 2007.
I say 'I think' above because I'm not 100% sure if there might be some
character that I can't see that is not a space.
I try using
gen state1 = trim(state)
but state1 continues to have "  California  " as a value instead of
"California."  This is super-weird, and I apologize I cannot provide
information about what is going on.
Might there be another way (other than trim) to extract the state name
so that there are no spaces?  I could look for a continuous group of
letter characters, but this may face disadvantages for the two word
states, e.g. "New Hampshire."  Any other suggestions about fixing this
problem?
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/