Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: RE: Dropping parts of strings (i.e. Baldwin County becomes Baldwin)


From   Jeph Herrin <[email protected]>
To   [email protected]
Subject   Re: st: RE: Dropping parts of strings (i.e. Baldwin County becomes Baldwin)
Date   Thu, 06 Mar 2014 14:00:44 -0500

Never mind, I forgot about the multi-name entities.

J


On 3/6/2014 1:56 PM, Jeph Herrin wrote:


Or, simply:

  replace county=word(county,1)

?

hth,
Jeph


On 3/6/2014 1:40 PM, Radwin, David wrote:
Cody,

Would something simpler like this work? Note the compound quotation
marks.

foreach x in `"County"' `"Borough"' `"Township"' `"Census Area"' {
    replace county = trim(subinstr(county, "`x'", "", .))
    }

David
--
David Radwin, Senior Research Associate
Education and Workforce Development
RTI International
2150 Shattuck Ave. Suite 800, Berkeley, CA 94704
Phone: 510-665-8274

www.rti.org/education


-----Original Message-----
From: [email protected] [mailto:owner-
[email protected]] On Behalf Of Andrew Maurer
Sent: Thursday, March 06, 2014 10:31 AM
To: [email protected]
Subject: RE: st: RE: Dropping parts of strings (i.e. Baldwin County
becomes Baldwin)

No, that's why I gave united_kingdom as an example. It:
- removes everything including and after the first + sign observed
- replaces all remaining underscores with spaces
- trims the leading and trailing spaces

"__united_kingdom__+___county" becomes "united kingdom"

There would be problems if you had a country with a + sign in the name,
but I don't expect you'll have that scenario.

Andrew Maurer

-----Original Message-----
From: [email protected] [mailto:owner-
[email protected]] On Behalf Of Cody Cook
Sent: Thursday, March 06, 2014 12:24 PM
To: [email protected]
Subject: Re: st: RE: Dropping parts of strings (i.e. Baldwin County
becomes Baldwin)

Hi Andrew,

Thanks for the response! I think this route is promising, but (just by
looking at the code, I haven't run it) it looks like it may have trouble
with counties that have two words in their name, or no? Is there a
way to
account for this?

Best,
Cody



On Mar 6, 2014, at 11:13 AM, Andrew Maurer <[email protected]>
wrote:

Maybe not the most efficient way, but this works (replaces internal
underscores with spaces):

local name __united_kingdom__+___county
di trim(subinstr(substr(`"`name'"',1,strpos(`"`name'"',"+")-1),"_","
",.))

or for if your variable is named "oldname":
gen newname = trim(subinstr(substr(oldname,1,strpos(oldname,"+")-
1),"_"," ",.))

(if you want internal underscores then gen newname2 =
subinstr(newname,"
","_",.)

Andrew Maurer

-----Original Message-----
From: [email protected] [mailto:owner-
[email protected]] On Behalf Of Cody Cook
Sent: Thursday, March 06, 2014 11:02 AM
To: [email protected]
Subject: st: Dropping parts of strings (i.e. Baldwin County becomes
Baldwin)

Hi Stata Helpers,

I have a ton of census data that I'm trying to merge with some other
data. The other data only has county names, not FIPS.

For the census data, it is formatted as __name___ + ___ census type ___
where census type is either "County", "Borough", "Census Area" and
maybe a
few more. This is all in one string. I want to only keep the part that
identifies the specific county.

Basically, is there a way to say "if county includes "XXXX" replace
without "XXXX""

Best,

Cody

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index