Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: st: RE: importing html to stata - http://www.wolfbane.com/icd/icd8h.htm
From
Nick Cox <[email protected]>
To
"'[email protected]'" <[email protected]>
Subject
RE: st: RE: importing html to stata - http://www.wolfbane.com/icd/icd8h.htm
Date
Thu, 8 Dec 2011 14:20:46 +0000
Regex aficionados would want to emphasise that
replace entry = regexm(text, "^\([E|N|Y][0-9]") if entry == 0
replace entry = regexm(text, "^[E|N|Y][0-9]") if entry == 0
is equivalent to
replace entry = regexm(text, "^\(*[E|N|Y][0-9]") if entry == 0
Nick
[email protected]
-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Nick Cox
Sent: 08 December 2011 13:44
To: '[email protected]'
Subject: RE: st: RE: importing html to stata - http://www.wolfbane.com/icd/icd8h.htm
Note that the code segment I gave is tailored to the document you specified. But the small tricks in using functions may well still be useful for other documents. BTW, I tried copy and paste first, but that failed for some reason.
Nick
[email protected]
-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Lars Folkestad
Sent: 08 December 2011 13:35
To: [email protected]
Subject: Re: st: RE: importing html to stata - http://www.wolfbane.com/icd/icd8h.htm
That was what i thought.
The best way to go is prob to find the ICD lists i need online, use your
infile suggestion and then merge the two data sets.
Thank you.
lars
Den 08/12/11 14.27 skrev "Nick Cox" <[email protected]>:
>No, not directly. You can only attach value labels to integers, remember.
>
>But the best way you can do something similar is likely to be with a
>-merge-.
>
>Nick
>[email protected]
>
>Lars Folkestad
>
>Thank you Nick, that worked fantastic.
>Is it possible to use this newly created dta file as a value label for
>another data set, so that if fx in my other data set i have a patient with
>the diagnosis 000.0 which is essential cholera, the essential cholera will
>show up as a value label?
>lars
>
>Den 08/12/11 14.11 skrev "Nick Cox" <[email protected]>:
>
>>Followed by
>>
>>gen var1 = word(text, 1)
>>gen var2 = subinstr(text, word(text, 1), "", 1)
>>
>>
>>Nick
>>[email protected]
>>
>>
>>-----Original Message-----
>>From: Nick Cox
>>Sent: 08 December 2011 13:07
>>To: '[email protected]'
>>Subject: RE: importing html to stata -
>>http://www.wolfbane.com/icd/icd8h.htm
>>
>>This seemed to work for me. It took a few iterations.
>>
>>infile str244 text using http://www.wolfbane.com/icd/icd8h.htm, clear
>>drop in 1/39
>>drop in 20827/L
>>compress
>>gen entry = regexm(text, "^\(*[0-9]")
>>replace entry = regexm(text, "^\([E|N|Y][0-9]") if entry == 0
>>replace entry = regexm(text, "^[E|N|Y][0-9]") if entry == 0
>>replace entry = sum(entry)
>>replace text = text[_n-1] + " " + text if entry == entry[_n-1]
>>sort entry, stable
>>by entry : keep if _n == _N
>>drop entry
>>
>>Nick
>>[email protected]
>>
>>
>>-----Original Message-----
>>From: [email protected]
>>[mailto:[email protected]] On Behalf Of Lars Folkestad
>>Sent: 08 December 2011 12:42
>>To: [email protected]
>>Subject: st: importing html to stata -
>>http://www.wolfbane.com/icd/icd8h.htm
>>
>>Dear List
>>
>>I would like to download the following html into stata:
>>http://www.wolfbane.com/icd/icd8h.htm
>>I would like to make the numbers from 000 - _n be var1 and the text from
>>cholera to be var2
>>
>>Can this be done through stata or do i need to go via a program of some
>>sort.
>>I've tried searching the list, and found one example:
>>http://www.stata.com/statalist/archive/2007-03/msg00535.html
>>But i must admit i did not understand what that was all about, and i dont
>>think it applies to my problem.
>
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/