Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
st: suppressing low frequency observations in tabulation
From
Kevin McConeghy <[email protected]>
To
[email protected]
Subject
st: suppressing low frequency observations in tabulation
Date
Wed, 24 Oct 2012 17:08:00 -0500
Hello everyone,
I have a large dataset, roughly 6.5mill obs, which is the FDA adverse
event database. Variable drugname is the string describing the drug.
. describe drugname
storage display value
variable name type format label variable label
---------------------------------------------------------------------------------------------------------------------------------------------------
drugname str30 %30s
I want to create a frequency table of the top 25 drug "offenders" in
the database, however I am having trouble figuring out how to get
STATA to perform the tab drugname command without including all the
low frequency observations from random drugs (which causes stata to
stop the command becuase "too many values"). I can't see an option for
this in the syntax. Any advice on how to filter out all the background
noise for this?
Kevin
--
Kevin McConeghy, PharmD
Infectious Diseases Fellow
University of Illinois College of Pharmacy
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/