Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Combining multiple imputation with propensity score matching

From	David Kantor <[email protected]>
To	[email protected]
Subject	Re: st: Combining multiple imputation with propensity score matching
Date	Tue, 02 Mar 2010 12:24:12 -0500

Hi.

As the author of mahapick, I would like to mention that, indeed, itdoes not pick unique matches. (This could be an avenue for future development.)You can specify that it generates a multitude of match candidates,which is virtually a queue, in order of closeness, of possiblematches for each primary ("treated") case. You then can take this andrun a loop that visits primary cases in a random order. For each such case,

 select the best candidate for the given primary case;

remove that selected match as a candidate for use in later passesthrough the loop.

I recommend that if you want more than one match (say 3) per primarycase, that you run this loop several (3) times (maintaining the samedata structure that disqualifies candidates from future matching) --rather than selecting, say, the best 3 matches for each case in onepass through the loop. The latter method might enable earlier casesin the loop to grab better matches.

Of course, this has a random element to the process. You may or maynot like that. But you need some way of deciding who gets a givencandidate if it is matched to more than one primary case.

I had done this selection process once, several years ago; I might beable to dig up the code if necessary. My co-worker also had a plan tosomehow optimize the process by swapping matches in order to minimizethe sum of the distances. That was too complex to be done in Stata,and we abandoned it. I understand that the task was taken up byothers (in C, I suppose), but the result was no better than theoriginal random process.


HTH
--David

At 11:17 AM 3/2/2010, John E. Cornell wrote:

Dear Stata Folks:
I have a large, and somewhat complicated multi-site dataset, thatrequires the use of multiple imputation to fill-in missing labvalues that I need to generate propensity scores for three classesof drugs. I used the new multiple imputation procedure based onmultivariate normal regression to fill-in the missing lab values. Wecreated 20 imputed datasets if the flong format, and used logisticregression to compute and save the propensity scores in logit formwithin each imputed set. We used mahapick to select to match cases(being on one or more of the three agents) to controls (never on anyof the three agents). This worked well, but there are two problemswe encountered at this stage. First, the procedure selects theclosest match actual distance may be very large so we needed to editthe matches to maintain a subset of cases with reasonable closeness.Second, the procedure may match the same control to more than onecase, so we needed to restrict the sample to unique matches.Finally, the number of matches varied between imputed sets.
It does not appear that the mi estimate command can handle thissituation. So, we are left with the prospect of writing our own codeto compute and combine the model estimates. We are relatively noviceStata programmers at the moment, and we would welcome anysuggestions, references, etc. that the Stata community could providethat will help us solve this problem.
Cheers,


John E. Cornell, Ph.D.
Professor
Department of Epidemiology and Biostatistics
University of Texas Health Science Center, San Antonio
7703 Floyd Curl Drive
San Antonio, Texas 78229-3900
[...]


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: Combining multiple imputation with propensity score matching
  - From: Austin Nichols <[email protected]>

References:
- st: Combining multiple imputation with propensity score matching
  - From: "Cornell, John E" <[email protected]>

Prev by Date: Re: st: Combining multiple imputation with propensity score matching
Next by Date: Re: st: AW: difference in odds ratio
Previous by thread: Re: st: Combining multiple imputation with propensity score matching
Next by thread: Re: st: Combining multiple imputation with propensity score matching
Index(es):
- Date
- Thread