Dear all,
I am considering methods of detecting fraud in a hypothetical clinical
trial with a large number of centres, but only a few patients per
centre.
In addition, many variables will be binary.
Would Cook's D be appropriate here?
Is it possible to calculate Mahalanobis' distance in Stata in order to
detect (possibly fraudulent) inliers, outliers and near duplicates in a
dataset?
If anyone has any ideas of other ways to detect possible fraud I would
love to hear from you too!
Thanks - Rachael
This email and any files transmitted with it are confidential. If you are not the intended recipient, any reading, printing, storage, disclosure, copying or any other action taken in respect of this email is prohibited and may be unlawful.
If you are not the intended recipient, please notify the sender immediately by using the reply function and then permanently delete what you have received.Incoming and outgoing email messages are routinely monitored for compliance with the Department of Healths policy on the use of electronic communications.
For more information on the Department of Healths email policy, click
http://www.dh.gov.uk/DHTermsAndConditions/fs/en?CONTENT_ID=4110945&chk=x1C3Zw
The original of this email was scanned for viruses by the Government Secure Intranet virus scanning service supplied by Cable&Wireless in partnership with MessageLabs. (CCTM Certificate Number 2007/11/0032.) On leaving the GSi this email was certified virus free.
Communications via the GSi may be automatically logged, monitored and/or recorded for legal purposes.
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/