|  | 
[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]
Re: st: Fraud methods in Stata
On 26 Sep 2008, at 16:20, Williams, Rachael wrote:
If anyone has any ideas of other ways to detect possible fraud I would
love to hear from you too!
Having been peripherally involved in a case of academic fraud, there  
are two interesting tendencies that people show when inventing data:
1. They do not understand random variability. Typically, invented data  
has a smaller dispersion than real data, and the distribution tends to  
be uniform rather than normal. People often confuse 'random' with  
'evenly distributed'. In particular, differences between measurements  
supposedly taken on different occasions has a very narrow distribution.
2. The previous observation influences the value if the person is  
inventing data at the keyboard. They try to make each observation  
different, causing a correlation between the values for adjacent  
observations.
In addition, watch out for data organised in a way that makes it hard  
to transcribe from the written source but easy to enter on a computer.  
In this case, instead of all the pretest items from a questionnaire  
being entered, followed by the post-test items, each item was entered  
with the pretest and post-test values in adjacent columns. Clearly,  
this would be hopelessly inefficient if you really had the  
questionnaires, but easy if you were inventing the data to show  
pretest-posttest differences.
Ronan Conroy
=================================
[email protected]
Royal College of Surgeons in Ireland
Epidemiology Department,
Beaux Lane House, Dublin 2, Ireland
+353 (0)1 402 2431
+353 (0)87 799 97 95
+353 (0)1 402 2764 (Fax - remember them?)
http://www.flickr.com/photos/ronanconroy/sets/72157601895416740/
P    Before printing, think about the environment
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/