Allan Reese (Cefas) wrote:
> A suggestion from myself and colleagues is therefore a much simpler
> "standard" for documenting data so that it is human-usable. It may also
> be immediately compatible between computers, but that depends on the
> file format and character coding conventions, etc. Most data can be
> reduced to a 2-way table (or sequence of linked tables). We therefore
> propose that a data table (think Stata dataset) should be documented
> with a second table that describes the fields (=columns=variables) as a
> Codebook, and a third table containing the discovery metadata as defined
> in ISO standards. It's very simple, not technical, but would promote
> more computer users to notice concepts like missing values and
> procedures used when coding.
>
Theres no real need for a secondary table describing columns and variables.
The beauty of Stata data files (if used correctly, which many of my
predecessors at my current place of work haven't done, grrr) is to use
the various -label- commands to describe the -label variables-, -label
data--, -label define/values- so the code-book is tied in with the data
itself.
More verbose comments can be attached to the data or variables
themselves with the -notes- command.
It should be habit to define these when importing and tidying data and
is good practice for reproducible research.
Neil
--
"We should make things as simple as possible, but not simpler" - Anon (not Albert Einstein)
***********************************************************************
This message may contain confidential and privileged information.
If you are not the intended recipient you should not disclose, copy
or distribute information in this e-mail or take any action in reliance
on its contents. To do so is strictly prohibited and may be unlawful.
Please inform the sender that this message has gone astray before
deleting it. Thank you.
2008 marks the 60th anniversary of the NHS. It's an opportunity to pay
tribute to the NHS staff and volunteers who help shape the service, and
celebrate their achievements.
If you work for the NHS and would like an NHSmail email account, go
to: www.connectingforhealth.nhs.uk/nhsmail
***********************************************************************
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/