Stata
Products Purchase Support Company
Search
   >> Home >> Products >> Capabilities >> Data management Bookmark and Share

Data management

Creating Stata datasets

  • Input data from command line
  • Input data saved from spreadsheets
  • Read data using a dictionary
  • Read any type of ASCII data
  • Read and write data in the format required by the FDA for NDA submittals
  • Read and write XML-formatted data files, including those produced by Microsoft Excel
  • Convert datasets directly from other statistical packages, spreadsheets, and databases using third-party software

ODBC support

  • Import data from any ODBC data source, such as Access, Excel, Postgres, or MySQL
  • Export data to new or existing ODBC tables
  • Execute raw SQL commands individually or in batches
  • Support for ODBC on Windows, Mac, and Linux

Built-in spreadsheet editor (Updated)

  • For Windows, Mac, and Unix

Variables Manager New

  • Change storage types, names, and formats
  • Add and edit value labels
  • Attach notes to variables
  • Filter variables

Data-management functions

Data reorganization

  • Row–column transposition
  • Data reshaping
  • Stacking of variables
  • Collapsing into means, totals, etc.

Labels

  • Dataset labels
  • Variable labels
  • Value labels (e.g., male and female for 0 and 1)
  • Ability to switch between multiple sets of data, variable, and value labels
  • Missing-value labels
  • Support for multiple languages

Notes

  • Extensive notes can be attached to a dataset

Data snapshots New

  • Allow multiple levels of undo to modified datasets

Sorting

  • Ascending or descending sorts
  • Multiple-key sorts
  • Numeric and string sorts

Merging datasets

  • Merge datasets (Updated)
    • By key variables
    • By observations
  • Join datasets
  • Outer join
  • Append datasets (Updated)
  • Append time series

Special datasets

Utilities

  • Compress (make dataset as small as possible without loss of accuracy)
  • Formatted and unformatted disk I/O
  • Zip-file support New

Variable management

  • Generation of new variables
  • Replacement of existing variables
  • Encoding and decoding string variables

Dataset reports

  • Data signatures to verify the integrity of new data
  • Flexible description of variables, labels, and types
  • Codebooks for variables
  • Value-label reports
  • Duplicates and missing values (Updated)

Variable types

  • Byte
  • Integer (int)
  • Long
  • Float
  • Double
  • String
  • Dates
  • Dates and times

Saved results

  • Save results to disk for later use
  • Store up to 300 sets of results in memory
  • Create tables to compare results

See New in Stata 11 for more about what was added in Stata Release 11.

Stata 11
Overview: Why use Stata?
Stata/MP
64-bit Stata
Capabilities
Overview
Data management
Graphics
Basic statistics
Linear models
Binary and discrete outcomes
Panel data
Survey methods
Time series
Survival analysis
Epidemiology tools
Mixed models
GLM
ANOVA / MANOVA
Multiple imputation
Exact statistics
Nonparametric methods
Multivariate methods
Cluster analysis
Resampling
Model testing
Maximum likelihood
Other statistical methods
Programming
Matrix programming—Mata
Internet capabilities
Accessibility
Sample session
User-written commands
New in Stata 11
Supported platforms
Which Stata package?
Technical support
User comments
Products
Stata 11
Order Stata
Upgrade
Training
Bookstore
Stata Journal
Stata Press
Stata News
STB
Stat/Transfer
Gift Shop

Site overview
Products
Resources & support
Company
Site index

© Copyright 1996–2009 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index