|
Data management
|
data transformations,
match-merge,
ODBC,
XML,
by-group processing,
append files,
sort,
row–column transposition,
labeling,
saving results,
more |
Basic statistics
|
summaries,
cross-tabulations,
correlations,
t tests,
equality-of-variance tests,
tests of proportions,
confidence intervals,
more |
Linear models
|
regression;
bootstrap,
jackknife,
and robust Huber/White/sandwich variance estimates;
instrumental variables;
three-stage least squares;
constraints;
quantile regression;
GLS;
more |
Multilevel mixed-effects models
|
continuous, binary, and count outcomes; two-, three-, and
multi-way random-intercepts and random-coefficients
models; crossed random effects; ML and REML estimation; BLUPs of
effects and fitted values; hierarchical models;
more |
Binary, count, and limited dependent variables
|
logistic,
probit,
tobit;
Poisson and negative-binomial;
conditional,
multinomial,
nested,
ordered,
rank-ordered,
and stereotype logistic;
multinomial probit;
zero-inflated and zero-truncated count models;
selection models;
marginal effects;
more |
Panel data/cross-sectional time-series
|
random- and fixed-effects with robust standard errors,
linear mixed models,
random-effects probit,
GEE,
random- and fixed-effects Poisson,
dynamic panel data models,
and instrumental variables regression;
AR(1) disturbances;
more |
Generalized linear models (GLMs)
|
ten link functions,
user-defined links,
seven distributions,
ML and IRLS estimation,
nine variance estimators,
seven residuals,
more |
Nonparametric methods
|
Wilcoxon–Mann–Whitney,
Wilcoxon signed-rank, and Kruskal–Wallis tests;
Spearman and Kendall correlations;
Kolmogorov–Smirnov tests;
exact binomial CIs,
more |
Exact statistics
|
exact logistic and Poisson regression,
exact case–control statistics,
binomial tests,
Fisher’s exact test for r × c tables,
more |
ANOVA/MANOVA
|
balanced and unbalanced designs;
factorial, nested, and mixed designs;
repeated measures;
more |
Multivariate methods
|
factor analysis;
principal components;
discriminant analysis;
rotation;
multidimensional scaling;
Procrustean analysis;
correspondence analysis;
biplots;
dendrograms;
user-extensible analyses;
more |
Cluster analysis
|
hierarchical clustering;
k-means and k-medians nonhierarchical clustering;
dendrograms;
stopping rules;
user-extensible analyses;
more |
Resampling and simulation methods
|
bootstrapping,
jackknife, and Monte Carlo simulation;
permutation tests;
more |
Model testing and postestimation support
|
Wald tests;
LR tests;
linear and nonlinear combinations, tests, and predictions;
marginal effects;
adjusted means;
Hausman tests;
more |
|
Graphics
|
line charts, scatterplots,
bar charts,
pie charts,
hi–lo charts,
Graph editor,
regression diagnostic graphs,
survival plots,
nonparametric smoothers,
distribution Q–Q plots,
more |
Survey methods
|
sampling weights,
multistage designs;
stratification,
poststratification;
deff;
means,
proportions,
ratios,
totals;
summary tables;
bootstrap,
jackknife,
and linearization-based variance estimation;
regression,
instrumental variables,
probit,
Cox regression;
more |
Survival analysis
|
Kaplan–Meier and
Nelson–Aalen estimators,
Cox regression (frailty);
parametric models (frailty);
hazards;
time-varying covariates;
left and right censoring,
Weibull,
exponential,
and Gompertz analysis;
sample size and power analysis;
more |
Tools for epidemiologists
|
standardization of rates,
case–control,
cohort,
matched case–control,
Mantel–Haenszel,
pharmacokinetics,
ROC analysis,
ICD-9-CM,
more |
Time series
|
ARIMA,
ARCH/GARCH,
VAR,
VECM,
high-frequency data,
correlograms,
periodograms,
white-noise tests,
unit root tests,
Holt–Winters smoothers,
Haver Analytics data,
rolling and recursive estimation,
more |
Maximum likelihood
|
user-specified functions;
NR, DFP, BFGS, BHHH;
OIM, OPG, robust, bootstrap, and jackknife matrices;
Wald tests;
survey data;
numeric or analytic derivatives;
more |
Transforms and normality tests
|
Box–Cox transforms,
power transforms,
Shapiro–Wilk and Shapiro–Francia tests,
more |
Other statistical methods
|
sample size and power,
nonlinear regression,
imputations,
stepwise regression,
statistical and mathematical functions,
more |
Programming language
|
adding new commands,
command scripting,
if,
while,
command parsing,
debugging,
menu and dialog-box programming,
markup and control language,
more |
Matrix programming—Mata
|
interactive sessions,
large-scale development projects,
optimization,
matrix inversions,
decompositions,
eigenvalues and eigenvectors,
LAPACK engine,
real and complex numbers,
string matrices,
interface to Stata datasets and matrices,
more |
Internet capabilities
|
ability to install new commands,
web updating,
web file sharing,
latest Stata news,
more |
Accessibility
|
Section 508 compliance,
accessibility for persons with disabilities |
Sample session
|
A sample session of Stata |
User-written commands
|
User-written commands for meta-analysis; data management; survival;
econometrics; more |
|