Stata: Software for Statistics and Data Science

Home /
Software /
Stata

Fast. Accurate. Easy to use. Stata is a complete, integrated software package that provides all your data science needs—data manipulation, visualization, statistics, and automated reporting.

Why Stata?

Fast. Accurate. Easy to use. Stata is a complete, integrated software package that provides all your data science needs—data manipulation, visualization, statistics, and automated reporting.

Master your data

Stata's data management features give you complete control.

Frames — manage multiple datasets simultaneously
Import, export
JDBC, ODBC, SQL
Sort, match, merge, join, append, create
Built-in spreadsheet
Unicode
Process text or binary data
Access data locally or on the web
Collect statistics across groups
BLOBs—strings that can hold entire documents
Billions of observations
Hundreds of thousands of variables
Survival data, panel data, multilevel data, survey data, discrete choice data, multiple-imputation data, categorical data, time-series data

And much more, to support all your data science needs.

Explore all of Stata's data management features »

Broad suite of statistical features

Publication-quality graphics

Stata makes it easy to generate publication-quality, distinctly styled graphs.

You can point and click to create a custom graph. Or you can write scripts to produce hundreds or thousands of graphs in a reproducible manner. Export graphs to EPS or TIFF for publication, to PNG or SVG for the web, or to PDF for viewing. With the integrated Graph Editor, you click to change anything about your graph or to add titles, notes, lines, arrows, and text.

Automated reporting

All the tools you need to automate reporting your results.

Dynamic Markdown documents
Create Word documents
Create PDF documents
Create Excel files
Customizable tables
Schemes for graphics
Word, HTML, PDF, SVG, PNG

Truly reproducible research

Lots of folks talk about reproducible research.
Stata has been dedicated to it for over 30 years.

We constantly add new features; we have even fundamentally changed language elements. No matter. Stata is the only statistical package with integrated versioning. If you wrote a script to perform an analysis in 1985, that same script will still run and still produce the same results today. Any dataset you created in 1985, you can read today. And the same will be true in 2050. Stata will be able to run anything you do today.

We take reproducibility seriously.

Real documentation

When it comes time to perform your analyses or understand the methods you are using, Stata does not leave you high and dry or ordering books to learn every detail.

Each of our data management features is fully explained and documented and shown in practice on real examples. Each estimator is fully documented and includes several examples on real data, with real discussions of how to interpret the results. The examples give you the data so you can work along in Stata and even extend the analyses. We give you a Quick start for every feature, showing some of the most common uses. Want even more detail? Our Methods and formulas sections provide the specifics of what is being computed, and our References point you to even more information.

Stata is a big package and so has lots of documentation – over 18,000 pages in 35 manuals. But don't worry, type help my topic, and Stata will search its keywords, indexes, and even community-contributed packages to bring you everything you need to know about your topic. Everything is available right within Stata.

Access the documentation online »

Trusted

We don't just program statistical methods, we validate them.

The results you see from a Stata estimator rest on comparisons with other estimators, Monte Carlo simulations of consistency and coverage, and extensive testing by our statisticians. Every Stata we ship has passed a certification suite that includes 4.1 million lines of testing code that produces 5.8 million lines of output. We certify every number and piece of text from those 5.8 million lines of output.

Easy to use

All of Stata's features can be accessed through menus, dialogs, control panels, a Data Editor, a Variables Manager, a Graph Editor, and even an SEM Diagram Builder. You can point and click your way through any analysis.

If you don't want to write commands and scripts, you don't have to.

Even when you are pointing and clicking, you can record all your results and later include them in reports. You can even save the commands created by your actions and reproduce your complete analysis later.

Easy to grow with

Stata's commands for performing tasks are intuitive and easy to learn. Even better, everything you learn about performing a task can be applied to other tasks. For example, you simply add if gender=="female" to any command to limit your analysis to females in your sample. You simply add vce(robust) to any estimator to obtain standard errors and hypothesis tests that are robust to many common assumptions.

The consistency goes even deeper. What you learn about data management commands often applies to estimation commands, and vice-versa. There is also a full suite of postestimation commands to perform hypothesis tests, form linear and nonlinear combinations, make predictions, form contrasts, and even perform marginal analysis with interaction plots. These commands work the same way after virtually every estimator.

Sequencing commands to read and clean data, then to perform statistical tests and estimation, and finally to report results is at the heart of reproducible research. Stata makes this process accessible to all researchers.

Find out how »

Easy to automate

Everyone has tasks that they do all the time—create a particular kind of variable, produce a particular table, perform a sequence of statistical steps, compute an RMSE, etc. The possibilities are endless. Stata has thousands of built-in procedures, but you may have tasks that are relatively unique or that you want done in a specific way.

If you have written a script to perform your task on a given dataset, it is easy to transform that script into something that can be used on all your datasets, on any set of variables, and on any set of observations.

See how easy automation is in Stata »

Easy to extend

Some of the things you automate may be so useful that you want to share them with colleagues or even make them available to all Stata users. That's also easy. With just a little code, you can turn an automation script into a Stata command. A command that supports standard features that Stata's official commands support. A command that can be used in the same way official commands are used.

Take a look »

Advanced programming

Stata also includes an advanced programming language—Mata.

Mata has the structures, pointers, and classes that you expect in your programming language and adds direct support for matrix programming.

Though you don't need to program to use Stata, it is comforting to know that a fast and complete programming language is an integral part of Stata. Mata is both an interactive environment for manipulating matrices and a full development environment that can produce compiled and optimized code. It includes special features for processing panel data, performs operations on real or complex matrices, provides complete support for object-oriented programming, and is fully integrated with every aspect of Stata. Stata also has comprehensive Python integration, allowing you to harness all the power of Python directly from your Stata code.

Learn more about Mata »

Stata also has PyStata, which provides comprehensive Python integration, allowing you to harness all the power of Python directly from your Stata code and to harness all the power of Stata from your Python code.

Stata even let's you incorporate C, C++, and Java plugins in your Stata programs via a native API for each language. And you can even embed Java code directly in your Stata code!

Get the most out of your multicore computer.
No other statistical software comes close.
Enjoy the new features of Stata at top speed.

Learn more »

Community-contributed features

Stata is so programmable that developers and users add new features every day to respond to the growing demands of today's researchers.

With Stata's Internet capabilities, new features and official updates can be installed over the Internet with a single click.

World-class technical support

All registered users of the current release of Stata (Stata 18) are eligible for free technical support. If you have not registered your copy of Stata, please fill out the online registration form.

We have a dedicated staff of expert Stata programmers and statisticians to answer your technical questions. From tricky data management solutions to getting your graph looking just right and from explaining a robust standard error to specifying your multilevel model, we have your answers.

Cross-platform compatible

Stata will run on Windows, Mac, and Linux/Unix computers; however, our licenses are not platform specific.

That means if you have a Mac laptop and a Windows desktop, you don't need two separate licenses to run Stata. You can install your Stata license on any of the supported platforms. Stata datasets, programs, and other data can be shared across platforms without translation. You can also quickly and easily import datasets from other statistical packages, spreadsheets, and databases.

View compatible operating systems »

Widely used

Used by researchers for more than 30 years, Stata provides everything you need for data science—data manipulation, visualization, statistics, and automated reporting.

Select your discipline and see how Stata can work for you.

Behavioral sciences

Education

Institutional research

Public health

Biostatistics

Epidemiology

Medicine

Public policy

Data science

Finance, business, and marketing

Political science

Sociology

Economics

Can't find your discipline? See who else is using Stata »

Stata is distributed in more than 180 countries. Our authorized distributors offer services such as basic technical support and training, and carry Stata inventory so delivery is fast.

Comprehensive resources

Video tutorials

Stata's YouTube channel is the perfect resource for new users to Stata, users wanting to learn a new feature in Stata, and professors looking for aids in teaching with Stata. We have over 300 videos on our YouTube channel that have been viewed over 6 million times by Stata users wanting to learn how to label variables, merge datasets, create scatterplots, fit regression models, work with time-series or panel data, fit multilevel models, analyze survival data, perform Bayesian analylsis, and use many other features of Stata. View the complete list of videos.

Visit our YouTube channel »

Stata Blog

We write the official Stata Blog, Not Elsewhere Classified (NEC), to share things we think you will find instructive, informative, or just plain entertaining. We have written about how to interpret statistical results; export results into Word, Excel, and LaTeX; perform Monte Carlo simulations; program your own estimators; and more. We also post service and product announcements. Individually signed, the articles in NEC are written by the same people who develop and support Stata.

Read our latest blog post »

Free Stata webinars

Stata webinars offer something for everyone. Those new to Stata will get a head start when they join our Ready. Set. Go Stata webinar. Both new and experienced users will want to join our Tips and Tricks webinar and our one-hour feature webinars; each one provides an in-depth look at one of Stata's statistical, graphical, data management, or reporting features.

View the current webinar offerings »

Training

A multitude of training options are available to become proficient at Stata quickly. Stata provides hands-on classroom and web-based training courses, customized on-site training courses, and online training through NetCourses, webinars, and video tutorials.

View available trainings »

Stata Press

Stata Press® publishes books, manuals, and journals about Stata and general statistics topics for professional researchers of all disciplines. Stata Press® publications, along with books recommended by StataCorp, can be found in the Stata Bookstore.

Visit the Stata Bookstore »

Stata News

The Stata News is a free publication with columns such as the popular In the Spotlight, where Stata developers give insight into specific Stata features, and the User's corner, where we share unique, helpful, and fun contributions from the user community. The News also contains announcements such as new releases and updates, training schedules, new books, Conferences, and Users Group meetings.

Stata Journal

The Stata Journal is a quarterly publication containing articles about statistics, data analysis, teaching methods, and effective use of Stata's language. The Journal publishes reviewed papers together with shorter notes and comments, regular columns, book reviews, and other materials of interest to researchers applying statistics in a variety of disciplines.

Subscribe to the Stata Journal »

Vibrant community

Stata Conferences

Whether you are a beginner or an expert, you will find something just for you at Stata conferences, which are held each year in various locations around the world. These meetings showcase in-depth presentations from StataCorp experts and experienced Stata users that focus on helping you use Stata more effectively.

View upcoming meetings »

Statalist

A great resource for users is Statalist, a forum where more than 40,000 Stata users exchange roughly 4,500 postings and responses each month. Statalist is run and moderated by Stata users and maintained by StataCorp.

Join the forum »

User comments

Our users love to share how great Stata is, so we'd like to show you! When we receive nice comments about Stata, we post them here. If you think Stata is great too, send us an email with your comment, and we may share it with the Stata community.

See what users are saying »

Affordable

Stata is not sold in modules, which means you get everything in one package!

Stata offers several purchase options to fit your budget. Annual and multiyear subscriptions ensure you always have access to the latest features. Contact a sales representative or browse our products to find out more about our affordable options. You can also download a product brochure.

Learn which Stata is right for you »

Stata is a complete, integrated software package that provides all your data science needs—data manipulation, visualization, statistics, and automated reporting. Stata is not sold in modules, which means you get everything you need in one package.

Whether you're a student or a seasoned research professional, a range of Stata packages are available and designed to suit all needs.

All of the following editions of Stata have the same complete set of commands and features and manuals included as PDF documentation within Stata.

Stata/MP: The fastest edition of Stata (for quad-core, dual-core, and multicore/multiprocessor computers) that can analyze the largest datasets
Stata/SE (Standard Edition): Standard edition; for larger datasets
Stata/BE (Basic Edition): Basic edition; for mid-sized datasets
Comparison of features

Stata/MP is the fastest and largest edition of Stata. Virtually any current computer can take advantage of the advanced multiprocessing of Stata/MP. This includes the Intel i3, i5, i7, i9, Xeon, and Celeron, and AMD multi-core chips. On dual-core chips, Stata/MP runs 40% faster overall and 72% faster where it matters, on the time-consuming estimation commands. With more than two cores or processors, Stata/MP is even faster.

Stata/MP is faster-much faster. Stata/MP lets you analyze data in one-half to two-thirds of the time compared to Stata/SE on inexpensive dual-core laptops and in one-quarter to one-half the time on quad-core desktops and laptops.

Stata/MP runs even faster on multiprocessor servers. Stata/MP supports up to 64 processors/cores.

Speed is often most crucial when performing computationally intense estimation procedures. A few of Stata's estimation procedures, including linear regression, are nearly perfectly parallelized, meaning they run twice as fast on two cores, four times as fast on four cores, eight times as fast on eight cores, and so on. Some estimation commands can be parallelized more than others. Taken at the median, estimation commands run 1.8 times faster on 2 cores, 2.9 times faster on 4 cores, and 4.1 times faster on 8 cores.

Stata/MP is 100% compatible other editions of Stata. Analyses do not have to be reformulated or modified in any way to obtain Stata/MP’s speed improvements.

Stata/MP is available for the following operating systems:

Windows (64-bit processors);
macOS (64-bit Intel processors);
Linux (64-bit processors);

To run Stata/MP, you can use a desktop computer with a dual-core or quad-core processor, or you can use a server with multiple processors. Whether a computer has separate processors or one processor with multiple cores makes no difference. More processors or cores makes Stata/MP run faster.

For more advice on purchasing/upgrading to Stata/MP or for hardware queries, please contact our sales team.

Return to menu

Stata/SE and Stata/BE differ only in the dataset size that each can analyse. Stata/SE and Stata/MP can fit models with more independent variables than Stata/BE (up to 65,532). Stata/SE can analyse up to 2 billion observations.

Return to menu

Stata/BE allows datasets with as many as 2,048 variables. The maximum number of observations is 2.14 billion. Stata/BE can have at most 798 independent variables in a model.

Return to menu

Comparison of features

Maximum number of variables

Info

2,048

32,767

120,000

Maximum number of observations

Info

2.14 billion

Up to 20 billion

Maximum number of independent variables

Info

798

10,998

65,532

Multicore support

Time to run logistic regression with 5 million obs and 10 covariates Info

1-core

10.0 sec

1-core

10.0 sec

2 core

5.0 sec

4 core

2.6 sec

even faster

Complete suite of statistical features

Info

Yes

Publication-quality graphics

Info

Yes

Matrix programming language

Yes

Complete PDF documentation

Info

Yes

Exceptional technical support

Yes

Includes within-release updates

Yes

64-bit version available

Yes

Windows, Mac, or Unix

Info

Yes

Memory requirements

1 GB

2 GB

4 GB

Disk space requirements

2 GB

For Business License terms and options please click here.

For Educational License terms and options please click here.

Return to menu

New Features in Stata

Stata has something for everyone. Below we list the highlights of the latest Stata 18 release. This release is unique because most of the new features can be used by researchers in every discipline.

Spatial autoregressive models

Statistical calculators

Cross-tabulations and chi-squared tests calculator
One-sample t tests calculator
Two-sample t tests calculator
Incidence-rate ratios calculator
Odds-ratios calculator
Risk-ratios calculator

Survey data analysis

Basic introduction to the analysis of complex survey data
Specifying the design of your survey data

How to download, import, and merge multiple datasets from the NHANES website
How to download, import, and prepare data from the NHANES website

Multilevel models for survey data
Survey data support for SEM

Survival analysis

Interval-censored survival models

Learn how to set up your data for survival analysis
How to describe and summarize survival data

How to construct life tables
How to calculate incidence rates and incidence-rate ratios
How to calculate the Kaplan-Meier survivor and Nelson-Aalen cumulative hazard functions
How to graph survival curves
How to test the equality of survivor functions using nonparametric tests
How to fit a Cox proportional hazards model and check proportional-hazards assumption

Multilevel survival analysis
Panel-data survival models
Survival models for SEM
Treatment effects for survival models

Time series

Import FRED (Import Federal Reserve Economic Data)
Threshold regression
Tests for multiple breaks in time series

Tour of forecasting
Formatting and managing dates
Time-series operators
Correlograms and partial correlograms
Line graphs and tin()
Introduction to ARMA/ARIMA models
Markov-switching models
Moving-average smoothers

Treatment effects

Tour of treatment effects

Introduction to treatment effects in Stata: Part 1
Introduction to treatment effects in Stata: Part 2

Treatment effects: Regression adjustment
Treatment effects: Inverse-probability weighting
Treatment effects: Inverse-probability weighted regression adjustment
Treatment effects: Augmented inverse-probability weighting
Treatment effects: Nearest-neighbor matching
Treatment effects: Propensity-score matching

Treatment effects for survival models
Endogenous treatment effects

All editions of Stata run on dual-core, multi-core and multi-processor computers.

Stata for Windows: Stata requires 64-bit Windows for x86-64 processors made by Intel® and AMD

Windows 11 *
Windows 10 *
Windows Server 2022, 2019, 2016, 2012R2

*Stata requires 64-bit Windows for x86-64 processors made by Intel® or AMD (Core i3 equivalent or better)

Stata for Mac

Mac with Apple Silicon or 64-bit Intel processor
MacOS 11.0 (Big Sur) or newer for Macs with Apple Silicon and MacOS 10.12 (High Sierra) or newer for Macs with 64-bit Intel processors

Stata for Linux

Linux: Any 64-bit (x86-64 or compatible) running Linux.
Minimum requirements include the GNU C library (glib) 2.17 or better and libcurl4
For xstata, you need to have GTK 2.24 installed

Hardware requirements

Minimum of 1 GB of RAM for Stata/BE, 2GB for Stata/SE and 4GB for Stata/MP
Minimum of 2 GB of disk space for all editions

Find out all about Stata’s expansive range of statistical features using the table of contents below. Each section links to further details and examples to help users get the best out of their software.

Linear models	Time series	Data management	Longitudinal data/panel data
Survival analysis	Graphics	Multilevel mixed-effects models	Bayesian analysis
Documentation	Graphical user interface	Binary, count, and limited outcomes	Power and sample size
Basic data science	Extended regression models (ERMs)	Treatment effects/Causal inference	Other statistical methods
Generalized linear models (GLMs)	SEM (structural equation modeling)	Functions	Finite mixture models (FMMs)
Internet capabilities	Spatial autoregressive models	Latent class analysis	community-contributed commands
Programming features	ANOVA/MANOVA	Multiple imputation	Mata – Stata’s serious programming language
Exact data science	Survey methods	Embedded statistical computations	Linearized DSGE models
Cluster analysis	Installation Qualification	Tests, predictions, and effects	IRT (item response theory)
Accessibility	Contrasts, pairwise comparisons, and margins	Multivariate methods	Nonparametric methods
GMM and nonlinear regression	Epidemiology	Simple maximum likelihood	Programmable maximum likelihood
Resampling and simulation methods

https://www.timberlake.co.uk/software/stata/

Start Again

Please select a user type:

Business Educational Students Upgrade Business and Educational The Stata Prof+ Plan

Please select a licence type:

Single User / Volume Single Users Network (Concurrent Users)

Please select an educational licence type:

Faculty Students

Please select a licence type:

Single User / Volume Single Users Network (Concurrent Users) Student Lab

Please select an upgrade licence type:

Business Educational

Please select an upgrade licence type:

Single User / Volume Single Users Network (Concurrent Use)

Please select an upgrade licence type:

Single User / Volume Single Users Network (Concurrent Use) Student Lab

View Prices

View Stata Students Page

I currently own a Stata license for:

Stata 18 Documentation

Every installation of Stata includes all the documentation in PDF format. Stata’s documentation consists of over 18,000 pages detailing each feature in Stata including the methods and formulas and fully worked examples. You can transition seamlessly across entries using the links within each entry.

Stata Manuals

User’s Guide

The Stata documentation is copyright of StataCorp LLC, College Station TX, USA, and is used with permission of StataCorp LLC.

Students may purchase Stata/MP, Stata/SE and Stata/BE at a discounted price. To purchase Student Licenses, click here.

Why Stata?

Master your data

Broad suite of statistical features

Publication-quality graphics

Automated reporting

Truly reproducible research

Real documentation

Trusted

Easy to use

Easy to grow with

Easy to automate

Easy to extend

Advanced programming

Community-contributed features

World-class technical support

Cross-platform compatible

Widely used

Behavioral sciences

Education

Institutional research

Public health

Biostatistics

Epidemiology

Medicine

Public policy

Data science

Finance, business, and marketing

Political science

Sociology

Economics

Comprehensive resources

Video tutorials

Stata Blog

Free Stata webinars

Training

Stata Press

Stata News

Stata Journal

Vibrant community

Affordable

Comparison of features

Product Features

Stata/BE (Basic Edition)

Stata/SE (Standard Edition)

Stata/MP

For Business License terms and options please click here.

For Educational License terms and options please click here.

New Features in Stata

Quick Tips

Recently added

Stata basics

Tour of what's new in Stata

Data management

Graphics

Automated document and report creation

Bayesian analysis

Binary, ordinal, count, and fractional outcomes

Case–control studies

Classical hypothesis tests

Descriptive data science, tables, and cross-tabulations

Dynamic stochastic general equilibrium models (DSGEs)

Econometrics

Effect sizes

Extended regression models (ERMs)

Factor variables

IRT (item response theory)

Latent class analysis and finite mixture models

Linear models

Marginal means, predictive margins, and contrasts

Multilevel mixed-effects models

Multiple imputation

Nonparametric data science

Panel data

Power and sample size

SEM (structural equation modeling)

Spatial autoregressive models

Statistical calculators

Survey data analysis

Survival analysis

Time series