Connecting to databases using JDBC

Highlights

Access data from many databases, including Oracle, MySQL, Amazon Redshift, Snowflake, Microsoft SQL Server, and more
Completely cross-platform compatible
Load an entire database table into Stata. or use a SQL SELECT to just load specific columns from a table into Stata
Insert all your variables into a database table, or insert just a subset of your data
Execute SQL statements from Stata
Store connection settings as a data source name (DSN)

Connecting Stata with databases has gotten even easier. jdbc allows us to exchange data with some of the most popular database vendors such as Oracle, MySQL, Amazon Redshift, Snowflake, Microsoft SQL Server, and much more. What's great about jdbc is that it's a cross-platform solution, so our JDBC setup works the same way for Windows, Mac, and Unix systems. Once you install a JDBC driver, that driver and your Stata code are all you need to switch from, say, your Mac laptop to your company's Windows cloud systems.

Let's see it work

We have email data stored on Amazon Web Services in a Redshift cluster, and we need to load these data into Stata. We first log in to AWS and go to the Amazon Redshift configuration page to download the correct JDBC driver and get the correct connection information. We then place the downloaded JDBC JAR file along our Stata adopath. Now in the Stata Do-file Editor, we store our connection information by typing

. local jar "redshift-jdbc42-2.0.0.0.jar"
. local driverc "com.amazon.redshift.jdbc42.Driver"
. local url "jdbc:redshift://redshift-cluster-1.cziajbxjzi3e.us-west-2.redshift.amazonaws.com:5439/emails"
. local user "admin"
. local pass "secret"

. jdbc connect,  jar("`jar'") driverclass("`driverc'") url("`url'")
        user("`user'") password("`pass'")

If these database settings need to be used by others or you just want to make remembering them easier, we can store them by typing

. local jar "redshift-jdbc42-2.0.0.0.jar"
. local driverc "com.amazon.redshift.jdbc42.Driver"
. local url "jdbc:redshift://redshift-cluster-1.cziajbxjzi3e.us-west-2.redshift.amazonaws.com:5439/emails"
. local user "admin"
. local pass "secret"

. jdbc add MyRed,  jar("`jar'") driverclass("`driverc'") url("`url'")
        user("`user'") password("`pass'")

We can now add the above commands to profile.do to save these connection settings in between Stata sessions, and we now can connect to our Redshift database by typing

. jdbc connect MyRed

To see what tables are availiable to load from our connection, we type

. jdbc showtables


Database: emails

Tables

category
response_info
employees

We can describe a table by typing

. jdbc describe response_info


Table: response_info

Column name                    Column type

id                                         BIGINT UNSIGNED
filename                                   VARCHAR
category_id                                BIGINT UNSIGNED
employee_id                                BIGINT UNSIGNED
datein                                     TIMESTAMP
dateout                                    DATE
screendate                                 TIMESTAMP
rid                                        TEXT
keywords                                   TEXT
assigntime                                 TIMESTAMP
resptime                                   TIMESTAMP
timeadded                                  TIMESTAMP
sversion                                   DOUBLE
correct                                    BIT
timetouched                                TIMESTAMP
timemailed                                 TIMESTAMP

To load the data, we type

. jdbc load, table("response_info") clear
(128 observations loaded)

Now we have a Stata dataset and can perform our analysis!

Additional resources

Learn more about setting up a JDBC DSN, executing SQL, loading data, and inserting data with in-depth examples in the Stata Data Management Reference Manual; see [D] jdbc.

We use cookies

We use cookies to ensure that we give you the best experience on our website—to enhance site navigation, to analyze usage, and to assist in our marketing efforts. By continuing to use our site, you consent to the storing of cookies on your device and agree to delivery of content, including web fonts and JavaScript, from third party web services.

Cookie Settings

Last updated: 16 November 2022

StataCorp LLC (StataCorp) strives to provide our users with exceptional products and services. To do so, we must collect personal information from you. This information is necessary to conduct business with our existing and potential customers. We collect and use this information only where we may legally do so. This policy explains what personal information we collect, how we use it, and what rights you have to that information.

Advertising and performance cookies

This website uses cookies to provide you with a better user experience. A cookie is a small piece of data our website stores on a site visitor's hard drive and accesses each time you visit so we can improve your access to our site, better understand how you use our site, and serve you content that may be of interest to you. For instance, we store a cookie when you log in to our shopping cart so that we can maintain your shopping cart should you not complete checkout. These cookies do not directly store your personal information, but they do support the ability to uniquely identify your internet browser and device.

Please note: Clearing your browser cookies at any time will undo preferences saved here. The option selected here will apply only to the device you are currently using.

This page announced the new features in Stata 17. Please see our Stata 19 page for the new features in Stata 19.

Connecting to databases using JDBC

Highlights

Let's see it work

Additional resources

We use cookies

Privacy policy

Required cookies

Advertising and performance cookies

Table: response_info

Column name	Column type

id BIGINT UNSIGNED filename VARCHAR category_id BIGINT UNSIGNED employee_id BIGINT UNSIGNED datein TIMESTAMP dateout DATE screendate TIMESTAMP rid TEXT keywords TEXT assigntime TIMESTAMP resptime TIMESTAMP timeadded TIMESTAMP sversion DOUBLE correct BIT timetouched TIMESTAMP timemailed TIMESTAMP

Stata/MP4 Annual License (download)

This page announced the new features in Stata 17. Please see our Stata 19 page for the new features in Stata 19.

Connecting to databases using JDBC

Highlights

Let's see it work

Additional resources

We use cookies

Privacy policy

Required cookies

Advertising and performance cookies