Title: Encoding a string variable
Author: James Hardin, StataCorp
The most common cause of this error message is that you are trying to use a string variable with a command that supports only numeric variables. You can tell the type of a variable by using the describe command.
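As a minimal sketch of how to check a variable's type, the lines below assume a dataset in memory with variables named a and b, as in the example in this FAQ. The ds and confirm commands are standard Stata commands that are not mentioned in the original text; they are shown here only as possible complements to describe.

```stata
* Show storage types for selected variables (str# or strL means string)
describe a b

* List only the string variables in the dataset
ds, has(type string)

* Test programmatically: _rc is 0 only when a is a string variable
capture confirm string variable a
if _rc == 0 {
    display "a is a string variable"
}
else {
    display "a is not a string variable"
}
```

The capture prefix keeps a do-file from stopping when the check fails, which makes this pattern usable before calling a command that requires numeric input.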
This is easy to fix.
If you have a string variable and want to convert it to a numeric variable, you can use the encode command. If you have a string variable that has only numbers in it, then you can alternatively use the real() function.
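The difference between the two approaches matters when a string is not purely numeric: real() returns missing for any string it cannot interpret as a number. The sketch below illustrates this; the variable name na_check is hypothetical, and a is assumed to be a string variable already in memory.

```stata
* real() converts a string holding a number to that number
display real("12")
display real("-3.5")

* A string that is not a number yields missing (.)
display real("abc")

* Count nonempty strings in a that real() could not convert
generate double na_check = real(a)
count if missing(na_check) & !missing(a)
```

If the final count is greater than zero, the variable holds non-numeric text, and encode is probably the more suitable tool.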
. describe

Contains data
  obs:             4
 vars:             2
 size:            48
------------------------------------------------------------------------
              storage   display    value
variable name   type    format     label      variable label
------------------------------------------------------------------------
a               str4    %9s
b               str4    %9s
------------------------------------------------------------------------
Sorted by:
     Note: dataset has changed since last saved

. list

     +-------+
     | a   b |
     |-------|
  1. | 1   a |
  2. | 2   b |
  3. | 3   c |
  4. | 4   d |
     +-------+

. gen na = real(a)

. encode b, gen(nb)

. describe

Contains data
  obs:             4
 vars:             4
 size:            80
------------------------------------------------------------------------
              storage   display    value
variable name   type    format     label      variable label
------------------------------------------------------------------------
a               str4    %9s
b               str4    %9s
na              float   %9.0g
nb              long    %8.0g      nb
------------------------------------------------------------------------
Sorted by:
     Note: dataset has changed since last saved

. list

     +-----------------+
     | a   b   na   nb |
     |-----------------|
  1. | 1   a    1    a |
  2. | 2   b    2    b |
  3. | 3   c    3    c |
  4. | 4   d    4    d |
     +-----------------+
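To reproduce the session above, the following do-file sketch builds the same four-observation dataset with input and then applies both conversions. The data values are taken from the listing above; the exact describe output (sizes, notes) may differ across Stata versions.

```stata
* Build the example dataset: two string variables, four observations
clear
input str4 a str4 b
"1" "a"
"2" "b"
"3" "c"
"4" "d"
end

* a holds only numbers, so real() converts it directly
generate na = real(a)

* b holds text, so encode maps each distinct string to an integer code
* and attaches a value label (also named nb) holding the original text
encode b, generate(nb)

describe
list
```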
Although nb is a numeric variable, it looks like a string variable because the encode command added value labels to it.
. list nb, nolab

     +----+
     | nb |
     |----|
  1. |  1 |
  2. |  2 |
  3. |  3 |
  4. |  4 |
     +----+
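The mapping that encode created can also be inspected directly. A short sketch, assuming nb was created by encode as above so that a value label named nb exists:

```stata
* Show the mapping from integer codes to the original strings
label list nb

* The underlying codes can be used in numeric expressions
list a b if nb == 2

* Tabulate with labels, then with the raw codes
tabulate nb
tabulate nb, nolabel
```

By default, encode assigns codes in the alphabetical order of the distinct strings, which is why a, b, c, d map to 1 through 4 in this example.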
Warning: If you have more than 65,536 unique values of the string variable that you are encoding, encode will complain. If that is the case, then you can use

. egen nb = group(b)

which will generate a numeric variable nb that does not have value labels.
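One way to combine the two approaches in a do-file is to try encode first and fall back to egen only if it fails. This is a sketch rather than a recommended procedure: the variable name nb2 is hypothetical, and the code treats any nonzero return code from encode as a reason to fall back, which is an assumption.

```stata
* Try encode first; capture suppresses the error and stores the return code in _rc
capture encode b, generate(nb2)

if _rc != 0 {
    * encode failed (for example, too many distinct values):
    * remove any leftover variable, then create group codes without value labels
    capture drop nb2
    egen long nb2 = group(b)
}
```

With the fallback, the original strings are not stored as labels, so keep b in the dataset if you need to look up what each code represents.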