Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
st: string variables
From: Estrella Gomez <[email protected]>
To: [email protected]
Subject: st: string variables
Date: Fri, 20 Sep 2013 12:12:17 +0200
Dear Statalisters,
I am working on a dataset of movies. I would like to identify
each movie with a unique id. However, in many cases the title is
translated, so the identifier provided in the dataset differs across
countries for the same movie. For instance:
id | country | artist | trackname
2975 | at | Adam McKay | Anchorman - Die Legende von Ron Burgundy
2975 | de | Adam McKay | Anchorman - Die Legende von Ron Burgundy
6647 | it | Adam McKay | Anchorman: La leggenda di Ron Burgundy
6653 | be | Adam McKay | Anchorman: The Legend of Ron Burgundy
How could I create a new id that uniquely identifies the same movie, even
when the title appears in different languages? Maybe I could use the first
5 or 6 letters of the title, because these usually coincide across
languages, but I don't know how to do it.
Thanks a lot,
Estrella Gomez
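
The prefix idea in the question can be sketched as follows. This is only an
illustration in Python, not Stata, and it rests on assumptions that are not
in the post: the key combines the artist with the first 6 letters of the
title (the post suggests 5 or 6), case and punctuation are ignored, and new
ids are numbered from 1 in order of first appearance. The names title_key,
assign_movie_ids and PREFIX_LEN are hypothetical. In Stata the same idea
would presumably combine substr() with the group() function of egen.

```python
import re

# Sample rows copied from the post: (id, country, artist, trackname)
rows = [
    (2975, "at", "Adam McKay", "Anchorman - Die Legende von Ron Burgundy"),
    (2975, "de", "Adam McKay", "Anchorman - Die Legende von Ron Burgundy"),
    (6647, "it", "Adam McKay", "Anchorman: La leggenda di Ron Burgundy"),
    (6653, "be", "Adam McKay", "Anchorman: The Legend of Ron Burgundy"),
]

PREFIX_LEN = 6  # assumption: the post suggests 5 or 6 letters


def title_key(artist, trackname, n=PREFIX_LEN):
    """Return (artist, title prefix), both lower-cased.

    The prefix keeps only ASCII letters and digits, so punctuation,
    spaces and accented characters are dropped before truncating.
    """
    letters = re.sub(r"[^a-z0-9]", "", trackname.lower())
    return (artist.strip().lower(), letters[:n])


def assign_movie_ids(rows):
    """Map each distinct (artist, prefix) key to a new id, starting at 1."""
    ids = {}
    result = []
    for _old_id, _country, artist, trackname in rows:
        key = title_key(artist, trackname)
        # setdefault stores the next free id only if the key is new
        result.append(ids.setdefault(key, len(ids) + 1))
    return result


movie_ids = assign_movie_ids(rows)  # one new id per input row
```

A prefix key is a heuristic. It will split a movie whose translated title
starts with a different word, and it will merge different movies by the same
artist whose titles share the prefix (a sequel, for example), so the
resulting groups are worth checking by hand.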