Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
Re: st: insheet limit in string
From: Nick Cox <[email protected]>
To: [email protected]
Subject: Re: st: insheet limit in string
Date: Fri, 9 Dec 2011 09:00:53 +0000

Sounds as if you need some kind of pre-processing to put those
variables somewhere else. The Mata-based
programs I posted in September may be suggestive.
Nick
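
One possible shape for such pre-processing is sketched below. It is not the Mata-based code mentioned above; it is a minimal Python illustration that assumes a comma-delimited file with a header row and double-quoted strings, as in the example quoted further down. The function name split_long_field, the added id and seq columns, and the sample data are all hypothetical. The idea is to move the long 'school' column into its own long-format table (one row per "; ["-separated piece), so that the remaining columns keep their positions and each piece is more likely to fit within 244 characters.

```python
import csv
import io

LIMIT = 244  # string-length limit discussed in this thread


def split_long_field(src, main_out, long_out, field="school", sep="; ["):
    """Move one over-long column into a separate long-format table.

    src, main_out and long_out are open text file objects. The input is
    assumed to be comma-delimited with a header row (an assumption, not
    something confirmed for the poster's real files).
    Returns the number of pieces that are still longer than LIMIT.
    """
    reader = csv.DictReader(src)
    keep = [name for name in reader.fieldnames if name != field]
    main = csv.writer(main_out, lineterminator="\n")
    pieces_out = csv.writer(long_out, lineterminator="\n")
    main.writerow(["id"] + keep)
    pieces_out.writerow(["id", "seq", field])

    too_long = 0
    for rec_id, row in enumerate(reader, start=1):
        # All other columns stay together, keyed by a new record id.
        main.writerow([rec_id] + [row[name] for name in keep])
        pieces = (row[field] or "").split(sep)
        for seq, piece in enumerate(pieces, start=1):
            if seq > 1:
                piece = "[" + piece  # restore the bracket removed by split
            if len(piece) > LIMIT:
                too_long += 1
            pieces_out.writerow([rec_id, seq, piece])
    return too_long


# Tiny in-memory demonstration with made-up data (not the poster's file).
sample = io.StringIO(
    'author,school,year\n'
    '"A, B; C","[A, B] Univ X, City; [C] Univ Y",2011\n'
)
main_csv, school_csv = io.StringIO(), io.StringIO()
n_too_long = split_long_field(sample, main_csv, school_csv)
```

With real files, the three arguments would be file objects opened with newline="" (for example on mydata.csv and two new output files). Each output could then be read with -insheet ..., comma names clear- and the two datasets combined on id. A nonzero return value means some pieces still exceed the limit and would need further splitting.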
On Fri, Dec 9, 2011 at 4:19 AM, Mike Kim <[email protected]> wrote:
> Thanks, Nick and Matt.
>
> Matt, your suggestion works with the simple example below, but it does not
> work with my actual data. I have many variables to the right of the 'school'
> variable. If I use the code you suggested, the split pieces push the other
> values to the right and the variables get all mixed up. Strange. Well, thanks
> anyway. I really appreciate your help.
> Mike.
>
> -----Original Message-----
> From: [email protected]
> [mailto:[email protected]] On Behalf Of Matthew White
> Sent: Thursday, December 08, 2011 6:09 PM
> To: [email protected]
> Subject: Re: st: insheet limit in string
>
> Hi Mike,
>
> With -filefilter-, you probably want to change "; [" to the delimiter.
> For example, if mydata.csv is comma-delimited, you should change "; ["
> to `"",""' (since you're working with strings). Let's say mydata.csv
> looks like this:
> ***BEGIN .CSV***
> author,school
> "Novicevic, MM; Humphreys, JH; Buckley, MR; Cagle, C; Roberts,
> F","[Buckley, MR] Univ Oklahoma, Michael F Price Coll Business,
> Norman, OK 73019 USA; [Novicevic, MM; Roberts, F] Univ Mississippi,
> Sch Business Adm, University, MS 38677 USA; [Humphreys, JH] Texas A&M
> Univ, Commerce, TX 75429 USA; [Cagle, C] Univ Mississippi, Sch
> Accountancy, University, MS 38677 USA"
> ***END***
>
> Then in Stata:
> filefilter mydata.csv mydata_split.csv, from("; [") to(`"",""') replace
> insheet using mydata_split.csv, comma names clear
>
> Does that work? Also, if you're working with many files, think about
> using the -:dir- extended macro function: -help extended_fcn-.
>
> Best,
> Matt
>
> On Thu, Dec 8, 2011 at 6:30 PM, Mike Kim <[email protected]> wrote:
>> Hi Matt,
>>
>> Yes, I tried filefilter, etc., but it didn't work. For example, I changed
>> "; [" into "^" or "???" and used insheet. The result is a complete mess
>> (with ^) or an error (with ???). Does the delimiter option in insheet work
>> only with one variable? If the data have several variables, the imported
>> data become a total mess with the delimiter option. Then maybe the only
>> option is to save my school variable as a separate file, import it using
>> the delimit option, and merge again? I have 40 of these files to import,
>> but it is doable. The only question is... why does Stata create this pain?
>> Mike.
>>
>> -----Original Message-----
>> From: [email protected]
>> [mailto:[email protected]] On Behalf Of Matthew White
>> Sent: Thursday, December 08, 2011 4:22 PM
>> To: [email protected]
>> Subject: Re: st: insheet limit in string
>>
>> Hi Mike,
>>
>> There was a discussion about a similar problem not too long ago.
>> Google "String variables over 244 in a dataset with two delimiters"
>> and see if that helps.
>>
>> Best,
>> Matt
>>
>> On Thu, Dec 8, 2011 at 5:05 PM, Mike Kim <[email protected]> wrote:
>>> Hi all,
>>>
>>> I am using Stata IC v.11 and trying to import data using:
>>> insheet using mydata.csv, clear
>>>
>>> However, due to the 244-character string limit, I cannot correctly import
>>> the following example. I cannot use the delimit(;) option because it
>>> changes the data structure I intended. If I could split the school
>>> variable using delimit("; ["), it would work, but Stata does not allow
>>> this. Is there any way I can import strings longer than 244 characters?
>>> Can Stata 12 handle long string variables? Thank you in advance.
>>> Mike.
>>>
>>> clear
>>> input str244 author str244 school
>>> "Novicevic, MM; Humphreys, JH; Buckley, MR; Cagle, C; Roberts, F"
>>> "[Buckley, MR] Univ Oklahoma, Michael F Price Coll Business, Norman, OK
>>> 73019 USA; [Novicevic, MM; Roberts, F] Univ Mississippi, Sch Business Adm,
>>> University, MS 38677 USA; [Humphreys, JH] Texas A&M Univ, Commerce, TX 75429
>>> USA; [Cagle, C] Univ Mississippi, Sch Accountancy, University, MS 38677 USA"
>>> end