Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: RE: Re: Reshaping dataset
From
Andrea Molinari <[email protected]>
To
statalist <[email protected]>
Subject
Re: st: RE: Re: Reshaping dataset
Date
Thu, 2 May 2013 00:03:11 -0300
Yep, that was the problem with the command. Now that I was able to run
the whole set of commands, I get something quite weird, and I'm not
really sure about what step is causing it...
I get different results for "valrm" as I run and re-run the do-file...
I copy the syntax below, does anyone know what might be hapening?
Cheers!
Andrea
////////////////////////////////
clear
set mem 1g
use "cadenas.dta"
sort cuci5d
save "cadenas.dta", replace
clear
use "datos pry.dta"
sort cuci5d
merge m:m cuci5d using "cadenas.dta"
assert value==. if _merge==2
drop if _merge==2
drop _merge
save "prycadenas.dta", replace
clear
use "usoecon.dta"
sort usoecon
clear
use "prycadenas.dta"
merge m:1 usoecon using "usoecon.dta"
drop if _merge==1
drop if _merge==2
drop _merge
drop hs cuci5d flow usoecon
save "prycadenas.dta", replace
bysort year partner cadena subcadena flores: egen double svalue=sum(value)
bysort year partner cadena subcadena flores: keep if _n==1
drop value
reshape wide svalue, i(year cadena subcadena flores) j(partner)
gen double valmcs=svalue32+svalue76+svalue858
rename svalue0 valwld
gen valrm=valwld-valmcs
////////////////////////////////
On 1 May 2013 21:19, Sarah Edgington <[email protected]> wrote:
> Andrea,
> What syntax did you use? It sounds like you tried to do a 1:1 merge when
> you needed a 1:m merge.
> If I'm reading your description right, you have 1 observation per SITC in
> your master data and you want that to match to ALL the chains with that SITC
> in your using data. As long as both those things are true, merge 1:m should
> get you what you need.
> -Sarah
>
> -----Original Message-----
> From: [email protected]
> [mailto:[email protected]] On Behalf Of Andrea Molinari
> Sent: Wednesday, May 01, 2013 4:05 PM
> To: statalist
> Subject: st: Re: Reshaping dataset
>
> Dear statalisters,
>
> It´s me again trying to reshape a piece of my dataset.
>
> I need to assign values from one trade classification (SITC) to another
> (chain), but with the complexity that there may be one SITC that corresponds
> to more than one chain. I then need to sum (with
> -egen-) the values by SITC to group them into the chain classification.
>
> When I tried to use the -merge- command to do this, as the identifying
> variable to use -merge- (SITC) "does not uniquely identify observations in
> the using data" (sic), the system does not allow me to merge the two
> datasets.
>
> Does anyone know of any other command that allows me to do this?
>
> Cheers!
> Andrea
>
> On 26 April 2013 13:24, Andrea Molinari <[email protected]> wrote:
>> Dear statalisters,
>>
>> I´m working with a dataset which groups many dimensions and I´m having
>> a little trouble reshaping the data for the (rather basic)
>> calculations I need to do.
>>
>> The dataset has the following columns:
>>
>> year flow partner value cadena usoecon subcadena cadenacompartida1
>> subcadenacompartida1 cadenacompartida2 subcadenacompartida2
>>
>> In order to regroup the data summing "value" by year, flow, cadena
>> subcadena and usoecon, I need that:
>>
>> - the values in cadenacompartida1 and cadenacompartida2 go under those
>> in the column "cadena"
>>
>> - the values in subcadenacompartida1 and "subcadenacompartida2" go
>> under those in the column "subcadena"
>>
>> To do so, I tried several options with -reshape long-, but I don´t
>> seem to get the right reshaping to get the data in the way I need to
>> then calculate:
>>
>> bysort year flow cadena subcadena usoecon: egen double
>> svalue=sum(value)
>>
>> Any ideas of those handling large datasets would be more than welcomed!
>>
>> Cheers,
>> Andrea
>>
>> --
>> Andrea Molinari, PhD
>> Investigadora Asistente
>> Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)
>> Instituto Interdisciplinario de Economía Política de Buenos Aires
>> (IIEP- BAIRES) Córdoba 2122, 2do. piso
>> (http://iiep-baires.econ.uba.ar)
>> Tel: +54 11 4374-4448, int. 6362
>
>
>
> --
> Andrea Molinari, PhD
> Investigadora Asistente
> Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)
> Instituto Interdisciplinario de Economía Política de Buenos Aires (IIEP-
> BAIRES) Córdoba 2122, 2do. piso (http://iiep-baires.econ.uba.ar)
> Tel: +54 11 4374-4448, int. 6362
>
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/faqs/resources/statalist-faq/
> * http://www.ats.ucla.edu/stat/stata/
>
>
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/faqs/resources/statalist-faq/
> * http://www.ats.ucla.edu/stat/stata/
--
Andrea Molinari, PhD
Investigadora Asistente
Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)
Instituto Interdisciplinario de Economía Política de Buenos Aires (IIEP- BAIRES)
Córdoba 2122, 2do. piso (http://iiep-baires.econ.uba.ar)
Tel: +54 11 4374-4448, int. 6362
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/