Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
st: RE: Re: Reshaping dataset
From
"Sarah Edgington" <[email protected]>
To
<[email protected]>
Subject
st: RE: Re: Reshaping dataset
Date
Wed, 1 May 2013 17:19:03 -0700
Andrea,
What syntax did you use? It sounds like you tried to do a 1:1 merge when
you needed a 1:m merge.
If I'm reading your description right, you have 1 observation per SITC in
your master data and you want that to match to ALL the chains with that SITC
in your using data. As long as both those things are true, merge 1:m should
get you what you need.
-Sarah
-----Original Message-----
From: [email protected]
[mailto:[email protected]] On Behalf Of Andrea Molinari
Sent: Wednesday, May 01, 2013 4:05 PM
To: statalist
Subject: st: Re: Reshaping dataset
Dear statalisters,
It´s me again trying to reshape a piece of my dataset.
I need to assign values from one trade classification (SITC) to another
(chain), but with the complexity that there may be one SITC that corresponds
to more than one chain. I then need to sum (with
-egen-) the values by SITC to group them into the chain classification.
When I tried to use the -merge- command to do this, as the identifying
variable to use -merge- (SITC) "does not uniquely identify observations in
the using data" (sic), the system does not allow me to merge the two
datasets.
Does anyone know of any other command that allows me to do this?
Cheers!
Andrea
On 26 April 2013 13:24, Andrea Molinari <[email protected]> wrote:
> Dear statalisters,
>
> I´m working with a dataset which groups many dimensions and I´m having
> a little trouble reshaping the data for the (rather basic)
> calculations I need to do.
>
> The dataset has the following columns:
>
> year flow partner value cadena usoecon subcadena cadenacompartida1
> subcadenacompartida1 cadenacompartida2 subcadenacompartida2
>
> In order to regroup the data summing "value" by year, flow, cadena
> subcadena and usoecon, I need that:
>
> - the values in cadenacompartida1 and cadenacompartida2 go under those
> in the column "cadena"
>
> - the values in subcadenacompartida1 and "subcadenacompartida2" go
> under those in the column "subcadena"
>
> To do so, I tried several options with -reshape long-, but I don´t
> seem to get the right reshaping to get the data in the way I need to
> then calculate:
>
> bysort year flow cadena subcadena usoecon: egen double
> svalue=sum(value)
>
> Any ideas of those handling large datasets would be more than welcomed!
>
> Cheers,
> Andrea
>
> --
> Andrea Molinari, PhD
> Investigadora Asistente
> Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)
> Instituto Interdisciplinario de Economía Política de Buenos Aires
> (IIEP- BAIRES) Córdoba 2122, 2do. piso
> (http://iiep-baires.econ.uba.ar)
> Tel: +54 11 4374-4448, int. 6362
--
Andrea Molinari, PhD
Investigadora Asistente
Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)
Instituto Interdisciplinario de Economía Política de Buenos Aires (IIEP-
BAIRES) Córdoba 2122, 2do. piso (http://iiep-baires.econ.uba.ar)
Tel: +54 11 4374-4448, int. 6362
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/