This is simply a reformulation of a question I sent out yesterday (and
didn't get any responses to :) I have data sets that, when merged
produce a table with many-to-many relationships. The table below
contains the ID's from each table (Z and E)
     +----------+
     | Z     E  |
     |----------|
  1. |   a    x |
  2. |   b    x |
  3. |   b    z |
  4. |   c    y |
  5. |   d    z |
     |----------|
  6. |   e    q |
  7. |   e    z |
     +----------+
In as a base for further calculations I've created variables showing
duplicates and overlap between groups:
     +----------------------------------+
     | Z     E    zdup   edup   overlap |
     |----------------------------------|
  1. |   a    x      0      1         0 |
  2. |   b    x      1      1         1 |
  3. |   b    z      1      2         1 |
  4. |   c    y      0      0         0 |
  5. |   d    z      0      2         0 |
     |----------------------------------|
  6. |   e    q      1      0         0 |
  7. |   e    z      1      2         1 |
     +----------------------------------+
What I need to do is to create a group variable for all records that
are linked to each other through overlapping Z/E. In the example above
I would like to end up with something like:
     +------------------+
     | zip   ex   group |
     |------------------|
  1. |   a    x       1 |
  2. |   b    x       1 |
  3. |   b    z       1 |
  4. |   c    y       2 |
  5. |   d    z       1 |
     |------------------|
  6. |   e    q       1 |
  7. |   e    z       1 |
     +------------------+
I've spent several days now trying to figure out how to do that in
Stata/Filemaker/Excel and haven't solved it yet. Any help would be
most welcome!!!!
Fredrik
*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/