Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Randomize points to map polygons using -spmap- ?


From   Maurizio Pisati <[email protected]>
To   [email protected]
Subject   Re: st: Randomize points to map polygons using -spmap- ?
Date   Tue, 16 Jul 2013 11:06:10 +0200

Dear Andrew,
what do you mean exactly by "randomize cases within NAME_1"?
Anyway, please find below a Stata command I wrote a few years ago for personal use (i.e., no help file and possible bugs) which might be of help. It is called -csrpp- and generates a given number of random points within a set of polygons specified by the user.
Using your Andorra map, suppose you'd like to generate 50 random cases within each city. Here's a way to go:

. use "ADMdb.dta", clear
. generate cases = 50
. csrpp cases using "ADMcoord.dta", id(_ID) saving("casecoord.dta") replace

The Stata dataset "casecoord.dta" contains the coordinates of the 50x7 random points:

. spmap using "ADMcoord.dta", id(_ID) point(data("casecoord.dta") x(_X) y(_Y))

Best wishes,
Maurizio




===== CODE STARTS HERE =============================================================

*! -csrpp-: Generate completely spatially random points in a polygon           
*! Version 1.0 - 19 August 2006                                                
*! Author: Maurizio Pisati                                                     
*! Department of Sociology and Social Research                                 
*! University of Milano Bicocca (Italy)                                        
*! [email protected]                                                   




*  ----------------------------------------------------------------------------
*  1. Define program                                                           
*  ----------------------------------------------------------------------------

program csrpp
version 9.2




*  ----------------------------------------------------------------------------
*  2. Define syntax                                                            
*  ----------------------------------------------------------------------------

syntax varname(numeric) [if] [in] [iweight/] using/,   ///
       ID(varname numeric)                             ///
      [Scale(numlist max=1 >0)]                        ///
       SAVing(string)                                  ///
      [REPLACE]                                        ///
      [noDOTS]




*  ----------------------------------------------------------------------------
*  3. Check syntax                                                             
*  ----------------------------------------------------------------------------

/* Check weight */
if ("`weight'" != "") {
   tempvar W
   qui gen `W' = `exp'
   qui su `W'
   if r(min) < 0 {
      di as err "{p}The weighting variable can take only nonnegative "   ///
                "values{p_end}"
      exit 198
   }
}
else local exp = 1

/* Check option id() */
capture isid `id'
if _rc {
   di as err "{p}Variable `id' specified in option {bf:{ul:id}()} "   ///
             "does not uniquely identify the polygons{p_end}"
   exit 459
}

/* Check option saving() */
if (substr(reverse("`saving'"),1,4) != "atd.") {
   local region "`saving'.dta"
}

/* Marksample */
marksample TOUSE
markout `TOUSE' `id'
qui count if `TOUSE'
if (r(N) == 0) error 2000




*  ----------------------------------------------------------------------------
*  4. Define basic objects                                                     
*  ----------------------------------------------------------------------------

/* Preserve data */
preserve

/* Select relevant observations */
qui keep if `TOUSE'

/* Set default scale factor */
if ("`scale'" == "") local scale = 1

/* Set option dots */
if ("`dots'" != "") local dots "no"
else local dots "yes"




*  ----------------------------------------------------------------------------
*  5. Data setup                                                               
*  ----------------------------------------------------------------------------

/* Generate id variable */
cap rename `id' _ID
sort _ID

/* Generate variables stating number of points to be generated */
unab varlist : `varlist'
cap rename `varlist' NP
qui replace NP = round(NP * `scale')

/* Generate weight variable */
qui gen W = `exp' * (1 / `scale')

/* Save temporary file */
keep _ID NP W
tempfile DATA
qui save `DATA', replace




*  ----------------------------------------------------------------------------
*  6. Polygon setup                                                            
*  ----------------------------------------------------------------------------

/* Open file */
use "`using'", clear

/* Generate subpolygon id */
quietly {
   gen POLY = (_X == .)
   replace POLY = sum(POLY)
   drop if _X == .
   sort POLY, stable
   tempfile POLY
   save `POLY', replace
}

/* Compute polygon area */
quietly {
   by POLY : gen double AREA = (_X*_Y[_n+1]) - (_X[_n+1]*_Y) if _n>1 & _n<_N
   by POLY : replace AREA = sum(AREA)
   by POLY : replace AREA = abs(AREA[_N] / 2)
}

/* Compute subpolygon weigths */
collapse AREA (count) NV = AREA, by(_ID POLY)
quietly {
   by _ID : gen double CUM = sum(AREA)
   by _ID : replace AREA = AREA / CUM[_N]
   drop CUM
}




*  ----------------------------------------------------------------------------
*  7. Prepare working dataset                                                  
*  ----------------------------------------------------------------------------

/* Merge files and adjust NP variable */
quietly {
   merge _ID using `DATA'
   keep if _merge == 3
   replace NP = round(AREA * NP)
   gen ID = _n
}

/* Select relevant variables and save dataset */
keep POLY ID NP NV W
order POLY ID NP NV W
sort POLY
qui save `DATA', replace

/* Select relevant records from polygon file */
quietly {
   use `POLY', clear
   merge POLY using `DATA'
   keep if _merge == 3
   keep ID _X _Y
   ren ID _ID
   order _ID
   sort _ID, stable
   save `POLY', replace   
}

/* Open data  */
use `DATA', clear
keep ID NP NV W
qui gen END = sum(NV)
qui gen START = END[_n-1] + 1
qui replace START = 1 in 1
drop NV
order ID NP START END W

/* Count polygons */
qui count
local NPOLY = r(N)

/* Count points to be generated */
su NP, mean
local NPOINTS = r(sum)

/* Merge polygon file */
qui merge using `POLY'
drop _merge ID _ID

/* Set number of records */
qui count
if (`NPOINTS' > r(N)) qui set obs `NPOINTS'

/* Generate variables */
qui gen X  = .
qui gen Y  = .
qui gen _W = .




*  ----------------------------------------------------------------------------
*  8. Generate points and save dataset                                         
*  ----------------------------------------------------------------------------

/* Generate points */
mata: csrpp(`NPOLY', "`dots'")

/* Create and select relevant variables */
keep X Y _W
ren X _X
ren Y _Y
lab var _X "x-coordinate"
lab var _Y "y-coordinate"
lab var _W "Weight"

/* Save dataset */
qui drop if _X == .
qui compress
save "`saving'", `replace'




*  ----------------------------------------------------------------------------
*  9. End program                                                              
*  ----------------------------------------------------------------------------

restore
end








/*  -----------------------------------------------------------------------  */
/*  Call mata                                                                */
/*  -----------------------------------------------------------------------  */

version 9.2
mata:
mata clear
mata set matastrict on




/*  -----------------------------------------------------------------------  */
/*  csrpp()                                                                  */
/*  -----------------------------------------------------------------------  */
void csrpp(real scalar npoly, string scalar dots)
{


real matrix    POLY, COOR
real scalar    i, line
real scalar    np, we, fr, lr, fi, li

if (dots == "yes") {
   printf("\n")
   printf("{txt}Scanning polygons ({res}%g{txt})\n", npoly)
   printf("{txt}{hline 4}{c +}{hline 3} 1 {hline 3}{c +}{hline 3} 2 ") 
   printf("{txt}{hline 3}{c +}{hline 3} 3 {hline 3}{c +}{hline 3} 4 ")
   printf("{txt}{hline 3}{c +}{hline 3} 5\n")
   displayflush()
}

fi = 1
li = 0
for (i=1; i<=npoly; i++) {
   if (dots == "yes") {
      line = mod(i,50)
      if (line != 0  &  i < npoly) {
         printf("{txt}.")
         displayflush()
      }
      if (line == 0  &  i < npoly) {
         printf("{txt}. %5.0f\n", i)
         displayflush()
      }
      if (i == npoly) {
         printf("{txt}.\n")
         printf("\n")
         displayflush()
      }
   }
   np = _st_data(i, 1)
   if (np > 0) {
      we = _st_data(i, 4)
      li = li + np
      fr = _st_data(i, 2)
      lr = _st_data(i, 3)
      POLY = st_data((fr,lr), (5..6))
      COOR = sp_csrpp(np, POLY)
      st_store((fi,li), (7..8), COOR)
      st_store((fi,li), 9, J(np, 1, we))
      fi = li + 1
   }
}


}




/*  -----------------------------------------------------------------------  */
/*  sp_csrpp()                                                               */
/*  -> sp_pip()                                                              */
/*  -> sp_polyarea()                                                         */
/*                                                                           */
/*  Returns matrix npoints-by-2 containing the coordinates of npoints        */
/*  completely spatially random points lying within polygon POLY             */
/*  -----------------------------------------------------------------------  */
real matrix sp_csrpp(real scalar npoints, real matrix POLY)
{


/* Setup */
real matrix    COOR
real matrix    XC, YC, XY, PIP, DONE
real scalar    xmin, xmax, ymin, ymax, areap, areab, aratio
real scalar    ndone, nmiss, ngen

/* Generate working objects */
xmin = min(POLY[., 1])
xmax = max(POLY[., 1])
ymin = min(POLY[., 2])
ymax = max(POLY[., 2])
areap = sp_polyarea(POLY)
areab = abs(xmax - xmin) * abs(ymax - ymin)
aratio = areab / areap

/* Generate points */
ndone = 0
while (ndone < npoints) {
   nmiss = npoints - ndone
   ngen = round(nmiss * aratio)
   if (ngen == 1) ngen = 10
   XC = xmin :+ uniform(ngen, 1) :* (xmax - xmin)
   YC = ymin :+ uniform(ngen, 1) :* (ymax - ymin)
   XY = XC , YC
   PIP = sp_pip(XY, POLY)
   DONE = select(XY, PIP)
   if (ndone == 0) COOR = DONE
   else COOR = COOR \ DONE
   ndone = rows(COOR)
}

/* Return results */
return(COOR[(1..npoints), .])


}




/*  -----------------------------------------------------------------------  */
/*  sp_pip()                                                                 */
/*  -> sp_spip()                                                             */
/*                                                                           */
/*  Returns matrix R-by-1 indicating whether each point in R-by-2 matrix     */
/*  POINTS lies within (value 1) or without (value 0) polygon POLY           */
/*  -----------------------------------------------------------------------  */

real matrix sp_pip(real matrix POINTS, real matrix POLY)
{


/* Setup */
real matrix    PIP
real scalar    np
real scalar    i, x, y

/* Generate working objects */
np = rows(POINTS)
PIP = J(np, 1, 0)

/* Check and report points status */
for (i=1; i<=np; i++) {
   x = POINTS[i, 1]
   y = POINTS[i, 2]
   PIP[i] = sp_spip(x, y, POLY)
}

/* Return results */
return(PIP)


}




/*  -----------------------------------------------------------------------  */
/*  sp_polyarea()                                                            */
/*                                                                           */
/*  Returns scalar containing the area of polygon POLY                       */
/*  -----------------------------------------------------------------------  */

real scalar sp_polyarea(real matrix POLY)
{


/* Setup */
real scalar    area
real scalar    nv, i, x1, y1, x2, y2

/* Generate working objects */
area = 0
nv = rows(POLY)
if (POLY[1,1] == POLY[nv,1] & POLY[1,2] == POLY[nv,2]) nv = nv - 1

/* Compute polygon area */
for (i=1; i<=nv; i++) {
   x1 = POLY[i, 1]
   y1 = POLY[i, 2]
   if (i == nv) {
      x2 = POLY[1, 1]
      y2 = POLY[1, 2]
   }
   else {
      x2 = POLY[i+1, 1]
      y2 = POLY[i+1, 2]
   }
   area = area + ((x2 - x1) * (y2 + y1))
}

/* Return results */
return(abs(area) * 0.5)


}




/*  -----------------------------------------------------------------------  */
/*  sp_spip()                                                                */
/*                                                                           */
/*  Returns scalar indicating whether single point (x,y) lies within         */
/*  (value 1) or without (value 0) polygon POLY                              */
/*  -----------------------------------------------------------------------  */

real scalar sp_spip(real scalar x, real scalar y, real matrix POLY)
{


/* Setup */
real scalar    pip
real scalar    nv, iwind, xlastp, ylastp, ioldq, inewq
real scalar    xthisp, ythisp
real scalar    i, a, b

/* Generate working objects */
nv = rows(POLY)
if (POLY[1,1] == POLY[nv,1] & POLY[1,2] == POLY[nv,2]) nv = nv - 1
iwind = 0
xlastp = POLY[nv, 1]
ylastp = POLY[nv, 2]
ioldq = sp_spip_aux(xlastp, ylastp, x, y)

/* Check and report point status */
for (i=1; i<=nv; i++) {
   xthisp = POLY[i, 1]
   ythisp = POLY[i, 2]
   inewq = sp_spip_aux(xthisp, ythisp, x, y)
   if (ioldq != inewq) {
      if (mod(ioldq+1, 4) == inewq) {
         iwind = iwind + 1
      }
      else if (mod(inewq+1, 4) == ioldq) {
         iwind = iwind - 1
      }
      else {
         a = (ylastp - ythisp) * (x - xlastp)
         b = xlastp - xthisp
         a = a + ylastp * b
         b = b * y
         if (a > b) {
            iwind = iwind + 2
         }
         else {
            iwind = iwind - 2
         }
      }
   }
   xlastp = xthisp
   ylastp = ythisp
   ioldq = inewq
}
pip = abs(iwind / 4)

/* Returns results */
return(pip)


}

/* Auxiliary function */
real scalar sp_spip_aux(real scalar xp, real scalar yp, real scalar xo,
                        real scalar yo)
{
real scalar iq
if(xp < xo) {
   if(yp < yo) iq = 2
   else iq = 1
}
else {
   if(yp < yo) iq = 3
   else iq = 0
}
return(iq)
}




/*  -----------------------------------------------------------------------  */              
/*  Exit Mata                                                                */
/*  -----------------------------------------------------------------------  */              

end




===== CODE ENDS HERE ===============================================================




> Hello Sergiy,
> 
> Yes, I would certainly prefer to use Stata; however there appear to be
> quite a few options in R, eg -spatstat- and -rgdal- among others.
> 
> I am stymied on how to a) form a grid  and b) assign properly
> randomized points as you suggest. My sample frame is city data (at the
> ADM3+ (commune) level).
> 
> Some typos in previous example.
> 
> ******** Revised example starts here  ********
> 
> * cd "User/Data/"
> 
> * open-source shapefile for Andorra(AND)
> 
> copy "http://gadm.org/data/shp/AND_adm.zip"; AND_adm.zip, replace
> unzipfile AND_adm.zip, replace
> 
> shp2dta using "AND_adm1", database(ADMdb) coordinates(ADMcoord) replace
> 
> use ADMdb, clear
> 
> gen cases = ceil(uniform() * 25)
> desc
> 
> * use ADMcoord, clear
> * desc
> 
> ******** Example ends here  ********
> 
> thanks for the suggestions and the Ukraine example-
> 
> cheers
> Andrew Lover
> ___________________________________________________________
> Epidemiologist
> Centre for Infectious Disease Epidemiology Research (CIDER)
> Saw Swee Hock School of Public Health
> National University of Singapore
> *
> *   For searches and help try:
> *   http://www.stata.com/help.cgi?search
> *   http://www.stata.com/support/faqs/resources/statalist-faq/
> *   http://www.ats.ucla.edu/stat/stata/


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index