I have now tried to perform the complete calibration:
The do file is here:
egen tot_age_grp = count(pop),by(age_grp)
egen tot_age_grp_q = count(pop) if quest==1, by(age_grp)
egen tot_geo_grp = count(pop),by(geo_grp)
egen tot_geo_grp_q = count(pop) if quest==1, by(geo_grp)
gen weight1x = (tot_age_grp / tot_age_grp_q)
keep if quest==1
*(reducing the dataset to 3743 men)
survwgt rake weight1x, ///
by(age_grp geo_grp) ///
totvars(tot_age_grp tot_geo_grp) ///
gen(weight2x)
svyset [pweight=weight2x], strata(age_grp)
*Step 2
xi: logistic sample i.age_grp i.child_cat i.marrital_cat i.job_kat
i.bicycle_cat i.jogging_cat i.organised i.smoke_grp i.education_grp
sp1_diag_db91 sp1_diag_dc61 sp1_diag_dd51 sp1_diag_de05
sp1_diag_de10
sp1_diag_de11 sp1_diag_de14 sp1_diag_de78 sp1_diag_dg40
sp1_diag_dg43
sp1_diag_dg44 sp1_diag_dg47 sp1_diag_dh40 sp1_diag_di10
sp1_diag_di20
sp1_diag_di25 sp1_diag_di49 sp1_diag_di51 sp1_diag_di63
sp1_diag_dj30
sp1_diag_dj33 sp1_diag_dj42 sp1_diag_dj44 sp1_diag_dj45
sp1_diag_dj96
sp1_diag_dk21 sp1_diag_dk22 sp1_diag_dk50 sp1_diag_dk51
sp1_diag_dl30
sp1_diag_dl40 sp1_diag_dm10 sp1_diag_dm13 sp1_diag_dm19
sp1_diag_dm53
sp1_diag_dm79 sp1_diag_dm81 sp1_diag_dn30 sp1_diag_dn40
sp1_diag_dt78
sp1_a02ba sp1_a02bc sp1_a03aa sp1_a07ec sp1_a10a sp1_a10ba sp1_a10bb
sp1_a11ba sp1_a11ea sp1_a12ba sp1_b01aa sp1_b01ac sp1_b03b sp1_b03xa
sp1_c01aa sp1_c02ac sp1_c02ca sp1_c03aa sp1_c03ba sp1_c03ca sp1_c03ea
sp1_c07aa sp1_c07ab sp1_c07ag sp1_c07bb sp1_c07cb sp1_c08ca sp1_c09aa
sp1_c09ba sp1_c09ca sp1_c10aa sp1_c01bd sp1_c01da sp1_c03da
sp1_c08da
sp1_c08db sp1_c09da sp1_g04bd sp1_g04ca sp1_g04cb sp1_h02ab sp1_h03aa
sp1_h03bb sp1_l01ba sp1_m01ab sp1_m01ae sp1_m01ah sp1_m04aa
sp1_n02aa
sp1_n02ax sp1_n02ba sp1_n02be sp1_n03af sp1_n03ax sp1_n05ab
sp1_n05ba
sp1_n05cd sp1_n06aa sp1_n06ab sp1_n06ax sp1_p01bc sp1_r03ac
sp1_r03ak
sp1_r03ba sp1_r03bb sp1_r03dc sp1_r06ae sp1_r06ax
predict p_r
lroc
gen weight3x = weight2x * (1/p_r)
keep if sample == 1
*(reducing dataset to 600 men)
survwgt rake weight3x, ///
by(age_grp geo_grp) ///
totvars(tot_age_grp tot_geo_grp ) ///
gen(weight4x)
svyset [pweight=weight4x], strata(age_grp)
As you can see the logistic regression is quite large.
All the variables with _cat are categorical the rest are binary
variables.
The variables with diag is diagnoses on ICD10 codes the rest are
medication
on ATC codes.
I have performed the lroc command and you can see the ROC curve here:
http://www.euphonium.dk/ROC1.pdf
After running the calibration I have done some svyprop commands on
some of
the variables used in the logistic regression.
The tables are to be read two and two. First for the 3743
calibrated to 4975
and then the 600 calibrated to the 4975.
----------------------------------------------------------------------
------
--
pweight: weight2x Number of obs =
3743
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
3743
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+----------------------------------------+
| age_grp Obs Est. Prop. Std. Err. |
|----------------------------------------|
| 1 346 0.090452 0.000136 |
| 2 333 0.088040 0.000140 |
| 3 304 0.079397 0.000128 |
| 4 297 0.075377 0.000124 |
| 5 284 0.075578 0.000127 |
|----------------------------------------|
| 6 275 0.074372 0.000126 |
| 7 249 0.069146 0.000124 |
| 8 246 0.063317 0.000114 |
| 9 231 0.061508 0.000116 |
| 10 209 0.060101 0.000116 |
|----------------------------------------|
| 11 212 0.055276 0.000105 |
| 12 210 0.054472 0.000110 |
| 13 184 0.052864 0.000116 |
| 14 174 0.048442 0.000104 |
| 15 189 0.051658 0.000113 |
+----------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight4x Number of obs =
600
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
600
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+----------------------------------------+
| age_grp Obs Est. Prop. Std. Err. |
|----------------------------------------|
| 1 38 0.090452 0.008267 |
| 2 47 0.088040 0.006523 |
| 3 41 0.079397 0.009195 |
| 4 41 0.075377 0.006819 |
| 5 44 0.075578 0.008794 |
|----------------------------------------|
| 6 38 0.074372 0.007082 |
| 7 44 0.069146 0.007934 |
| 8 48 0.063317 0.004680 |
| 9 43 0.061508 0.004916 |
| 10 41 0.060101 0.004005 |
|----------------------------------------|
| 11 42 0.055276 0.003447 |
| 12 35 0.054472 0.004787 |
| 13 39 0.052864 0.006314 |
| 14 33 0.048442 0.005914 |
| 15 26 0.051658 0.006062 |
+----------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight2x Number of obs =
3743
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
3743
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+----------------------------------------+
| geo_grp Obs Est. Prop. Std. Err. |
|----------------------------------------|
| 1 207 0.056884 0.003840 |
| 2 528 0.146533 0.005868 |
| 3 681 0.176080 0.006141 |
| 4 82 0.022111 0.002415 |
| 5 490 0.133467 0.005609 |
|----------------------------------------|
| 6 485 0.128844 0.005465 |
| 7 460 0.122010 0.005345 |
| 8 429 0.108744 0.004978 |
| 9 381 0.105327 0.005098 |
+----------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight4x Number of obs =
600
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
600
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+----------------------------------------+
| geo_grp Obs Est. Prop. Std. Err. |
|----------------------------------------|
| 1 31 0.056884 0.011311 |
| 2 106 0.146533 0.015523 |
| 3 123 0.176080 0.017349 |
| 4 10 0.022111 0.007494 |
| 5 68 0.133467 0.016347 |
|----------------------------------------|
| 6 80 0.128844 0.015683 |
| 7 76 0.122010 0.015553 |
| 8 83 0.108744 0.013280 |
| 9 23 0.105327 0.021331 |
+----------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight2x Number of obs =
3743
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
3743
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+-------------------------------------------+
| child_cat Obs Est. Prop. Std. Err. |
|-------------------------------------------|
| 1 317 0.084959 0.004569 |
| 2 525 0.140334 0.005677 |
| 3 1575 0.419478 0.008049 |
| 4 892 0.238786 0.006974 |
| 5 434 0.116444 0.005238 |
+-------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight4x Number of obs =
600
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
600
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+------------------------------------------+
| child_cat Obs Est. Prop. Std. Err. |
|------------------------------------------|
| 1 28 0.081317 0.016047 |
| 2 95 0.141463 0.015804 |
| 3 282 0.436360 0.024237 |
| 4 132 0.214139 0.018806 |
| 5 63 0.126722 0.017682 |
+------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight2x Number of obs =
3743
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
3743
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+----------------------------------------------+
| marrital_cat Obs Est. Prop. Std. Err. |
|----------------------------------------------|
| 1 2983 0.796177 0.006586 |
| 2 186 0.049980 0.003575 |
| 3 289 0.077124 0.004348 |
| 4 285 0.076718 0.004331 |
+----------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight4x Number of obs =
600
Strata: age_grp Number of strata =
15
observations> Number of PSUs = 600
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+---------------------------------------------+
| marrital_cat Obs Est. Prop. Std. Err. |
|---------------------------------------------|
| 1 497 0.806723 0.020646 |
| 2 18 0.049089 0.013136 |
| 3 35 0.068634 0.013130 |
| 4 50 0.075554 0.012606 |
+---------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight2x Number of obs =
3743
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
3743
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+-----------------------------------------+
| job_kat Obs Est. Prop. Std. Err. |
|-----------------------------------------|
| 1 513 0.134823 0.005026 |
| 2 351 0.093210 0.004715 |
| 3 1024 0.271445 0.005905 |
| 4 283 0.075224 0.004200 |
| 5 1572 0.425299 0.003928 |
+-----------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight4x Number of obs =
600
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
600
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+----------------------------------------+
| job_kat Obs Est. Prop. Std. Err. |
|----------------------------------------|
| 1 75 0.138626 0.016727 |
| 2 53 0.111242 0.017358 |
| 3 150 0.257521 0.018800 |
| 4 27 0.067597 0.014395 |
| 5 295 0.425014 0.014693 |
+----------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight2x Number of obs =
3743
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
3743
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+---------------------------------------------+
| bicycle_cat Obs Est. Prop. Std. Err. |
|---------------------------------------------|
| 1 2040 0.545593 0.008148 |
| 2 257 0.068640 0.004134 |
| 3 319 0.084807 0.004544 |
| 4 341 0.090828 0.004696 |
| 5 218 0.058108 0.003827 |
|---------------------------------------------|
| 6 235 0.062901 0.003969 |
| 7 333 0.089124 0.004663 |
+---------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight4x Number of obs =
600
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
600
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+--------------------------------------------+
| bicycle_cat Obs Est. Prop. Std. Err. |
|--------------------------------------------|
| 1 300 0.565556 0.023693 |
| 2 48 0.066815 0.011137 |
| 3 58 0.085304 0.013220 |
| 4 70 0.088783 0.011871 |
| 5 33 0.047710 0.008854 |
|--------------------------------------------|
| 6 38 0.060439 0.010592 |
| 7 53 0.085394 0.012454 |
+--------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight2x Number of obs =
3743
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
3743
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+---------------------------------------------+
| jogging_cat Obs Est. Prop. Std. Err. |
|---------------------------------------------|
| 1 3576 0.955597 0.003357 |
| 2 167 0.044403 0.003357 |
+---------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight4x Number of obs =
600
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
600
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+--------------------------------------------+
| jogging_cat Obs Est. Prop. Std. Err. |
|--------------------------------------------|
| 1 567 0.950082 0.009837 |
| 2 33 0.049918 0.009837 |
+--------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight2x Number of obs =
3743
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
3743
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+-------------------------------------------+
| organised Obs Est. Prop. Std. Err. |
|-------------------------------------------|
| 1 3163 0.845951 0.005894 |
| 2 409 0.108535 0.005071 |
| 3 171 0.045514 0.003403 |
+-------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight4x Number of obs =
600
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
600
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+------------------------------------------+
| organised Obs Est. Prop. Std. Err. |
|------------------------------------------|
| 1 459 0.851567 0.013912 |
| 2 93 0.111387 0.012676 |
| 3 48 0.037046 0.005691 |
+------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight2x Number of obs =
3743
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
3743
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+-------------------------------------------+
| smoke_grp Obs Est. Prop. Std. Err. |
|-------------------------------------------|
| 1 882 0.235073 0.006911 |
| 2 1384 0.369679 0.007899 |
| 3 1477 0.395247 0.007996 |
+-------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight4x Number of obs =
600
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
600
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+------------------------------------------+
| smoke_grp Obs Est. Prop. Std. Err. |
|------------------------------------------|
| 1 142 0.249654 0.021930 |
| 2 224 0.351138 0.022435 |
| 3 234 0.399208 0.024120 |
+------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight2x Number of obs =
3743
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
3743
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+----------------------------------------------+
| education_~p Obs Est. Prop. Std. Err. |
|----------------------------------------------|
| 1 1783 0.476759 0.008146 |
| 2 936 0.250221 0.007089 |
| 3 252 0.067364 0.004100 |
| 4 487 0.129609 0.005478 |
| 5 285 0.076047 0.004329 |
+----------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight4x Number of obs =
600
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
600
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+---------------------------------------------+
| education_~p Obs Est. Prop. Std. Err. |
|---------------------------------------------|
| 1 246 0.484926 0.024701 |
| 2 160 0.240029 0.019931 |
| 3 48 0.075965 0.014029 |
| 4 85 0.124745 0.015151 |
| 5 61 0.074336 0.010873 |
+---------------------------------------------+
Some of the chronic diseases:
Hypertension:
----------------------------------------------------------------------
------
--
pweight: weight2x Number of obs =
3743
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
3743
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+----------------------------------------------+
| sp1_diag~i10 Obs Est. Prop. Std. Err. |
|----------------------------------------------|
| 0 3404 0.909003 0.004718 |
| 1 339 0.090997 0.004718 |
+----------------------------------------------+
----------------------------------------------------------------------
------
--
pweight: weight4x Number of obs =
600
Strata: age_grp Number of strata =
15
PSU: <observations> Number of PSUs =
600
Population size =
4975
----------------------------------------------------------------------
------
--
Survey proportions estimation
+---------------------------------------------+
| sp1_diag~i10 Obs Est. Prop. Std. Err. |
|---------------------------------------------|
| 0 548 0.905739 0.015309 |
| 1 52 0.094261 0.015309 |
+---------------------------------------------+
Any thoughts ?
- Kristian
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/