Attachment U - CE_CPI 200 Redesign

Attachment U - CE_CPI 2000 Redesign Within-PSU Sampling Intervals Memo.pdf

The Consumer Expenditure Surveys: The Quarterly Interview and the Diary

Attachment U - CE_CPI 200 Redesign

OMB: 1220-0050

Document [pdf]
Download: pdf | pdf
DRAFT
July 25, 2002

MEMORANDUM FOR

Kenneth V. Dalton
Associate Commissioner
Office of Prices and Living Conditions
Bureau of Labor Statistics

From:

Alan R. Tupek
Chief, Demographic Statistical Methods Division
Bureau of the Census

Subject:

Calculation of Within-PSU Sampling Intervals for the Census
2000-Based Redesign of the Consumer Expenditure Surveys and
the CPI Permit New Construction Housing Sample

I.

Purpose of this Document
This document explains how the Census Bureau will calculate within-PSU sampling
intervals for the Census 2000 -based redesign of the Consumer Expenditure Surveys
(Quarterly Interview and Diary) and the CPI Permit New Construction Housing Sample.
The calculations are based on instructions provided by the Bureau of Labor Statistics in
reference [1].

II.

Calculating Sampling Intervals for the Consumer Expenditure (CE) Surveys
There are four basic steps involved in calculating the sampling intervals for the CE
surveys. Appendices 7-10 are the code for the four SAS programs written to accomplish
these basic steps:
ƒ Allocate the national target sample size of 7700 housing units (HUs) among the 102
stratification PSUs, attempting to make the allocation as close to proportional as
possible, but subject to the constraints that each CPI Index Area have at least 80 HUs
and the Z size (non-CBSA) PSUs have a total of 400 HUs.
ƒ Calculate factors for inflating the target sample sizes to account for expected nonresponse. The factors will be based on CE response rates in the years 1999-2001.
ƒ Calculate the PSU designated sample sizes (the PSU allocations inflated for nonresponse.)

ƒ Calculate the within-PSU sampling interval for each PSU as the ratio of the PSU
measure of size 1 to the PSU designated sample size.
A.

Allocate the National Target Sample Size to the PSUs
1.

Allocate the 7300 CBSA Housing Units (HUs) to the 36 CPI Areas
There are 36 CPI Areas. Each of the 28 self-representing A PSUs is its
own CPI Area; and each of the eight region/size classes formed by the X
and Y PSUs is a CPI Area. (The four regions are 1=Northeast,
2=Midwest, 3=South, and 4=West. The two size classes are X and Y.
Thus the eight non-A CPI Areas are X100-X400 and Y100-Y400.)
We want the allocation of the 7300 HUs among the 36 areas to be as close
as possible to population proportionality, but with the constraint that we
must allocate each CPI Area a minimum of 80 HUs. We measure distance
from proportionality as the sum of squared differences between each
area’s fraction of the total population across all strata and each area’s
allocated fraction of the total 7300 HUs. We want to minimize this sum.
Briefly, this least-squares minimization problem can be stated as:
Minimize

⎛ ai
p ⎞
⎜⎜
− i ⎟⎟
∑
7300
p⎠
i =1 ⎝

Subject to

ai ≥ 80,

36

2

i = 1, 2, K , 36

Where
ai = units allocated to area i
pi = population represented by area i
36

p = ∑ pi
i =1

We solve this problem using the SAS Procedure NLP, as suggested in
reference [2]. The solution to the problem consists of the optimal values
for the ai.
See Appendix 1 for a listing of the CPI Area allocations.

1

The measure of size is the projected number of housing units in 2005 (by county.) See Reference [3] for an
explanation of the projection

2.

Allocate to the X and Y PSUs Within Each Region
Once we have determined the CPI Area allocations, we sub-allocate
within each X and Y CPI Area to the PSUs. Each PSU’s sub-allocation is
proportional to the fraction of the CPI Area total population represented
by the PSU. Specifically, the ratio of the PSU sub-allocation to the CPI
Area allocation is equal to the ratio of the population represented by the
PSU to the CPI Area total population.
See Appendix 2 for a listing of the X and Y PSU allocations.

3.

Allocate the 400 Non-CBSA HUs to the Z PSUs
We allocate the 400 HUs designated for the Z PSUs so that each Z PSU’s
allocation is proportional to the fraction of the total non-CBSA population
represented by that PSU. Specifically, the ratio of the Z PSU allocation to
400 is equal to the ratio of the population represented by the Z PSU to the
total non-CBSA population.
See the end of Appendix 2 for a listing of the Z PSU allocations.

B.

Calculate the Non-participation Inflation Factors
In order to achieve the target of obtaining completed interviews from 7,700
housing units 2 (HUs,) we need to designate enough sample HUs to account for
non-participants. We project the participation rates based on results from the CE
Interview and Diary Surveys during the three calendar years 1999 – 2001.
The final inflation factors we use are determined at the CPI Area level, or at the
region level for the Z PSUs. For brevity, within this section we use the term
“PSU group” to refer to either type of grouping.
See Appendix 3 for a listing of the PSU group inflation factors.
Our procedure for calculating the non-participation inflation factors is as follows:
1. Group the 1990 design PSUs into PSU groups corresponding to the 2000
design CPI areas or region/size classes. Specifically:
a) Except for three in the Midwest region, each 1990 A PSU is also a
2000 A PSU, with the same PSU code and CPI Area code. Each of
these is a PSU group by itself.

2
Note that we expect to get more than 7,700 completed interviews, because some housing units (HUs) contain
multiple consumer units (CUs.) We estimate a “CU inflation factor” of 1.05, so 7,700 HUs should yield 8,085
completed CU interviews (7,700 x 1.05 = 8,085).

b) The three Midwest 1990 A PSUs A212, A213, and A214 are X
PSUs in the 2000 design, so these become part of the X200 PSU
group.
c) All of the B, C, and D 1990 PSUs are grouped according to the first
two characters in their PSU code. Then we convert B to X, C to Y,
and D to Z. This results in eleven PSU groups: X100, X200, X300,
X400, Y200, Y300, Y400, Z100, Z200, Z300, and Z400.
d) Notice that there are no 1990 PSUs which directly correspond to the
2000 CPI Area Y100. Therefore the inflation factor calculated for
the X100 PSU group will also be applied to the Y100 CPI Area.
2. For each of the 39 PSU groups created in step 1, and for each of the two
surveys (Interview and Diary,) calculate the overall participation rate in that
PSU group during the period 1999 – 2001. The participation rate for the
interview survey is the number of completed interviews divided by the
number of attempted interviews. The participation rate for the Diary survey is
the number of completed diaries divided by twice the number of possible
participants (since each participant is supposed to complete two diaries.) Also
calculate national participation rates for each of the two surveys during that
period.
3. In each PSU group, and for each survey, calculate a weighted average of the
PSU group participation rate with the national participation rate:

AVGPSURATE = (0.25)( NATRATE ) + (0.75)( PSURATE )
4. In each PSU group, find the minimum of the two survey average rates, and
use the inverse of this number as the PSU group inflation factor. Also, in PSU
groups where the CED participation rate is lower than the CEQ participation
rate, calculate a CEQ sub-sampling take-every as the ratio of the CEQ rate to
the CED rate. We will sub-sample the CEQ sample after the initial samples
are selected, in order to reduce the CEQ workload in PSUs where we expect a
better participation rate for CEQ than for CED. We are doing this only for
CEQ (and not CED) because the cost of a CEQ interview is large compared
with the cost of getting a completed Diary.
C.

Calculate the PSU Designated Sample Sizes
The PSU designated sample size is TWICE the product of the PSU sub-allocation
and the PSU inflation factor. We multiply by two because we need separate
sample hits for CEQ and CED. We assign hits alternately to the two surveys’
samples during within-PSU sampling.
See Appendix 4 for a list of the PSU designated sample sizes.

D.

Calculate the PSU Sampling Intervals
1.

Project 2005 Housing Unit Counts by County
We use the same modified projections of 2005 housing unit (HU) counts
the Current Population Survey (CPS) and Survey of Income and Program
Participation (SIPP) are using. Documentation of the projections may be
found in reference [3]. We modified those projections for counties in
North Dakota, West Virginia, and the District of Columbia. The North
Dakota and West Virginia projected housing unit state totals were less
than the Census 2000 housing unit counts for those two states. This did
not seem reasonable, so we replaced the projections for those two states
with the Census 2000 counts (at the county level.) For DC, the projection
was deemed unrealistic, and we replaced it with an estimate of 268,504
housing units.

2.

Summarize HU Counts to PSU Level
For each PSU in the CE sample, we sum the projected HU counts from the
counties in that PSU. This sum is the projected PSU HU count.

3.

Calculate PSU Sampling Intervals
The final PSU sampling interval for each PSU is the ratio of the projected
PSU HU count to the PSU designated sample size calculated in C.

See Appendix 5 for a listing of the PSU sampling intervals.
III.

Calculate the CPI Permit New Construction Housing Sample Sampling Intervals
A.

Project Yearly Permit Activity in CPI Sample PSUs
We will project 2005 annual permit activity in the counties selected for the CPI
Permit New Construction Housing Sample PSUs based on county-level counts
from the permit files received in the Census Bureau Demographic Statistical
Methods Division (DSMD) from the Census Bureau Manufacturing and
Construction Division (MCD) each month (and additionally once a year for
Building Permit Offices which report annually.) We will use the files from 1997
through 2001 (calendar years.) Projections will be done separately for each
county, then summed over all counties in sample.
See Appendix 6 (not included for confidentiality reasons) for an explanation and
listing of the 2005 permit count projections by PSU.

Appendix 11 is the SAS code for the program we use to calculate the projections
and the national sampling interval.
B.

Divide Projection by 1440 and Multiply by 4 to get Sampling Interval
The final sampling interval (which is the same for all PSUs) is the ratio of the
2005 projected number of permits in CPI sample areas to the desired annual
sample size (1440), multiplied by the expected number of addresses per hit (4.)
The national sampling interval is:
1,106,785
× 4 = 3,074.4028
1,440

C.

Monitor Sample Size and Reduce When Necessary
DSMD will monitor the number of permits being selected for the CPI Permit New
Construction Housing Sample each year, and reduce the sample if it gets
significantly larger than 1,440 permits a year.

IV.

Appendices
Appendix 1:

Listing of Target Sample Size Allocations by CPI Area

Appendix 2:

Listing of Target Sample Size Allocations by PSU

Appendix 3:

Listing of CE Participation Rates and Calculated Inflation Factors by
Region/Size Class

Appendix 4:

Listing of PSU Designated Sample Sizes

Appendix 5:

Listing of PSU Sampling Intervals

Appendix 6:

Documentation of 2005 Permit Activity Projection for Counties in the CPI
Permit New Construction Housing Sample (not included for
confidentiality reasons)

Appendix 7:

SAS Program to Allocate National Target Sample Size to PSUs

Appendix 8:

SAS Program to Calculate PSU Inflation Factors

Appendix 9:

SAS Program to Calculate PSU Designated Sample Sizes

Appendix 10: SAS Program to Calculate PSU Sampling Intervals

Appendix 11: SAS Program to Project 2005 Permit Counts by County and Calculate the
National Sampling Interval for the CPI New Construction Housing
Sample

V.

References

[1] Memorandum to Bowie from Dalton, “Specifications for the Selection of CE/CPI Samples
in PSUs Based on the 2000 Census,” June 28, 2002
[2] Johnson-Herring, et. al., “Determining Within-PSU Sample Sizes for the Consumer
Expenditure Survey,” 
[3] Memorandum for Documentation from Lawrence S. Cahoon, prepared by David Hall,
“Updated County-Level Population and Housing Unit Projections (Doc. #3.2-?-?),” 

VI.

Contacts
If you have any questions about this memorandum, please contact one of the following:
Padraic Murphy
Phone: 301-763-2192
e-mail: Padraic.A.Murphy@census.gov
Stephen Ash
Phone: 301-763-4294
e-mail: Stephen.Eliot.Ash@census.gov
Karen King
Phone: 301-763-1974
e-mail: Karen.E.King@census.gov

Appendix 1:

Listing of Target Sample Size Allocations by CPI Area

CE REDESIGN 2000
TARGET SAMPLE SIZE
ALLOCATIONS FOR CPI AREAS

CPI_AREA
A102
A103
A104
A109
A110
A111
A207
A208
A209
A210
A211
A312
A313
A316
A318
A319
A320
A321
A419
A420
A422
A423
A424
A425
A426
A427
A429
A433
X100
X200
X300
X400
Y100
Y200
Y300
Y400

CPI_AREA_
ALLOCATION
168.78
194.62
80.00
220.45
212.23
182.22
253.50
147.99
80.00
80.00
82.11
135.82
80.00
142.87
126.95
112.35
103.13
80.00
344.18
106.86
192.94
93.99
80.00
80.00
80.00
80.00
85.39
80.00
302.33
696.78
1342.32
445.80
80.00
240.60
342.96
142.83
==========
7300.00

Appendix 1 Page 1

Appendix 2: Listing of Target Sample Size Allocations by PSU
CE REDESIGN 2000
TARGET SAMPLE SIZE
ALLOCATIONS FOR X- AND Y-SIZE PSUS
CPI_AREA=X100
BLSPSU2K

PSU_ALLOCATION

X102
X104
X106
X108
-------CPI_AREA

99.477
70.190
57.713
74.950
-------------302.329

CPI_AREA=X200
BLSPSU2K

PSU_ALLOCATION

X210
X212
X214
X216
X218
X220
X222
X224
X226
X228
X230
X232
-------CPI_AREA

60.838
67.288
81.062
32.641
76.711
66.968
51.719
48.761
69.047
53.394
38.940
49.408
-------------696.776

CPI_AREA=X300
BLSPSU2K
X334
X336
X338
X340
X342
X344
X346
X348

PSU_ALLOCATION
76.9139
79.9226
79.3661
81.7532
74.3940
83.2174
75.3741
54.9928

Appendix 2 Page 1

Appendix 2: Listing of Target Sample Size Allocations by PSU
CE REDESIGN 2000
TARGET SAMPLE SIZE
ALLOCATIONS FOR X- AND Y-SIZE PSUS
CPI_AREA=X300
(continued)
BLSPSU2K

PSU_ALLOCATION

X350
X352
X354
X356
X358
X360
X362
X364
X366
X368
-------CPI_AREA

63.92
81.63
82.28
82.98
81.48
61.34
74.39
83.00
42.09
83.28
-------------1342.32

CPI_AREA=X400
BLSPSU2K

PSU_ALLOCATION

X470
X472
X474
X476
X478
X480
X482
X484
-------CPI_AREA

72.832
47.220
63.038
47.159
69.271
49.186
48.631
48.466
-------------445.801

CPI_AREA=Y100
BLSPSU2K

PSU_ALLOCATION

Y102
Y104
-------CPI_AREA

36.9914
43.0086
-------------80.0000

Appendix 2 Page 2

Appendix 2: Listing of Target Sample Size Allocations by PSU
CE REDESIGN 2000
TARGET SAMPLE SIZE
ALLOCATIONS FOR X- AND Y-SIZE PSUS
CPI_AREA=Y200
BLSPSU2K

PSU_ALLOCATION

Y206
Y208
Y210
Y212
-------CPI_AREA

55.062
65.484
54.589
65.465
-------------240.600

CPI_AREA=Y300
BLSPSU2K

PSU_ALLOCATION

Y314
Y316
Y318
Y320
Y322
Y324
-------CPI_AREA

54.700
63.194
52.412
55.184
65.243
52.231
-------------342.963

CPI_AREA=Y400
BLSPSU2K

PSU_ALLOCATION

Y426
Y428
Y430
Y432
-------CPI_AREA

34.58
31.61
38.96
37.69
-------------142.83
==============
3593.62

Appendix 2 Page 3

Appendix 2: Listing of Target Sample Size Allocations by PSU
CE REDESIGN 2000
TARGET SAMPLE SIZE
ALLOCATIONS FOR Z-SIZE PSUS
BLSPSU2K

PSU_ALLOCATION

Z102
Z104
Z206
Z208
Z210
Z212
Z314
Z316
Z318
Z320
Z322
Z324
Z426
Z428
Z430
Z432

14.701
22.106
33.625
24.830
30.532
36.261
30.730
29.161
30.900
40.570
37.511
22.319
10.787
9.372
12.950
13.646
==============
400.000

Appendix 2 Page 4

Appendix 3:

Listing of CE Response Rates and Calculated Inflation Factors by Region/Size Class

CE REDESIGN 2000
PARTICIPATION RATES AND INFLATION FACTORS
BY REGION/SIZE CLASS

PSU
GROUP
A102
A103
A104
A109
A110
A111
A207
A208
A209
A210
A211
A312
A313
A316
A318
A319
A320
A321
A419
A420
A422
A423
A424
A425
A426
A427
A429
A433
X100
X200
X300
X400
Y100
Y200
Y300
Y400
Z100
Z200
Z300
Z400

CEQ
PARTICIPATION
RATE

CEQ NATIONAL
PARTICIPATION
RATE

0.55420
0.69146
0.66183
0.60870
0.66438
0.65113
0.60519
0.65917
0.64473
0.68197
0.71021
0.65031
0.65504
0.67569
0.68746
0.71374
0.63616
0.68176
0.65677
0.58660
0.69853
0.68044
0.56673
0.75083
0.64114
0.67037
0.61357
0.65863
0.68091
0.70173
0.64185
0.64113
0.68091
0.69125
0.61565
0.66063
0.58890
0.53256
0.60390
0.58273

0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988
0.64988

CEQ
WEIGHTED
AVERAGE
RATE
0.57812
0.68106
0.65884
0.61899
0.66076
0.65082
0.61637
0.65684
0.64602
0.67395
0.69513
0.65020
0.65375
0.66924
0.67806
0.69778
0.63959
0.67379
0.65505
0.60242
0.68636
0.67280
0.58751
0.72559
0.64333
0.66524
0.62264
0.65644
0.67315
0.68876
0.64385
0.64331
0.67315
0.68090
0.62421
0.65794
0.60414
0.56189
0.61540
0.59952

CED
PARTICIPATION
RATE

CED NATIONAL
PARTICIPATION
RATE

0.49284
0.72693
0.66287
0.49392
0.63752
0.62884
0.50191
0.67039
0.66603
0.62427
0.84375
0.61212
0.51782
0.63666
0.65410
0.68305
0.58473
0.61142
0.63379
0.57124
0.73390
0.75000
0.63311
0.75054
0.58245
0.64931
0.60151
0.69811
0.68361
0.65887
0.59355
0.65653
0.68361
0.67328
0.55663
0.68908
0.63722
0.49407
0.52217
0.59038

0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024
0.62024

CED
WEIGHTED
AVERAGE
RATE
0.52469
0.70026
0.65221
0.52550
0.63320
0.62669
0.53149
0.65785
0.65458
0.62326
0.78787
0.61415
0.54343
0.63255
0.64564
0.66735
0.59361
0.61362
0.63040
0.58349
0.70548
0.71756
0.62989
0.71797
0.59190
0.64204
0.60619
0.67864
0.66777
0.64921
0.60022
0.64746
0.66777
0.66002
0.57253
0.67187
0.63297
0.52561
0.54669
0.59784

INFLATION
FACTOR
USED

CEQ
SUMSAMPLING
TAKE-EVERY

1.90590
1.46829
1.53324
1.90296
1.57929
1.59568
1.88151
1.52243
1.54794
1.60447
1.43858
1.62828
1.84018
1.58089
1.54886
1.49847
1.68461
1.62967
1.58629
1.71382
1.45696
1.48633
1.70209
1.39282
1.68948
1.55753
1.64964
1.52337
1.49753
1.54033
1.66605
1.55445
1.49753
1.51510
1.74663
1.51988
1.65523
1.90255
1.82919
1.67269

1.10183
1.00000
1.01017
1.17791
1.04353
1.03850
1.15970
1.00000
1.00000
1.08133
1.00000
1.05870
1.20301
1.05800
1.05022
1.04560
1.07746
1.09805
1.03910
1.03244
1.00000
1.00000
1.00000
1.01062
1.08689
1.03613
1.02714
1.00000
1.00807
1.06092
1.07269
1.00000
1.00807
1.03164
1.09026
1.00000
1.00000
1.06902
1.12567
1.00281

Notes on Calculations:
For each survey,
[Weighted Average Rate] = (0.75)[PSU Rate] + (0.25)[National Rate]
Then,
[Inflation Factor Used] = 1 / minimum{ [CEQ Avg Rate], [CED Avg Rate] }
Appendix 3 Page 1

Appendix 4: Listing of PSU Designated Sample Sizes
CE 2000 REDESIGN
DESIGNATED SAMPLE SIZES

STRAT
PSU
A102
A103
A104
A109
A110
A111
A207
A208
A209
A210
A211
A312
A313
A316
A318
A319
A320
A321
A419
A420
A422
A423
A424
A425
A426
A427
A429
A433
X102
X104
X106
X108
X210
X212
X214
X216
X218
X220
X222
X224

ALLOCATED
TARGET
SAMPLE
SIZE

NON-RESPONSE
INFLATION
FACTOR

168.778
194.615
80.000
220.452
212.232
182.217
253.500
147.992
80.000
80.000
82.109
135.821
80.000
142.866
126.951
112.350
103.126
80.000
344.180
106.864
192.940
93.994
80.000
80.000
80.000
80.000
85.393
80.000
99.477
70.190
57.713
74.950
60.838
67.288
81.062
32.641
76.711
66.968
51.719
48.761

1.90590
1.46829
1.53324
1.90296
1.57929
1.59568
1.88151
1.52243
1.54794
1.60447
1.43858
1.62828
1.84018
1.58089
1.54886
1.49847
1.68461
1.62967
1.58629
1.71382
1.45696
1.48633
1.70209
1.39282
1.68948
1.55753
1.64964
1.52337
1.49753
1.49753
1.49753
1.49753
1.54033
1.54033
1.54033
1.54033
1.54033
1.54033
1.54033
1.54033

DESIGNATED
SAMPLE
SIZE
643.35
571.50
245.32
839.02
670.35
581.52
953.92
450.62
247.67
256.72
236.24
442.31
294.43
451.71
393.26
336.71
347.46
260.75
1091.94
366.29
562.21
279.41
272.33
222.85
270.32
249.20
281.74
243.74
297.94
210.22
172.85
224.48
187.42
207.29
249.72
100.56
236.32
206.31
159.33
150.22

Note on Calculation:
Designated Sample Size = 2 x [Allocated Target Sample Size] x [Non-response Inflation Factor]
(Factor of 2 accounts for both Interview and Diary surveys needing the same sample size)
Appendix 4 Page 1

Appendix 4: Listing of PSU Designated Sample Sizes
X226

69.047

1.54033

212.71

Note on Calculation:
Designated Sample Size = 2 x [Allocated Target Sample Size] x [Non-response Inflation Factor]
(Factor of 2 accounts for both Interview and Diary surveys needing the same sample size)
Appendix 4 Page 2

Appendix 4: Listing of PSU Designated Sample Sizes
CE 2000 REDESIGN
DESIGNATED SAMPLE SIZES

STRAT
PSU
X228
X230
X232
X334
X336
X338
X340
X342
X344
X346
X348
X350
X352
X354
X356
X358
X360
X362
X364
X366
X368
X470
X472
X474
X476
X478
X480
X482
X484
Y102
Y104
Y206
Y208
Y210
Y212
Y314
Y316
Y318
Y320
Y322

ALLOCATED
TARGET
SAMPLE
SIZE

NON-RESPONSE
INFLATION
FACTOR

53.3938
38.9397
49.4079
76.9139
79.9226
79.3661
81.7532
74.3940
83.2174
75.3741
54.9928
63.9224
81.6296
82.2828
82.9765
81.4750
61.3410
74.3917
82.9983
42.0897
83.2806
72.8318
47.2201
63.0378
47.1591
69.2706
49.1856
48.6307
48.4659
36.9914
43.0086
55.0616
65.4837
54.5892
65.4655
54.6998
63.1940
52.4118
55.1839
65.2426

1.54033
1.54033
1.54033
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.66605
1.55445
1.55445
1.55445
1.55445
1.55445
1.55445
1.55445
1.55445
1.49753
1.49753
1.51510
1.51510
1.51510
1.51510
1.74663
1.74663
1.74663
1.74663
1.74663

DESIGNATED
SAMPLE
SIZE
164.488
119.960
152.209
256.284
266.310
264.455
272.409
247.888
277.288
251.154
183.241
212.995
271.998
274.174
276.486
271.483
204.394
247.880
276.558
140.247
277.499
226.427
146.803
195.979
146.613
215.356
152.913
151.188
150.676
110.791
128.813
166.848
198.429
165.416
198.374
191.081
220.753
183.088
192.772
227.910

Note on Calculation:
Designated Sample Size = 2 x [Allocated Target Sample Size] x [Non-response Inflation Factor]
(Factor of 2 accounts for both Interview and Diary surveys needing the same sample size)
Appendix 4 Page 3

Appendix 4: Listing of PSU Designated Sample Sizes
Y324

52.2307

1.74663

182.456

Note on Calculation:
Designated Sample Size = 2 x [Allocated Target Sample Size] x [Non-response Inflation Factor]
(Factor of 2 accounts for both Interview and Diary surveys needing the same sample size)
Appendix 4 Page 4

Appendix 4: Listing of PSU Designated Sample Sizes
CE 2000 REDESIGN
DESIGNATED SAMPLE SIZES

STRAT
PSU
Y426
Y428
Y430
Y432
Z102
Z104
Z206
Z208
Z210
Z212
Z314
Z316
Z318
Z320
Z322
Z324
Z426
Z428
Z430
Z432

ALLOCATED
TARGET
SAMPLE
SIZE

NON-RESPONSE
INFLATION
FACTOR

DESIGNATED
SAMPLE
SIZE

34.5756
31.6066
38.9602
37.6852
14.7006
22.1060
33.6248
24.8297
30.5316
36.2610
30.7304
29.1609
30.9002
40.5699
37.5111
22.3189
10.7867
9.3723
12.9503
13.6456

1.51988
1.51988
1.51988
1.51988
1.65523
1.65523
1.90255
1.90255
1.90255
1.90255
1.82919
1.82919
1.82919
1.82919
1.82919
1.82919
1.67269
1.67269
1.67269
1.67269

105.10
96.08
118.43
114.55
48.67
73.18
127.95
94.48
116.18
137.98
112.42
106.68
113.04
148.42
137.23
81.65
36.09
31.35
43.32
45.65
==========
25028.79

Deleted: ¶
¶

Note on Calculation:
Designated Sample Size = 2 x [Allocated Target Sample Size] x [Non-response Inflation Factor]
(Factor of 2 accounts for both Interview and Diary surveys needing the same sample size)
Appendix 4 Page 5

Appendix 5: Listing of PSU Sampling Intervals
CE 2000 REDESIGN
WITHIN-PSU SAMPLING INTERVALS

STRAT
PSU
A102
A103
A104
A109
A110
A111
A207
A208
A209
A210
A211
A312
A313
A316
A318
A319
A320
A321
A419
A420
A422
A423
A424
A425
A426
A427
A429
A433
X102
X104
X106
X108
X210
X212
X214
X216
X218
X220
X222
X224
X226
X228

PROJECTED
2005 HU
COUNT
2644191
3011714
1104734
3394801
3102041
2735086
3656389
2261974
1182324
1264006
1315627
2099845
1116457
2300308
2016412
1834109
1811318
1300498
4712837
1597944
2946667
1585272
1156037
995210
329978
134075
1553094
1213534
153879
294956
437022
51073
211424
721424
111463
69517
837510
58658
114574
97159
770985
87590

DESIGNATED
SAMPLE
SIZE
643.35
571.50
245.32
839.02
670.35
581.52
953.92
450.62
247.67
256.72
236.24
442.31
294.43
451.71
393.26
336.71
347.46
260.75
1091.94
366.29
562.21
279.41
272.33
222.85
270.32
249.20
281.74
243.74
297.94
210.22
172.85
224.48
187.42
207.29
249.72
100.56
236.32
206.31
159.33
150.22
212.71
164.49

PSU SAMPLING
INTERVAL
4,110.0307
5,269.8147
4,503.2623
4,046.1342
4,627.4896
4,703.3136
3,833.0045
5,019.7460
4,773.7656
4,923.7661
5,569.0242
4,747.4739
3,791.9436
5,092.4152
5,127.4678
5,447.2015
5,213.0894
4,987.5927
4,316.0205
4,362.4799
5,241.2260
5,673.6122
4,244.9291
4,465.7914
1,220.7088
538.0122
5,512.5904
4,978.8245
516.4790
1,403.0638
2,528.2766
227.5182
1,128.0763
3,480.2297
446.3440
691.3239
3,543.9343
284.3260
719.1100
646.7919
3,624.5938
532.4998

Note on Calculation:
Sampling Interval = [Projected HU Count] / [Designated Sample Size]
Appendix 5 Page 1

Appendix 5: Listing of PSU Sampling Intervals
CE 2000 REDESIGN
WITHIN-PSU SAMPLING INTERVALS

STRAT
PSU
X230
X232
X334
X336
X338
X340
X342
X344
X346
X348
X350
X352
X354
X356
X358
X360
X362
X364
X366
X368
X470
X472
X474
X476
X478
X480
X482
X484
Y102
Y104
Y206
Y208
Y210
Y212
Y314
Y316
Y318
Y320
Y322
Y324
Y426
Y428

PROJECTED
2005 HU
COUNT
167478
209848
368870
139926
726725
292699
451702
176654
677261
97410
139446
278981
568413
87650
299860
82142
136875
540039
251484
437673
817875
138971
703800
79841
300935
75991
114398
86486
59102
50702
45581
21881
11881
13924
54340
20131
16683
21601
47787
19898
28963
58793

DESIGNATED
SAMPLE
SIZE
119.960
152.209
256.284
266.310
264.455
272.409
247.888
277.288
251.154
183.241
212.995
271.998
274.174
276.486
271.483
204.394
247.880
276.558
140.247
277.499
226.427
146.803
195.979
146.613
215.356
152.913
151.188
150.676
110.791
128.813
166.848
198.429
165.416
198.374
191.081
220.753
183.088
192.772
227.910
182.456
105.102
96.077

PSU SAMPLING
INTERVAL
1,396.1149
1,378.6836
1,439.3000
525.4255
2,748.0055
1,074.4816
1,822.2038
637.0773
2,696.5993
531.5946
654.6899
1,025.6738
2,073.1811
317.0145
1,104.5279
401.8809
552.1818
1,952.7130
1,793.1529
1,577.2055
3,612.0868
946.6497
3,591.2093
544.5692
1,397.3849
496.9545
756.6586
573.9863
533.4531
393.6088
273.1889
110.2712
71.8248
70.1907
284.3822
91.1924
91.1201
112.0547
209.6752
109.0566
275.5706
611.9384

Note on Calculation:
Sampling Interval = [Projected HU Count] / [Designated Sample Size]
Appendix 5 Page 2

Appendix 5: Listing of PSU Sampling Intervals
CE 2000 REDESIGN
WITHIN-PSU SAMPLING INTERVALS

STRAT
PSU
Y430
Y432
Z102
Z104
Z206
Z208
Z210
Z212
Z314
Z316
Z318
Z320
Z322
Z324
Z426
Z428
Z430
Z432

PROJECTED
2005 HU
COUNT
48781
95340
29593
43955
21856
18482
12456
43005
12140
31685
13167
37531
62302
11050
31493
9868
13671
8382

DESIGNATED
SAMPLE
SIZE
118.430
114.554
48.666
73.181
127.946
94.479
116.176
137.977
112.423
106.681
113.045
148.420
137.230
81.651
36.086
31.354
43.323
45.650

PSU SAMPLING
INTERVAL
411.8970
832.2689
608.0841
600.6324
170.8223
195.6195
107.2170
311.6832
107.9847
297.0057
116.4759
252.8699
453.9979
135.3321
872.7299
314.7304
315.5567
183.6157
Deleted: ¶

Note on Calculation:
Sampling Interval = [Projected HU Count] / [Designated Sample Size]
Appendix 5 Page 3

Appendix 7:

SAS Program to Allocate National Target Sample Size to PSUs

*************************************************************
*
CREATE A DATA SET WITH THE CPI AREA POPULATIONS
*
*
INPUT: CE-ONLY PSU DEFINITIONS FILE FROM BLS
*
*************************************************************;
%MACRO LOADPSUS(NAME);
DATA &NAME.;
INFILE "T:\COMMON\CE Sampling Intervals\DATA\BLSFILES\&NAME..TXT"
LRECL=35 PAD MISSOVER;
INPUT
@1 REGION $1.
@3 FIPSST $2.
@6 FIPSCTY $3.
@10 BLSPSU2K $4.
@15 SR_NSR $1.
@17 STRATPOP 8.0
@26 UPROB 10.8;
LENGTH CPI_AREA $4.;
IF PUT(BLSPSU2K,$1.)='A' THEN CPI_AREA=BLSPSU2K;
ELSE CPI_AREA = PUT(BLSPSU2K,$2.)||'00';
PROC APPEND BASE=BLS_CE_FILE DATA=&NAME.;
RUN;
%MEND;
%LOADPSUS(CENSOUT2000CPI);
%LOADPSUS(CENSOUT2000CE);
/* COLLAPSE COUNTY-LEVEL DATA SET TO PSU-LEVEL DATA SET

*/

PROC SORT DATA=BLS_CE_FILE NODUPKEY
OUT=PSUS(KEEP=CPI_AREA BLSPSU2K STRATPOP);
BY BLSPSU2K;
RUN;

PROC SUMMARY DATA=PSUS(WHERE=(CPI_AREA < 'Z100')) NWAY;
CLASS CPI_AREA;
VAR STRATPOP;
OUTPUT OUT=CPI_AREAS(KEEP=CPI_AREA STRATPOP) SUM=;
DATA CPI_AREAS;
SET CPI_AREAS;
I+1;
DATA POP_DATA;
ARRAY POP[36];
DO UNTIL(LASTOBS);
SET CPI_AREAS END=LASTOBS;
POP[I]=STRATPOP;
END;
KEEP POP1-POP36;
RUN;

******************************************************

Appendix 7 Page 1

Appendix 7:

SAS Program to Allocate National Target Sample Size to PSUs

* COMPUTE THE SQUARED DIFFERENCE BETWEEN EACH
*
* CPI AREA'S PROPORTION OF THE POPULATION & ITS
*
* PROPORTION OF THE SAMPLE.
*
******************************************************;
%MACRO MAC1;
SUM_POP = SUM(OF POP1-POP36);
%DO I=1 %TO 36;
SQR&I = ((A&I/7300) - (POP&I/SUM_POP))**2;
%END;
%MEND MAC1;
*************************************************
* SOLVE A CONSTRAINED LEAST SQUARES PROBLEM TO *
* FIND THE NUMBER OF HOUSING UNITS IN EACH PSU *
* THAT MINIMIZES THE SUM OF SQUARED DIFFERENCES *
************************************************;
PROC NLP DATA=POP_DATA NOPRINT
OUT=RESULTS(KEEP=A1-A36)
/* CONVERGENCE CRITERIA */
GCONV=1E-15
GCONV2=1E-15
ABSGCONV=1E-15
FCONV2=1E-15
MAXITER=100000 ;
/* DECISION VARIABLES */
DECVAR A1-A36;
/* COMPUTE THE SQUARED DIFFERENCES */
%MAC1;
/* SUM THE SQUARED DIFFERENCES */
F1=SUM(OF SQR1-SQR36);
/* FUNCTION TO BE MINIMIZED */
MIN F1;
/* PROBLEM CONSTRAINTS */
BOUNDS A1-A36>=80;
NLINCON F2=7300;
F2=SUM(OF A1-A36);
RUN;
*****************************************************
*
RE-LINK TO CPI-AREA CODES *
****************************************************;
DATA RESULTS;
ARRAY A[36] A1-A36;
SET RESULTS;
DO I = 1 TO 36;

Appendix 7 Page 2

Appendix 7:

SAS Program to Allocate National Target Sample Size to PSUs

ALLOCATION = A[I];
OUTPUT;
END;
KEEP I ALLOCATION;
PROC SORT DATA=RESULTS; BY I;
PROC SORT DATA=CPI_AREAS; BY I;
DATA FINAL_NLP_ALLOCATION;
MERGE CPI_AREAS RESULTS;
BY I;
DROP I;
RUN;
*********************************************************
*
PROPORTIONALLY ALLOCATE TARGET SAMPLE SIZES
*
*
TO PSUs WITHIN X AND Y CPI AREAS BY STRATUM POPS *
********************************************************;
/* ALLOCATE WITHIN CPI AREAS */
%MACRO ALLOCPSU(CPIAREA);
DATA _NULL_;
SET FINAL_NLP_ALLOCATION;
WHERE CPI_AREA = "&CPIAREA.";
CALL SYMPUT("CPIALLOC",ALLOCATION);
RUN;
DATA &CPIAREA.;
SET PSUS;
WHERE CPI_AREA = "&CPIAREA.";
KEEP CPI_AREA BLSPSU2K STRATPOP;
PROC FREQ DATA=&CPIAREA.;
WEIGHT STRATPOP;
TABLES BLSPSU2K /NOPRINT OUT=TEMP(DROP=COUNT);
PROC SORT DATA=TEMP; BY BLSPSU2K;
PROC SORT DATA=&CPIAREA.; BY BLSPSU2K;
DATA &CPIAREA.;
MERGE &CPIAREA. TEMP END=LASTONE;
BY BLSPSU2K;
PSU_ALLOCATION = &CPIALLOC. * PERCENT / 100 ;
KEEP CPI_AREA BLSPSU2K PSU_ALLOCATION;
RUN;
/* APPEND CPI AREA DATA SET TO CUMULATIVE DATA SET OF ALL PSUS

*/

PROC APPEND BASE=PSU_ALLOCATIONS DATA=&CPIAREA.;
RUN;
%MEND;
%ALLOCPSU(X100)
%ALLOCPSU(X200)
%ALLOCPSU(X300)
%ALLOCPSU(X400)
%ALLOCPSU(Y100)

Appendix 7 Page 3

Appendix 7:

SAS Program to Allocate National Target Sample Size to PSUs

%ALLOCPSU(Y200)
%ALLOCPSU(Y300)
%ALLOCPSU(Y400);
*********************************************************
*
APPEND "A" PSUs TO CUMULATIVE DATA SET OF ALL PSUS *
********************************************************;
PROC SORT DATA=PSU_ALLOCATIONS;
BY CPI_AREA;
PROC SORT DATA=FINAL_NLP_ALLOCATION;
BY CPI_AREA;
DATA PSU_ALLOCATIONS;
MERGE PSU_ALLOCATIONS(IN=XY) FINAL_NLP_ALLOCATION;
BY CPI_AREA;
IF NOT XY THEN DO;
BLSPSU2K = CPI_AREA;
PSU_ALLOCATION = ALLOCATION;
END;
RENAME ALLOCATION=CPI_AREA_ALLOCATION;
RUN;

*****************************************************
*
PROPORTIONALLY ALLOCATE 400 UNITS AMONG Z PSUS
*
*
AND APPEND Z PSU ALLOCATION DATA SET
*
****************************************************;
PROC SORT DATA=BLS_CE_FILE(WHERE=(PUT(BLSPSU2K,$1.) = 'Z'))
OUT=ZPSUS(KEEP=BLSPSU2K STRATPOP)
NODUPKEY;
BY BLSPSU2K;
PROC SUMMARY DATA=ZPSUS NWAY;
VAR STRATPOP;
OUTPUT OUT=ZSUM(KEEP=ZSUM) SUM=ZSUM;
DATA ZPSUS;
SET ZSUM;
DO UNTIL(LAST);
SET ZPSUS END=LAST;
PSU_ALLOCATION = 400 * ( STRATPOP / ZSUM );
CPI_AREA = 'ZALL';
CPI_AREA_ALLOCATION = 400;
OUTPUT;
END;
KEEP CPI_AREA BLSPSU2K PSU_ALLOCATION CPI_AREA_ALLOCATION STRATPOP;
PROC APPEND BASE=PSU_ALLOCATIONS DATA=ZPSUS;
RUN;

*****************************************************************
*
DISPLAY PSU ALLOCATIONS AND COMPARE PSU ALLOCATION SUMS
*
*
WITHIN EACH CPI AREA WITH THE ORIGINAL CPI AREA ALLOCATION. *
****************************************************************;

Appendix 7 Page 4

Appendix 7:

SAS Program to Allocate National Target Sample Size to PSUs

PROC SORT DATA=PSU_ALLOCATIONS;
BY CPI_AREA BLSPSU2K;
DATA CPI_AREAS;
SET PSU_ALLOCATIONS;
BY CPI_AREA;
IF FIRST.CPI_AREA AND CPI_AREA < 'Z100';
KEEP CPI_AREA CPI_AREA_ALLOCATION;
RUN;
TITLE'CE REDESIGN 2000';
TITLE2 'TARGET SAMPLE SIZE';
PROC PRINT DATA=CPI_AREAS NOOBS;
TITLE3 'ALLOCATIONS FOR CPI AREAS';
VAR CPI_AREA CPI_AREA_ALLOCATION;
SUM CPI_AREA_ALLOCATION;
PROC PRINT DATA=PSU_ALLOCATIONS NOOBS;
TITLE3 'ALLOCATIONS FOR X- AND Y-SIZE PSUS';
WHERE PUT(CPI_AREA,$1.) IN ('X','Y');
BY CPI_AREA;
VAR BLSPSU2K PSU_ALLOCATION;
SUM PSU_ALLOCATION;
SUMBY CPI_AREA;
RUN;
PROC PRINT DATA=PSU_ALLOCATIONS NOOBS;
TITLE3 'ALLOCATIONS FOR Z-SIZE PSUS';
WHERE PUT(CPI_AREA,$1.) = 'Z';
VAR BLSPSU2K PSU_ALLOCATION;
SUM PSU_ALLOCATION;
RUN;

Appendix 7 Page 5

Appendix 8:

SAS Program to Calculate PSU Inflation Factors

*****************************************************************
*
USE CEQ AND CED INTERVIEW STATUS DATA FROM THE PERIOD
*
*
1999 - 2001 TO CALCULATE PARTICIPATION RATES FOR CPI AREAS
*
*
AND ALSO NATIONAL RATES FOR EACH SURVEY. FOR EACH CPI
*
*
AREA, CALCULATE A FACTOR WHICH IS A WEIGHTED AVERAGE
*
*
OF THE CPI AREA RATE AND THE NATIONAL RATE, WITH THE
*
*
CPI AREA RATE WEIGHTED 75% AND THE NATIONAL RATE
*
*
WEIGHTED 25%.
*
*****************************************************************;

LIBNAME CEQ 'T:\COMMON\CE Sampling Intervals\DATA\CE DATA 99_01\CEQ';
LIBNAME CED 'T:\COMMON\CE Sampling Intervals\DATA\CE DATA 99_01\CED';

/* LOAD CEQ DATA

*/

%MACRO LOADCEQ(MONTH);
DATA TEMP;
LENGTH ID $9. STATUS $2.;
ARRAY ISTAT[5] $ INTSTAT1-INTSTAT5;
SET CEQ.INT&MONTH.;
ID = PUT(CENSID,$9.);
STATUS = ISTAT[INPUT(INTERI,1.)];
IF STATUS = '01' THEN STATUS = 'I';
ELSE STATUS = 'NI';
KEEP ID STATUS;
PROC APPEND DATA=TEMP BASE=CEQ;
RUN;
%MEND;

%MACRO DOQYEAR(Y);
%DO M = 1 %TO 9;
%LOADCEQ(&Y.0&M.);
%END;
%DO M=10 %TO 12;
%LOADCEQ(&Y.&M.);
%END;
%MEND;
%DOQYEAR(99)
%DOQYEAR(00)
%DOQYEAR(01);
PROC SORT DATA=CEQ;
BY ID;
RUN;
DATA IDTOCPIA;
INFILE 'T:\COMMON\CE Sampling Intervals\DATA\CE DATA
99_01\CEQ\CE_CENSID_TO_CPI_AREA.TXT';
INPUT @1 ID $9. @11 CPI_AREA $4.;
RUN;
PROC SORT;

Appendix 8 Page 1

Appendix 8:

SAS Program to Calculate PSU Inflation Factors

BY ID;
RUN;
DATA CEQ;
MERGE CEQ(IN=OK) IDTOCPIA;
BY ID;
IF OK;
/*
CONVERT OBSERVATIONS FROM A212, A213, A214 TO CPI AREA X200
IF CPI_AREA IN ('A212','A213','A214') THEN CPI_AREA = 'X200';

*/

KEEP CPI_AREA STATUS;
RUN;
/*

LOAD CED DATA

*/

%MACRO LOADCED(MONTH);
DATA TEMP;
LENGTH CPI_AREA $4. STATUS $2.;
SET CED.CED_&MONTH.;
SELECT(PUT(BLSPSU,$1.));
WHEN('A') CPI_AREA=BLSPSU;
WHEN('B') CPI_AREA='X'||SUBSTR(BLSPSU,2,1)||'00';
WHEN('C') CPI_AREA='Y'||SUBSTR(BLSPSU,2,1)||'00';
WHEN('D') CPI_AREA='Z'||SUBSTR(BLSPSU,2,1)||'00';
OTHERWISE;
END;
/*
CONVERT OBSERVATIONS FROM A212, A213, A214 TO CPI AREA X200
IF CPI_AREA IN ('A212','A213','A214') THEN CPI_AREA = 'X200';

*/

DO W=1 TO 2;
IF W=1 THEN STATUS=INTSTAT1;
ELSE STATUS=INTSTAT2;
IF STATUS = '01' THEN STATUS = 'I';
ELSE STATUS = 'NI';
OUTPUT;
END;
KEEP CPI_AREA STATUS;
RUN;
PROC APPEND DATA=TEMP BASE=CED;
RUN;
%MEND;

%MACRO DODYEAR(Y);
%DO M = 1 %TO 9;
%LOADCED(&Y.0&M.);
%END;
%DO M=10 %TO 12;
%LOADCED(&Y.&M.);
%END;
%MEND;
%DODYEAR(99)

Appendix 8 Page 2

Appendix 8:

SAS Program to Calculate PSU Inflation Factors

%DODYEAR(00)
%DODYEAR(01);

/*

GET PARTICIPATION RATES AND CALCULATE FACTORS FOR EACH SURVEY */
%MACRO RATES(DSNAME);
/* CPI AREA RATES */
PROC SORT DATA=&DSNAME.;
BY CPI_AREA;
PROC FREQ DATA=&DSNAME.;
BY CPI_AREA;
TABLES STATUS /NOPRINT OUT=&DSNAME._CPI_AREA_RATES(DROP=COUNT);
RUN;
DATA &DSNAME._CPI_AREA_RATES;
SET &DSNAME._CPI_AREA_RATES;
WHERE STATUS='I';
&DSNAME._CPI_AREA_RATE = PERCENT / 100;
KEEP CPI_AREA &DSNAME._CPI_AREA_RATE;
RUN;
/* NATIONAL RATE */
PROC FREQ DATA=&DSNAME.;
TABLES STATUS /NOPRINT OUT=&DSNAME._NAT_RATE(DROP=COUNT);
RUN;
DATA &DSNAME._NAT_RATE;
SET &DSNAME._NAT_RATE;
WHERE STATUS='I';
&DSNAME._NAT_RATE = PERCENT / 100;
KEEP &DSNAME._NAT_RATE;
RUN;
/* CALCULATE CPI AREA FACTORS
*/
DATA &DSNAME._FACTORS;
SET &DSNAME._NAT_RATE;
DO UNTIL(LAST);
SET &DSNAME._CPI_AREA_RATES END=LAST;
&DSNAME._CPIA_FACTOR =
( (0.75 * &DSNAME._CPI_AREA_RATE) + (0.25 * &DSNAME._NAT_RATE)

);
OUTPUT;
END;
KEEP CPI_AREA &DSNAME._CPIA_FACTOR &DSNAME._CPI_AREA_RATE
&DSNAME._NAT_RATE;
RUN;
%MEND;
%RATES(CEQ);
%RATES(CED);
/* COMPARE THE TWO SURVEY FACTORS IN EACH CPI_AREA. THE LOWER FACTOR WILL */
/* BE USED TO INFLATE THE TARGET SAMPLE SIZES IN THE PSUS TO DETERMINE THE */
/* DESIGNATED SAMPLE SIZES FOR INITIAL SAMPLING. IF CED (DIARY) HAS THE
*/

Appendix 8 Page 3

Appendix 8:
/*
/*
/*
/*

SAS Program to Calculate PSU Inflation Factors

LOWER FACTOR, THEN THE RATIO OF THE CEQ (INTERVIEW) FACTOR TO THE CED
*/
FACTOR WILL BE USED AS A SUBSAMPLING TAKE-EVERY TO REDUCE THE CEQ
*/
DESIGNATED SAMPLE SIZE AFTER INITIAL SAMPLING, AFTER THE TWO SAMPLES */
ARE SPLIT.
*/
PROC SORT DATA = CEQ_FACTORS;
BY CPI_AREA;
PROC SORT DATA = CED_FACTORS;
BY CPI_AREA;
DATA CEFACS;
MERGE CEQ_FACTORS CED_FACTORS;
BY CPI_AREA;
CE_FACTOR = 1 / MIN( CEQ_CPIA_FACTOR, CED_CPIA_FACTOR);
IF CED_CPIA_FACTOR < CEQ_CPIA_FACTOR THEN
CEQ_TE = CEQ_CPIA_FACTOR / CED_CPIA_FACTOR;
ELSE CEQ_TE = 1;
RUN;
/* BECAUSE THERE ARE NO 1990 PSUS CORRESPONDING TO THE Y100 CPI AREA
*/
/* EDIT THE DATA SET TO COPY THE X100 VALUES TO Y100.
*/
PROC SORT DATA=CEFACS;
BY CPI_AREA;
DATA CEFACS;
SET CEFACS;
IF CPI_AREA = 'X100' THEN DO;
OUTPUT;
CPI_AREA='Y100';
OUTPUT;
END;
ELSE OUTPUT;
PROC SORT; BY CPI_AREA;
RUN;
/* VIEW THE PARTICIPATION RATES AND INFLATION FACTORS */
PROC PRINT DATA=CEFACS LABEL NOOBS;
TITLE 'CE REDESIGN 2000';
TITLE2 'PARTICIPATION RATES AND INFLATION FACTORS';
TITLE3 'BY REGION/SIZE CLASS';
VAR CPI_AREA CEQ_CPI_AREA_RATE CEQ_NAT_RATE CEQ_CPIA_FACTOR
CED_CPI_AREA_RATE CED_NAT_RATE CED_CPIA_FACTOR
CE_FACTOR CEQ_TE;
LABEL
CPI_AREA='PSU GROUP'
CEQ_CPI_AREA_RATE='CEQ PARTICIPATION RATE'
CEQ_NAT_RATE='CEQ NATIONAL PARTICIPATION RATE'
CEQ_CPIA_FACTOR='CEQ WEIGHTED AVERAGE RATE'
CED_CPI_AREA_RATE='CED PARTICIPATION RATE'
CED_NAT_RATE='CED NATIONAL PARTICIPATION RATE'
CED_CPIA_FACTOR='CED WEIGHTED AVERAGE RATE'
CE_FACTOR='INFLATION FACTOR USED'
CEQ_TE='CEQ SUMSAMPLING TAKE-EVERY';
RUN;

Appendix 8 Page 4

Appendix 9:

SAS Program to Calculate PSU Designated Sample Sizes

*********************************************************
*
CALCULATE CE DESIGNATED SAMPLE SIZES TO BE USED FOR
*
*
INITIAL SAMPLING. DIVIDE TARGET SAMPLE ALLOCATED *
*
TO EACH PSU BY THE CORRESPONDING CPI AREA FACTOR *
*
CALCULATED FROM CE 1999-2001 RESPONSE RATES.
*
********************************************************;
* Note: The allocation program and the inflation factor program must be run
before this program;
DATA PSU_ALLOCATIONS;
SET PSU_ALLOCATIONS;
IF CPI_AREA='ZALL' THEN CPI_AREA=PUT(BLSPSU2K,$2.)||'00';
PROC SORT DATA=PSU_ALLOCATIONS;
BY CPI_AREA;
RUN;
PROC SORT DATA=CEFACS;
BY CPI_AREA;
RUN;
/* MERGE DATA SETS AND CALCULATE DESIGNATED SAMPLE SIZES */
DATA CE_PSU_DSS;
MERGE PSU_ALLOCATIONS CEFACS;
BY CPI_AREA;
/* MULTIPLY BY 2 BECAUSE TWO SURVEY SAMPLES NEEDED, CEQ AND CED
PSU_DSS = 2 * PSU_ALLOCATION * CE_FACTOR ;

*/

KEEP BLSPSU2K PSU_ALLOCATION CE_FACTOR PSU_DSS;
RUN;
/* DISPLAY PSU DESIGNATED SAMPLE SIZES AND TOTAL DESIGNATED SAMPLE SIZE */
PROC PRINT DATA=CE_PSU_DSS LABEL NOOBS;
TITLE 'CE 2000 REDESIGN';
TITLE2 'DESIGNATED SAMPLE SIZES';
VAR BLSPSU2K PSU_ALLOCATION CE_FACTOR PSU_DSS;
SUM PSU_DSS;
LABEL
BLSPSU2K = 'STRAT PSU'
PSU_ALLOCATION = 'ALLOCATED TARGET SAMPLE SIZE'
CE_FACTOR = 'NON-RESPONSE INFLATION FACTOR'
PSU_DSS = 'DESIGNATED SAMPLE SIZE';
RUN;

Appendix 9 Page 1

Appendix 10:

SAS Program to Calculate PSU Sampling Intervals

*************************************************************************
*
CALCULATE CE WITHIN-PSU SAMPLING INTERVALS. SAMPLING INTERVAL
*
*
WILL BE THE RATIO OF THE PSU MEASURE OF SIZE (2005 PROJECTED # OF
*
*
HOUSING UNITS) TO THE DESIGNATED SAMPLE SIZE.
*
************************************************************************;
LIBNAME CENSUS2K 'T:\COMMON\CE Sampling Intervals\DATA\CENSUS DATA';
* Note: The Allocation, Rates, and Designated Sample Size programs must be
run before this one. ;
/* GET PROJECTED 2005 HOUSING UNIT COUNTS BY COUNTY */
DATA PROJ_HU_CTS;
SET CENSUS2K.Proj_05_hu_counts_by_cty;
RENAME STATE=FIPSST
COUNTY=FIPSCTY
PHU05ACSNU=HU_CT_PROJ;
KEEP STATE COUNTY PHU05ACSNU;
RUN;
/*
/*
/*
/*
/*
/*
/*

MODIFY TO CORRECT FOR PROJECTIONS IN NORTH DAKOTA AND WEST VIRGINIA
WHICH WERE LESS THAN THE CENSUS 2000 COUNTS FOR THOSE STATES, AND
ALSO MODIFY THE DC PROJECTION, WHICH IS DEEMED UNREALISTIC. THE */
NORTH DAKOTA AND WEST VIRGINIA PROJECTIONS WILL BE REPLACED BY
THE CENSUS 2000 COUNTS, AND THE DC PROJECTION WILL BE REPLACED BY
A HOUSING UNIT ESTIMATE OF 268,504 WHICH IS THE ESTIMATE BEING USED
BY CPS AND SIPP FOR DC.
*/

*/
*/
*/
*/
*/

DATA ND_WV_2000_HUS;
SET CENSUS2K.C2KCOUNT;
WHERE FIPSST IN ('38','54');
KEEP FIPSST FIPSCTY CENSUS2000HOUSINGUNITCOUNT;
PROC SORT;
BY FIPSST FIPSCTY;
PROC SORT DATA=PROJ_HU_CTS;
BY FIPSST FIPSCTY;
DATA PROJ_HU_CTS;
MERGE PROJ_HU_CTS(IN=P) ND_WV_2000_HUS(IN=C);
BY FIPSST FIPSCTY;
IF P AND C THEN HU_CT_PROJ = CENSUS2000HOUSINGUNITCOUNT;
IF FIPSST='11' THEN HU_CT_PROJ = 268504;
KEEP FIPSST FIPSCTY HU_CT_PROJ;
RUN;
/* APPEND PROJECTED 2005 HU COUNTS TO CE PSU FILE

*/

PROC SORT DATA=BLS_CE_FILE;
BY FIPSST FIPSCTY;
PROC SORT DATA=PROJ_HU_CTS;
BY FIPSST FIPSCTY;
DATA BLS_CE_FILE;
MERGE BLS_CE_FILE(IN=OK) PROJ_HU_CTS;
BY FIPSST FIPSCTY;

Appendix 10 Page 1

Appendix 10:

SAS Program to Calculate PSU Sampling Intervals

IF OK;
RUN;
/* GET PSU MEASURE OF SIZE

*/

PROC SUMMARY DATA=BLS_CE_FILE NWAY;
CLASS BLSPSU2K;
VAR HU_CT_PROJ;
OUTPUT OUT=PSUHUCTS(KEEP=BLSPSU2K HU_CT_PROJ) SUM=;
RUN;
/* MERGE DATA SETS AND CALCULATE SAMPLING INTERVALS

*/

PROC SORT DATA=PSUHUCTS;
BY BLSPSU2K;
PROC SORT DATA=CE_PSU_DSS;
BY BLSPSU2K;
DATA SAMPINTS;
MERGE PSUHUCTS CE_PSU_DSS;
BY BLSPSU2K;
SAMPINT = HU_CT_PROJ / PSU_DSS ;
RUN;
/* VIEW FINAL DATA SET

*/

PROC PRINT DATA=SAMPINTS LABEL NOOBS;
TITLE 'CE 2000 REDESIGN';
TITLE2 'WITHIN-PSU SAMPLING INTERVALS';
VAR BLSPSU2K HU_CT_PROJ PSU_DSS SAMPINT;
FORMAT SAMPINT COMMA14.4;
LABEL
BLSPSU2K = 'STRAT PSU'
HU_CT_PROJ = 'PROJECTED 2005 HU COUNT'
PSU_DSS = 'DESIGNATED SAMPLE SIZE'
SAMPINT = 'PSU SAMPLING INTERVAL';
RUN;

Appendix 10 Page 2

Appendix 10:

SAS Program to Calculate PSU Sampling Intervals

*************************************************************************
*
PROJECT 2005 PERMIT COUNTS BY COUNTY BASED ON FILES FROM MCD WHICH *
*
DSMD RECEIVED FOR THE YEARS 1997 THROUGH 2001 AND USED TO BUILD *
*
THEIR 1990-BASED DESIGN PERMIT DATA UNIVERSE FOR NEW CONSTRUCTION
*
*
SAMPLING. FOR EACH COUNTY, THE PROJECTION WILL BE THE COUNT VALUE *
*
OF THE POINT ON THE LEAST SQUARES REGRESSION LINE CORRESPONDING TO *
*
THE YEAR 2005.
*
************************************************************************;
* Note: The CE Sampling Interval Programs must be run before this one.;
DATA PROJECTED2005PERMITS;
SET census2k.PERMITBYCTY;
ARRAY YR_[5];
ARRAY CT[5] COUNT1997-COUNT2001;
ARRAY RESIDUAL[5];
RETAIN YR_1-YR_5 (1997 1998 1999 2000 2001);
YR_SUM = SUM(OF YR_[*]);
CTSUM = SUM(OF CT[*]);
YR_SQSUM = 0; YR_CTSUM = 0;
DO I = 1 TO 5;
YR_SQSUM + YR_[I]**2;
YR_CTSUM + (YR_[I]*CT[I]);
END;
SLOPE = ( (5 * YR_CTSUM) - (YR_SUM * CTSUM ) )
/
( ( 5 * YR_SQSUM ) - YR_SUM**2);
INTERCEPT = ( CTSUM - ( SLOPE * YR_SUM ) )
/
5;
DO I = 1 TO 5;
RESIDUAL[I] = ABS( CT[I] - ( (SLOPE * YR_[I]) + INTERCEPT) );
END;
PROJECTED2005COUNT = CEIL((2005 * SLOPE) + INTERCEPT);
IF PROJECTED2005COUNT > 0 THEN
RESIDUALRATIO = MEAN(OF RESIDUAL[*]) / PROJECTED2005COUNT ;
ELSE RESIDUALRATIO = 2;
IF SLOPE < 0 OR RESIDUALRATIO > 1 THEN DO;
ORIGINAL_PROJECTION = PROJECTED2005COUNT;
PROJECTED2005COUNT = CEIL(MEAN(OF CT[*]));
MEAN_USED = 1;
END;
ELSE MEAN_USED=0;
/* RECODE FIPS COUNTY FOR MIAMI-DADE, FLORIDA */
IF FIPSST = '12' AND FIPSCTY='025' THEN FIPSCTY='086';
OUTPUT;
RUN;
*********************************************************
*
SUBSET CPI COUNTIES AND GET SUM ACROSS ALL CPI PSUS
*
********************************************************;

Appendix 10 Page 1

Appendix 10:

SAS Program to Calculate PSU Sampling Intervals

DATA CPICTYS;
INFILE
'T:\COMMON\CE Sampling Intervals\DATA\BLSFILES\CENSOUT2000CPI.TXT'
MISSOVER;
INPUT @3 FIPSST $2. @6 FIPSCTY $3.;
KEEP FIPSST FIPSCTY;
PROC SORT; BY FIPSST FIPSCTY;
PROC SORT DATA=PROJECTED2005PERMITS;
BY FIPSST FIPSCTY;
DATA CPIPMTCTS;
MERGE CPICTYS(IN=CPI) PROJECTED2005PERMITS;
BY FIPSST FIPSCTY;
IF CPI;
PROC SUMMARY DATA=CPIPMTCTS;
OUTPUT OUT=CPIPMTSUM(KEEP=NAT2005PP) SUM=NAT2005PP;
VAR PROJECTED2005COUNT;
RUN;
DATA CPI_SAMPINT;
SET CPIPMTSUM;
/* SAMPLING INTERVAL IS
(PROJECTED NUMBER OF PERMITS IN 2005 IN CPI-U SAMPLE COUNTIES)

X

4
-------------------------------------------------------------1440
BECAUSE ANNUAL SAMPLE SHOULD BE 1440 PERMIT ADDRESSES, AND WE EXPECT A
CLUSTER OF
4 ADDRESSES FOR EACH HIT
*/
CPISAMPINT = (NAT2005PP / 1440) * 4;
RUN;

OPTIONS NODATE NONUMBER NOCENTER LS=97 PS=51;
*********************************************
*
DISPLAY THE NATIONAL SAMPLING INTERVAL *
********************************************;
PROC PRINT DATA=CPI_SAMPINT NOOBS LABEL;
TITLE 'THE CPI PERMIT NEW CONSTRUCTION HOUSING SAMPLE';
TITLE2 'NATIONAL SAMPLING INTERVAL FOR THE CENSUS-2000 BASED DESIGN';
VAR CPISAMPINT;
LABEL CPISAMPINT='NATIONAL SAMPLING INTERVAL';
FORMAT CPISAMPINT COMMA10.4;
RUN;
*********************************************************************
*
LIST HISTORICAL COUNTS AND 2005 PROJECTIONS FOR CPI COUNTIES *

Appendix 10 Page 2

Appendix 10:

SAS Program to Calculate PSU Sampling Intervals

********************************************************************;
PROC PRINT DATA=CPIPMTCTS(KEEP=COUNT1997-COUNT2001 PROJECTED2005COUNT FIPSST
FIPSCTY) N LABEL;
TITLE 'PROJECTIONS OF PERMIT COUNTS';
TITLE2 'IN COUNTIES SELECTED FOR THE CENSUS 2000-BASED CPI SAMPLE DESIGN';
TITLE3 'BASED ON PERMIT COUNTS FROM THE YEARS 1997-2001';
ID FIPSST FIPSCTY;
VAR COUNT1997-COUNT2001 PROJECTED2005COUNT;
LABEL
FIPSST = 'FIPS STATE'
FIPSCTY = 'FIPS COUNTY'
COUNT1997 = '1997 COUNT'
COUNT1998 = '1998 COUNT'
COUNT1999 = '1999 COUNT'
COUNT2000 = '2000 COUNT'
COUNT2001 = '2001 COUNT'
PROJECTED2005COUNT = 'PROJECTED 2005 COUNT';
SUM _NUMERIC_;
FORMAT _NUMERIC_ COMMA10.0;
RUN;

Appendix 10 Page 3


File Typeapplication/pdf
AuthorPadraic Murphy
File Modified2009-08-10
File Created2009-08-10

© 2024 OMB.report | Privacy Policy