Download:
pdf |
pdfUsing OES Data in Federal Pay Comparability: A
Regression-Based Approach
March 2009
A report of the Wage Estimates for Federal Pay Comparability Team
Team Members
Matt Dey
Maury Gittleman
Mike Lettau
Steve Miller
Introduction
Under the Federal Employees Pay Comparability Act (FEPCA), the President’s Pay
Agent must use salary surveys conducted by Bureau of Labor Statistics (BLS) to set
locality pay. At present, the sole survey that is used in this endeavor is the National
Compensation Survey (NCS). Given the possibility that future budget shortfalls may
lead to cuts in the wage sample of the NCS, a team was formed to investigate whether
one can use data from the Occupational Employment Statistics (OES) program instead of
or in addition to data from the NCS.
To this end, the team proposed two alternative estimators – one called an interval method
and the other a regression method -- that combine NCS and OES data in different ways.
In an internal research paper, we compare and contrast the results under these two
methods to the current approach that is used to generate pay gaps.1 Four main themes
emerged from this analysis. First, we demonstrate that it is, indeed, feasible to use OES
data for pay comparability, but the OES data cannot be used by itself; it must be
combined with NCS data. Second, the proposed methods both appear to be capable of
estimating reasonable-looking pay gaps with greater precision than does the current
approach. Third, the proposed methods are more robust to cuts in the NCS sample,
assuming the OES sample sizes would remain constant, than is the current approach.
Fourth, the proposed methods can both be used to extend the estimation of pay gaps to
areas that are not present in the NCS sample.
After careful investigation of the advantages and disadvantages of the two proposed
methods, the team believes that the regression method is clearly better suited to produce
the non-Federal salary estimates required to calculate area pay gaps.
The remainder of the report proceeds as follows. In the next section, we describe the
current NCS-based approach to Federal pay comparability. After describing the potential
benefits of incorporating OES data, we present the details of the regression method
combining both OES and NCS data. Following that, we turn to the actual numbers and
present our analysis of the current method versus the proposed approach. The final
section provides a restatement of our main themes and some indication of where we think
further investigation may be warranted.
Current Approach Using NCS Data Only
For the 2008 Pay Agent's Report to the President on Locality Pay for 2010, the Pay Agent
was interested in non-Federal pay levels and non-Federal/Federal pay gaps for 31 areas,
including one residual category for the Rest of the United States (RUS).2 BLS provided,
for each of these areas, non-Federal salary estimates for up to 15 grades for the five
PATCO job families (professional, administrative, technical, clerical and officer),
1
This document is available upon request.
If one includes Raleigh, the number of pay agent areas is actually 32, but the NCS sample did not contain
sufficient data for this locality for estimates to be produced.
2
1
resulting in 67 estimates per area.3 Using local Federal employment weights by grade
and PATCO, the Office of Personnel Management (OPM) then averaged the estimates
provided by BLS to arrive at a single number for non-Federal pay in each area.4 These
numbers were then compared to the average salary for Federal white-collar employees in
each locality, and a non-Federal/Federal pay gap was then calculated.
To help evaluate the proposed methods that make use of OES data, it is useful to take a
few steps back, to see how NCS data are used to arrive at the 67 estimates per area. For
the purposes of the latest Pay Agent’s report, there were nearly 5,000 different jobs held
by Federal white-collar employees, where jobs are defined by GS series, grade and
whether the job is supervisory or not. Because jobs in the NCS are not defined in the
same manner, a crosswalk was created that maps these 5,000 jobs to jobs defined by sixdigit Standard Occupational Classification (SOC) code and grade. As many Federal jobs
map to the same SOC code-grade combination, the creation of the crosswalk resulted in
the identification of about 2,000 unique SOC code-grade pairs. The current methodology
asks BLS to come up with an estimate of average pay in each of these jobs in each area,
and then to use national Federal employment weights to arrive at the 67 estimates for
each locality. Multiplying the number of jobs by the number of localities reveals that
more than 60 thousand estimates were required.
To compute these estimates, BLS made use of the data in the 2007 NCS sample.5 To get
an idea of the strain that these calculations place on the NCS data, it may be worth noting
that, taking all the localities together, there were approximately 23,000 establishments
that responded to the NCS that year. Each establishment provided information on
anywhere from one to 32 different jobs, with most reporting data for eight or fewer jobs.
In order for the NCS sample to better correspond to jobs that are relevant to pay
comparisons for the Federal white-collar workforce, a number of restrictions were then
applied to the resulting samples for each locality. Only those jobs that are full-time, have
a grade attached, and have valid wage information were included. The sample was then
further limited to jobs in the crosswalk file provided by OPM and to jobs that would not
be classified above GS-15 in the Federal Government. After applying these restrictions,
a national sample of 41,250 jobs remained. Put differently, the number of jobs in this
sample was less than the number of job-average estimates needed for the calculations.
The estimates were computed in two steps, one involving direct estimates at the level of
the locality, the other involving indirect estimates generated by a national regression
model. If data meeting publication criteria were available to calculate average pay in an
SOC code-grade combination in a given area, a direct estimate was computed. In cases
where a direct estimate was not available, a regression model, which will be described
below, was used to estimate average pay in area-SOC code-grade cells.6 Then, using
3
There is no federal employment for eight of the grade-PATCO job family combinations.
OPM aged the data by area, so that the reference date became March 2008. Unless otherwise noted, the
estimates presented in this report have this reference date as well.
5
Establishments located in Alaska and Hawaii were excluded, as the Pay Agent’s interest was limited to
jobs in the contiguous United States.
6
Because 25 occupations did not appear in the national sample used for the regression model, indirect
estimates could not be produced for 118 cells in each area.
4
2
national Federal employment weights, the direct and modeled estimates were averaged up
to job family and grade, to produce the 67 numbers needed for each area.
It is worth stressing that, as a result of the relatively small size of the usable NCS sample
and the fact that there was Federal white-collar employment in numerous SOC codegrade combinations that are unlikely to occur in the non-Federal sector, the majority of
the estimates, as weighted by national Federal employment, come from the model. In the
past year, the percentage modeled ranged from 33 percent for RUS to 86 percent for
Milwaukee. But this range is misleading because the percentage for RUS is an outlier, as
it is the only case where the majority of estimates was not modeled. Three areas had
between 50 and 60 percent of the estimates modeled, 9 were between 60 and 70 percent,
11 were between 70 and 80 percent, and the remaining 7 were above 80 percent.
As a point of comparison with the approach using OES data that will be discussed
subsequently, it may be useful to briefly discuss the regression model that is currently
used to produce the indirect estimates.
The regression model is of the following form:
[1]
A−1
O −1
a =1
o =1
ln Wm = α + ∑ β a ⋅ AREAma + ∑ χ o ⋅ OCCUPmo + δ ⋅ XGRADE m + φ ⋅ XGRADESQm + ε m
where ln Wm is the natural log of the average hourly wage rate of the mth job, AREA is a
vector of dummy variables indicating locality, OCCUP is a vector of dummy variables
for occupation as defined by six-digit SOC code (XX-YYYY), XGRADE is a
transformation of grade7, and XGRADESQ is the square of XGRADE. β, χ , δ, and φ are
the corresponding coefficients, α is a constant term, and ε is the error term. Areas are
indexed by a and are numbered from one to A, occupations are indexed by o and are
numbered from one to O.
The functional form of the model, which was chosen in line with OPM’s preferences,
embodies some strong assumptions. First, the three components of the model – area,
occupation and grade (xgrade) – enter directly without any interactions. Put differently,
estimated area pay differentials will be the same regardless of occupation and grade
combination, occupational pay differentials will not vary by area and grade, and the
returns to grade will be the same regardless of area and occupation. Second, returns to
grade are assumed to be quadratic in xgrade.
Clearly, it is possible to come up with examples where one would not expect the first set
of restrictions to hold, owing to differences across local labor markets. Past research
suggests, however, that owing to the relatively small size of the sample that is used for
modeling, any reduction in bias from having a less restrictive model with more
7
xgrade is the same as grade if grade is less than 12. For grade 12, xgrade=13, for grade 13, xgrade=15,
for grade 14, xgrade=17 and for grade 15, xgrade=19.
3
parameters would be offset by increases in the variance of the estimates. As an alternative
to the assumption about the functional form of grade (xgrade), one can imagine using a
set of dummy variables for each grade. While such an approach has the advantage of
allowing a freer estimation of returns to grade, it has the disadvantage of increasing the
number of parameters that need to be estimated with a small sample, as well as
necessitating the estimate of a parameter for grades that are not well represented in the
regression sample. Nonetheless, even if these assumptions are reasonable for a model
estimated only on NCS data, starting afresh with OES data, one would not start out by
imposing these restrictions.
Potential Benefits of Incorporating OES Data
Before proceeding to the details of the proposed method that combines both NCS and
OES data, we will quickly discuss why incorporating OES data into the process to
produce wage estimates for the President’s Pay Agent potentially provides important
benefits.
First, pinning down area-SOC code mean wages with the OES data frees up the NCS data
to allow a richer specification of the grade level effects. As discussed above, the current
regression has a constant term (1 parameter), occupation dummy variables (258
parameters), area dummy variables (31 parameters), and a quadratic grade level
specification (2 parameters). Therefore, the regression has a total of 292 parameters, but
only two of these parameters determine the behavior of mean wages across grades. By
pinning down a majority of these parameters with the OES data, we could allow a more
robust specification of how grade affects wages.
Second, since the OES sample is much larger than the NCS sample, we would expect,
overall, efficiency gains in the estimates of mean wages by occupation and area. In some
cases, because the current OPM model borrows strength by pooling data across
occupations and areas and because a direct estimate of mean wages from the OES may be
based on a small sample, there may not be gains in precision. But, at the very least, one
could run a similar (to the current OPM model) log wage regression with only main
effects on the OES data and expect to obtain more precise estimates. Moreover, since the
OES sample is larger, we could perhaps allow a richer specification of how area and
occupation interact in the wage process (i.e., we could relax the strong assumption that,
regardless of occupation, an area will always be high wage and another area will always
be low wage).
Third, since the OES samples establishments in all metropolitan areas of the country, we
may be able to extrapolate the level effects from the NCS to the “unsampled” areas to
generate Federal pay gaps for all metropolitan areas. Of course, this requires that the
estimated grade effects do not vary across (detailed) areas and that we are comfortable in
the extrapolation, but if we have sufficient confidence in the robustness of these
estimates, the incorporation of the OES potentially increases the number of localities for
which estimates can be provided to the Pay Agent.
4
Finally, with the possibility of a sample cut to the NCS, the question of whether OES can
provide some support for the Pay Agent estimates seems quite natural. If the NCS
sample is reduced to the index only portion of the sample, for example, can we maintain
the quality of the Pay Agent products by augmenting the remaining NCS data with the
OES data?
The Regression Method: An Estimation Method that Combines OES and NCS data
This method is somewhat similar to the procedure used currently when the sample size
from the NCS is not sufficient to provide a direct estimate of the average wage rate for an
area-SOC code-grade combination. As noted, in this case, the current procedure uses the
prediction from a log-wage regression with main effects for area, occupation, and grade,
but with no interactions among them. The proposed regression method uses a regression
equation with an additive effect on job grade on the expected log wage rate. However,
the proposed method applies this grade effect to the average wage rate by area and
occupation from the OES data, instead of relying exclusively on data from the NCS.
Define lnWi as the natural log of the hourly wage rate for the ith individual in the NCS
sample8, and define lnWoaOES as the natural log of the average wage rate for occupation o
in area a from the OES sample. The regression method uses the following equation:
lnWi − lnWoaOES
,i =
[2]
1
G
G
15
g =1
l =1
θ + ∑ ∑ κ gv ⋅ Leveled v ,i ⋅ Group g ,i + ∑ λ g ⋅ FTi ⋅ Group g ,i + ∑ π l ⋅ Levell ,i + ν i
v = 0 g =1
where Leveled v ,i is a vector of indicator variables for whether the ith individual is in a
job that is leveled or not, FTi is an indicator variable for whether the individual’s job is
full-time, Group g ,i is a vector of expected grade level group indicator variables, and
Levell ,i is a vector of grade indicator variables.9 The term ν i is the residual for the
regression equation.
The estimate for the average full-time wage rate for grade l of occupation o in area a is
then equal to the following.
[3]
Wˆ oal = WoaOES × ∆ ol
8
The sample used for the regression method is significantly bigger than that used under the current
approach, as it includes those who work full-time, those in an SOC code that is in the crosswalk but in a
grade that is not, and those with missing grade information.
9
The expected grade level groups are roughly defined by the midpoint of the expected minimum and
maximum grade levels defined by the NCS, plus a separate category for nurses.
5
where
(
∆ ol = ∆−g1(o ) × exp θˆ + κˆ g (o ),1 + λˆg (o ) + πˆ l
∆ g (o ) =
)
⎛
⎞
∑ ωi × exp⎜θˆ + ∑ κˆ g (o )v ⋅ Leveled v,i + λˆg (o ) ⋅ FTi + ∑ πˆ l ⋅ Levell ,i ⎟
i∈g ( o )
⎝
1
15
v =0
l =1
∑ ωi
⎠
i∈g ( o )
and ω i represents the NCS weight of the ith individual. The term ∆ g (o ) is an adjustment
to ensure that the detailed mean wages for the area-SOC code-grade-hours combinations
add up to the overall wage rate for occupation o in area a from the OES sample.
The regression equation includes expected grade group indicator variables as an
alternative to estimating the coefficients of the regression separately for each expected
grade group. This is done for two key reasons. First, some area-SOC code-grade
combinations of interest to the Pay Agent would not be covered if we estimated the grade
effects separately for each group. For example, the Pay Agent wants a wage estimate for
grade 9 security guards. According to the NCS, the expected grade range for security
guards is 1 to 5, so the Pay Agent is asking for data that the NCS expects there to be little
chance of getting. For the security guard example, there are only a handful of
observations in the group that contains security guards that are in grade 9.
Second, if we do not somehow correct for the expected grade group that an occupation
falls into, we are comparing apples and oranges. For example, security guards who are
paid close to the average wage are likely to be in grade 3 (security guards are expected to
be in grades 1 to 5), while paralegals who are paid close to the average wage are likely to
be in grade 7 (they are expected to be in grades 5 to 9). By correcting for expected grade
group in the regression we are able to shift the wage differentials by group for each
grade, while nevertheless being able to produce the wage estimates that the Pay Agent
desires.
Analysis
Our analysis can be roughly divided into three main subsections, with corresponding
tables. The first subsection addresses the feasibility of the proposed approach and how
the results of the current method versus proposed method compare and contrast. In the
second subsection, we turn our analysis of the impact of a reduction in the NCS sample
from the current, full sample to the current, index sample. Finally, we explore the issue
of whether having the broader geographic coverage of the OES can be used to extend pay
gap calculations to areas not in the NCS.
Table 1 (Non-Federal Salaries) & Table 2 (Pay Gaps)
6
Tables 1 and 2 present very similar information, except that Table 1 is presented in terms
of estimated non-Federal salaries for the Pay Agent areas, while Table 2 provides
information in terms of pay gaps. The two tables provide an immediate answer to the
question of whether the OES data alone can be used to calculate non-Federal salaries and
pay gaps. A comparison of the results in the columns for “Current Method” with those
from the columns for “OES Only” indicates that the answer is “no”, at least if an
important criterion is that the results should be similar to those under the present
approach. Using the OES data alone results in much lower estimates of non-Federal
salaries and correspondingly narrower pay gaps. The stark differences do not seem to be
the result of fundamental differences between the OES and NCS, but rather to the fact
that grade information is not present in the OES. When grade information in the NCS is
ignored and calculations are made in a manner similar to what one does in using the OES
alone, the estimates of non-Federal salaries and pay gaps are again much lower than
under the current method, as shown in the column labeled “NCS with No Grade
Information”.
The remaining columns of the two tables provide means, standard errors, and relative
standard errors (RSEs) for estimates from an NCS-only approach that relies exclusively
on modeled data (“OPM Model”) and the regression method. Ideally, we would have
standard errors and RSEs from the current method, but these are not currently produced.
Owing to the complex way the current approach combines local direct estimates and
national indirect estimates, it was deemed too difficult to use this approach in our current
analysis, not least because it would have required the development of a methodology to
compute standard errors. Instead, much of the analysis revolves around comparisons
between the OPM model and the regression method. As can be seen from a comparison
of the means from the OPM model with those from the current method, these two NCSbased approaches are very similar in terms of estimates of non-Federal salaries and of pay
gaps.10 This closeness should not be surprising given the widespread use of the modeled
estimates noted above. Thus, any estimates for the OPM Model are likely to be very
similar to what would have been generated by the current method, could that have been
replicated for this project.
How does the attempt to combine the two datasets fare? A glance at the tables suggests
that, unlike the case with OES data alone, the estimates from the NCS-OES regression
method are in the same ballpark as those from the current method and OPM model. The
unweighted correlation between the non-Federal salaries estimated by the OPM model
and the regression method is 0.97, indicating consistency across the methods in terms of
which areas are high-paying and which are low-paying. Though the corresponding
correlation for the pay gaps themselves is lower (0.90), the pay gaps are also fairly close
between the two methods. With each area given equal weight, the average of the pay gap
over the areas is 45.5 percent for the OPM model, while it is 49.9 percent for the
regression method.
10
The unweighted correlation for non-Federal salaries is 0.99 and that for the pay gaps is 0.97. The
average pay gap under the current method is 46.1 percent versus 45.5 percent for the OPM model.
7
A pictorial look at how close the regression method is to the current method in terms of
the ranking of pay gaps is provided in Figure1. If the pay gaps had exactly the same
ordering under both methods, all localities would be on the 45 degree line. That is clearly
not the case, but the Figure does demonstrate a high degree of similarity between the two
sets of rankings.
How does the regression method fare in terms of the precision of its estimates? Though
the standard errors and RSEs from this approach should be viewed as lower bounds
because in calculating them the OES data were treated as fixed, this approach appears to
have greater precision than the OPM model. On average, the RSEs for the pay gaps from
the OPM model are about 177 percent higher than those from the regression method.
Thus, it appears that using the OES data to pin down occupation-area mean wages does,
in fact, play a useful role in improving the efficiency of the estimates.11
Table 3 (Non-Federal Salaries) & Table 4 (Pay Gaps)
Next, we turn to our analysis of the impact of a reduction in the NCS sample from the
current, full sample to the current, index sample, which cuts the sample roughly in half.
Obviously, this exercise is meant only to gauge how sensitive the various estimators are
to a significant reduction in the size of the NCS sample and should in no way be
interpreted as a recommendation about the scope and manner an NCS sample reduction
would or should take.
Not surprisingly, the sample cut has a bigger impact on the estimates that rely exclusively
on the NCS data, though, even here, the impact is not huge. For the OPM model, the
estimates from the index sample are highly correlated with those from the full sample
(0.97 for the non-Federal salaries and 0.90 for the pay gaps), but there is a tendency for
the non-Federal salaries and pay gaps to be somewhat higher for the reduced sample.
The standard errors do go up substantially, moreover, suggesting that, with a smaller
sample, estimates will bounce around more from year to year.
For the regression method, it is striking how little the estimates change (the correlations
are nearly perfect for both the non-Federal salaries and the pay gaps). In no case is the
absolute value of the change in non-Federal salary estimates greater than 1 percent, and
the largest change in pay gap is 1.23 percentage points. Unavoidably, the precision of the
estimates is reduced by the sample cut, though it still compares favorably with the
precision of the estimates from the OPM model using the full sample.
Table 5 & Table 6
11
It may be worth noting that the greater precision for the methods that include the OES data does not
appear to be attributable to the fact that these methods use a greater portion of the NCS sample – unlike the
OPM model, they include part-timers, those with missing grades, and those with SOC code-grade
combinations where the occupation was in the crosswalk, but the grade does not. Estimation of an OPMstyle model that included the larger sample did not lead to lower standard errors.
8
Finally, in this third subsection, we turn to the question of whether the greater geographic
coverage of the OES can lead to an expansion of the number of Pay Agent areas that can
be considered. As a first exercise, summarized in Table 5, we construct new estimates of
non-Federal salaries and pay gaps for each Pay Agent area after eliminating NCS data
from that particular area. This exercise allows us to assess how sensitive the proposed
estimators are to the absence of area-specific NCS data.12 It is striking to note how little
estimates from the regression method change when the area-specific NCS data are
removed.
A second exercise is to estimate non-Federal salaries and pay gaps for nine areas that
wish to be considered Pay Agent areas, but either are not sampled by the NCS or do not
presently have a sample of sufficient size for Federal pay comparability. Under the
current approach, these areas would be included in the “Rest of the US” and hence would
be assigned RUS’s pay gap. In Table 6, therefore, we compare the estimated non-Federal
salaries and pay gaps to those from RUS to determine whether things would change much
for these localities if the gaps were determined for them specifically.
Conclusions
The above analysis yields four main findings. First, it does seem to be feasible to use the
NCS and OES in combination for the purposes of pay comparability. Second, the
regression method does appear to be capable of estimating reasonable-looking pay gaps
with greater precision than does the current approach. Third, it seems that one can use
the OES to buffer any future cuts in the NCS sample, as the regression method is fairly
robust to a large sample cut. Fourth, the regression method can be used to extend the
estimation of pay gaps to areas that are not present in the NCS sample.
While we are confident in drawing these conclusions from our analysis, there are a
number of additional issues we are planning on examining in the near future or have
identified as possible areas of future research. Most importantly, we are planning on
repeating our analysis using data from 2006 and 2008 in order to determine whether the
comparisons between the regression method and the current approach are fairly robust
across time. We anticipate this work will begin in late spring when the 2008 data
becomes available.
In addition, while unlikely to substantially change the comparisons of the precision of the
two methods, it must be noted that we assumed that the OES data were fixed when
calculating the precision of the regression method estimates. While published OES
estimates generally suggest rather precise estimates of mean wages, the precision of the
regression method estimates should nevertheless account for variation in OES mean
wages. At this point, we have no definitive plans for incorporating this factor.
Finally, further research might also yield sharper insights into the reasons why the
regression method estimates somewhat higher pay gaps than does the current approach.
12
The OPM model could not be used in this exercise as it requires area-specific data.
9
In analysis not shown here, we decomposed the gap between estimates from the
regression method and those from the OPM model into a portion attributable to the
difference between the NCS SOC code-area means and their OES-based counterparts and
a portion due to differences in how pay estimates rise with grade, termed level effects.
The first factor seems to be more important in explaining the difference, as the level
effects are fairly similar. Understanding why the two surveys generate different SOC
code-area means would be valuable, though it is not obvious that further investigation
will definitely be fruitful. The NCS-OES Wage Comparisons team spent considerable
time making comparisons across the two surveys, but found it difficult to come up with
systematic explanations for the differences noted.
10
Table 1. Estimated Non-Federal Salaries for Current OPM Areas
Area
Atlanta
Boston
Buffalo
Chicago
Cincinnati
Cleveland
Columbus
Dallas
Dayton
Denver
Detroit
Hartford
Houston
Huntsville
Indianapolis
Los Angeles
Miami
Milwaukee
Minneapolis
New York
Philadelphia
Phoenix
Pittsburgh
Portland
Rest of U.S.
Richmond
Sacramento
San Diego
San Francisco
Seattle
Washington, DC
Federal
Salary
$63,444
$59,238
$53,669
$60,985
$52,254
$58,802
$51,760
$60,354
$56,528
$63,680
$57,518
$55,337
$64,558
$70,566
$50,690
$55,156
$55,490
$53,259
$53,008
$57,666
$55,923
$53,989
$55,390
$61,544
$51,746
$53,542
$58,197
$52,815
$59,990
$56,343
$72,883
Current
Method
$94,031
$92,703
$75,783
$92,029
$69,477
$82,793
$72,314
$89,963
$74,960
$91,563
$84,390
$85,791
$95,829
$98,335
$68,165
$84,731
$80,461
$73,568
$77,152
$91,638
$80,894
$77,600
$76,473
$90,046
$70,025
$71,655
$86,905
$81,216
$100,160
$85,316
$120,578
NCS with No
Grade
Information
$58,827
$59,667
$49,276
$57,511
$44,295
$52,864
$49,544
$56,171
$45,708
$59,358
$57,283
$57,914
$58,605
$58,448
$46,065
$60,492
$51,866
$49,218
$54,483
$63,011
$54,699
$52,951
$47,700
$56,173
$45,103
$49,916
$59,710
$58,328
$69,936
$60,148
$66,813
OPM Model
OES only
$59,966
$65,669
$53,663
$64,233
$51,066
$55,978
$52,248
$59,924
$53,526
$66,553
$64,041
$60,928
$65,261
$65,388
$49,370
$63,454
$56,333
$54,481
$56,502
$66,599
$60,229
$53,460
$52,109
$58,743
$48,595
$56,030
$60,164
$59,606
$73,074
$61,972
$76,716
Mean
$95,057
$91,429
$75,379
$91,067
$68,745
$81,917
$72,495
$88,098
$74,556
$91,727
$84,937
$86,337
$93,711
$98,215
$68,676
$85,428
$78,376
$73,487
$77,205
$94,322
$80,574
$77,420
$76,033
$88,905
$65,502
$71,210
$87,156
$80,142
$99,941
$87,353
$119,469
Standard
Error
$1,471
$1,793
$1,542
$2,655
$1,299
$1,455
$2,132
$1,082
$596
$1,678
$1,069
$2,079
$1,756
$2,343
$1,884
$1,829
$2,223
$1,805
$1,200
$1,726
$1,720
$1,143
$1,156
$1,347
$773
$1,214
$1,163
$2,078
$1,472
$1,282
$2,740
Regression Method
RSE
1.5
2.0
2.0
2.9
1.9
1.8
2.9
1.2
0.8
1.8
1.3
2.4
1.9
2.4
2.7
2.1
2.8
2.5
1.6
1.8
2.1
1.5
1.5
1.5
1.2
1.7
1.3
2.6
1.5
1.5
2.3
Mean
$92,229
$96,091
$75,175
$96,392
$72,366
$82,635
$71,755
$89,567
$78,096
$101,173
$92,410
$86,028
$99,231
$102,254
$67,748
$90,335
$80,670
$76,251
$79,052
$96,310
$85,459
$75,080
$74,414
$87,764
$66,989
$77,644
$87,152
$82,816
$108,118
$89,432
$126,579
Standard
Error
RSE
$983
1.1
$723
0.8
$387
0.5
$883
0.9
$497
0.7
$677
0.8
$296
0.4
$791
0.9
$557
0.7
$1,000 1.0
$657
0.7
$507
0.6
$1,160 1.2
$1,229 1.2
$345
0.5
$530
0.6
$565
0.7
$457
0.6
$491
0.6
$721
0.7
$547
0.6
$435
0.6
$501
0.7
$764
0.9
$299
0.4
$367
0.5
$581
0.7
$374
0.5
$926
0.9
$505
0.6
$1,990 1.6
Table 2. Estimated Pay Gaps for Current OPM Areas
Area
Atlanta
Boston
Buffalo
Chicago
Cincinnati
Cleveland
Columbus
Dallas
Dayton
Denver
Detroit
Hartford
Houston
Huntsville
Indianapolis
Los Angeles
Miami
Milwaukee
Minneapolis
New York
Philadelphia
Phoenix
Pittsburgh
Portland
Rest of U.S.
Richmond
Sacramento
San Diego
San Francisco
Seattle
Washington, DC
NCS with No
Current
Grade
Method Information OES only
48.21
-7.28
-5.48
56.49
0.72
10.86
41.20
-8.19
-0.01
50.90
-5.70
5.33
32.96
-15.23
-2.27
40.80
-10.10
-4.80
39.71
-4.28
0.94
49.06
-6.93
-0.71
32.61
-19.14
-5.31
43.78
-6.79
4.51
46.72
-0.41
11.34
55.03
4.66
10.10
48.44
-9.22
1.09
39.35
-17.17
-7.34
34.47
-9.13
-2.60
53.62
9.67
15.04
45.00
-6.53
1.52
38.13
-7.59
2.29
45.55
2.78
6.59
58.90
9.26
15.49
44.65
-2.19
7.70
43.73
-1.92
-0.98
38.06
-13.88
-5.92
46.30
-8.74
-4.55
35.32
-12.84
-6.09
33.83
-6.77
4.65
49.33
2.60
3.38
53.78
10.44
12.86
66.96
16.58
21.81
51.42
6.75
9.99
65.44
-8.33
5.26
OPM Model
Mean
49.83
54.34
40.45
49.33
31.56
39.31
40.06
45.97
31.89
44.04
47.67
56.02
45.16
39.18
35.48
54.88
41.24
37.98
45.65
63.56
44.08
43.40
37.27
44.45
26.58
33.00
49.76
51.74
66.60
55.04
63.92
Standard
Error
RSE
2.32
4.7
3.03
5.6
2.87
7.1
4.35
8.8
2.49
7.9
2.47
6.3
4.12
10.3
1.79
3.9
1.05
3.3
2.64
6.0
1.86
3.9
3.76
6.7
2.72
6.0
3.32
8.5
3.72
10.5
3.32
6.0
4.01
9.7
3.39
8.9
2.26
5.0
2.99
4.7
3.07
7.0
2.12
4.9
2.09
5.6
2.19
4.9
1.49
5.6
2.27
6.9
2.00
4.0
3.93
7.6
2.45
3.7
2.28
4.1
3.76
5.9
Regression Method
Mean
45.37
62.21
40.07
58.06
38.49
40.53
38.63
48.40
38.16
58.88
60.66
55.46
53.71
44.91
33.65
63.78
45.38
43.17
49.13
67.01
52.81
39.07
34.35
42.60
29.46
45.02
49.75
56.80
80.23
58.73
73.68
Standard
Error
1.55
1.22
0.72
1.45
0.95
1.15
0.57
1.31
0.99
1.57
1.14
0.92
1.79
1.74
0.68
0.96
1.02
0.86
0.93
1.25
0.98
0.81
0.91
1.24
0.58
0.68
1.00
0.71
1.54
0.89
2.73
RSE
3.4
2.0
1.8
2.5
2.5
2.8
1.5
2.7
2.6
2.7
1.9
1.7
3.3
3.9
2.0
1.5
2.2
2.0
1.9
1.9
1.9
2.1
2.6
2.9
2.0
1.5
2.0
1.2
1.9
1.5
3.7
Table 3. The Effect of an NCS Sample Cut on Non-Federal Salary Estimates
OPM Model
Full Sample
Area
Atlanta
Boston
Buffalo
Chicago
Cincinnati
Cleveland
Columbus
Dallas
Dayton
Denver
Detroit
Hartford
Houston
Huntsville
Indianapolis
Los Angeles
Miami
Milwaukee
Minneapolis
New York
Philadelphia
Phoenix
Pittsburgh
Portland
Rest of U.S.
Richmond
Sacramento
San Diego
San Francisco
Seattle
Washington, DC
Mean
$95,057
$91,429
$75,379
$91,067
$68,745
$81,917
$72,495
$88,098
$74,556
$91,727
$84,937
$86,337
$93,711
$98,215
$68,676
$85,428
$78,376
$73,487
$77,205
$94,322
$80,574
$77,420
$76,033
$88,905
$65,502
$71,210
$87,156
$80,142
$99,941
$87,353
$119,469
Index Sample
Std
Error RSE
Mean
$97,107
$3,434 3.5
$94,901
$3,259 3.4
$76,488
$2,497 3.3
$92,495
$2,279 2.5
$69,615
$2,454 3.5
$80,210
$2,322 2.9
$70,012
$2,797 4.0
$87,457
$3,788 4.3
$77,321
$1,976 2.6
$96,706
$3,035 3.1
$84,389
$1,331 1.6
$85,867
$2,946 3.4
$92,661
$2,934 3.2
$69,826
$85,093
$84,026
$77,759
$77,209
$96,581
$82,235
$80,055
$79,231
$93,872
$65,579
$78,758
$92,278
$84,721
$102,023
$87,335
$122,602
$4,909
$2,253
$3,389
$3,304
$2,247
$2,065
$959
$2,301
$4,081
$3,534
$948
$2,101
$2,263
$1,854
$2,205
$2,761
$3,698
7.0
2.6
4.0
4.2
2.9
2.1
1.2
2.9
5.2
3.8
1.4
2.7
2.5
2.2
2.2
3.2
3.0
%
Change
(Index Full)
2.16
3.80
1.47
1.57
1.27
-2.08
-3.42
-0.73
3.71
5.43
-0.64
-0.54
-1.12
1.67
-0.39
7.21
5.81
0.00
2.40
2.06
3.40
4.21
5.59
0.12
10.60
5.88
5.71
2.08
-0.02
2.62
Full
Sample
Mean
$92,229
$96,091
$75,175
$96,392
$72,366
$82,635
$71,755
$89,567
$78,096
$101,173
$92,410
$86,028
$99,231
$102,254
$67,748
$90,335
$80,670
$76,251
$79,052
$96,310
$85,459
$75,080
$74,414
$87,764
$66,989
$77,644
$87,152
$82,816
$108,118
$89,432
$126,579
Regression Method
Index Sample
Std
Error
Mean
$92,869 $2,304
$96,210 $1,771
$74,986 $1,051
$96,699 $2,129
$72,402 $1,230
$82,662 $1,682
$71,338
$844
$89,842 $1,875
$77,978 $1,264
$101,659 $2,347
$92,343 $1,635
$85,880 $1,197
$99,493 $2,704
$102,961 $2,904
$67,447
$831
$90,143 $1,388
$80,748 $1,434
$76,144 $1,185
$78,935 $1,202
$96,460 $1,758
$85,314 $1,332
$74,901 $1,132
$74,405 $1,212
$88,144 $1,798
$66,703
$815
$77,326
$970
$87,128 $1,449
$82,454 $1,077
$108,225 $2,285
$89,215 $1,333
$127,479 $4,485
RSE
2.5
1.8
1.4
2.2
1.7
2.0
1.2
2.1
1.6
2.3
1.8
1.4
2.7
2.8
1.2
1.5
1.8
1.6
1.5
1.8
1.6
1.5
1.6
2.0
1.2
1.3
1.7
1.3
2.1
1.5
3.5
%
Change
(Index Full)
0.69
0.12
-0.25
0.32
0.05
0.03
-0.58
0.31
-0.15
0.48
-0.07
-0.17
0.26
0.69
-0.44
-0.21
0.10
-0.14
-0.15
0.16
-0.17
-0.24
-0.01
0.43
-0.43
-0.41
-0.03
-0.44
0.10
-0.24
0.71
Table 4. The Effect of an NCS Sample Cut on Federal Pay Gaps
Full
Sample
Area
Atlanta
Boston
Buffalo
Chicago
Cincinnati
Cleveland
Columbus
Dallas
Dayton
Denver
Detroit
Hartford
Houston
Huntsville
Indianapolis
Los Angeles
Miami
Milwaukee
Minneapolis
New York
Philadelphia
Phoenix
Pittsburgh
Portland
Rest of U.S.
Richmond
Sacramento
San Diego
San Francisco
Seattle
Washington DC
Mean
49.83
54.34
40.45
49.33
31.56
39.31
40.06
45.97
31.89
44.04
47.67
56.02
45.16
39.18
35.48
54.88
41.24
37.98
45.65
63.56
44.08
43.40
37.27
44.45
26.58
33.00
49.76
51.74
66.60
55.04
63.92
OPM Model
Index Sample
Mean
53.06
60.20
42.52
51.67
33.22
36.41
35.26
44.91
36.78
51.86
46.72
55.17
43.53
Std
Error
5.41
5.50
4.65
3.74
4.70
3.95
5.40
6.28
3.50
4.77
2.31
5.32
4.55
RSE
10.2
9.1
10.9
7.2
14.1
10.8
15.3
14.0
9.5
9.2
5.0
9.7
10.4
Change
(Index Full)
3.23
5.86
2.07
2.34
1.66
-2.90
-4.80
-1.06
4.89
7.82
-0.95
-0.85
-1.63
37.75
54.28
51.42
46.00
45.65
67.48
47.05
48.28
43.04
52.52
26.73
47.10
58.56
60.41
70.07
55.01
68.22
9.68
4.08
6.11
6.20
4.24
3.58
1.72
4.26
7.37
5.74
1.83
3.92
3.89
3.51
3.68
4.90
5.07
25.7
7.5
11.9
13.5
9.3
5.3
3.6
8.8
17.1
10.9
6.9
8.3
6.6
5.8
5.2
8.9
7.4
2.27
-0.61
10.18
8.02
0.01
3.92
2.97
4.88
5.77
8.07
0.15
14.10
8.80
8.67
3.47
-0.03
4.30
Full
Sample
Mean
45.37
62.21
40.07
58.06
38.49
40.53
38.63
48.40
38.16
58.88
60.66
55.46
53.71
44.91
33.65
63.78
45.38
43.17
49.13
67.01
52.81
39.07
34.35
42.60
29.46
45.02
49.75
56.80
80.23
58.73
73.68
Regression Method
Index Sample
Mean
46.38
62.41
39.72
58.56
38.56
40.58
37.82
48.86
37.95
59.64
60.55
55.19
54.11
45.91
33.06
63.43
45.52
42.97
48.91
67.27
52.56
38.73
34.33
43.22
28.91
44.42
49.71
56.12
80.40
58.34
74.91
Std
Error
3.63
2.98
1.96
3.49
2.36
2.86
1.63
3.11
2.24
3.69
2.84
2.17
4.19
4.12
1.64
2.52
2.58
2.22
2.27
3.05
2.39
2.10
2.19
2.92
1.57
1.81
2.49
2.04
3.81
2.36
6.15
RSE
7.8
4.8
4.9
6.0
6.1
7.0
4.3
6.4
5.9
6.2
4.7
3.9
7.7
9.0
5.0
4.0
5.7
5.2
4.6
4.5
4.5
5.4
6.4
6.8
5.4
4.1
5.0
3.6
4.7
4.1
8.2
Change
(Index Full)
1.01
0.20
-0.35
0.50
0.07
0.05
-0.81
0.46
-0.21
0.76
-0.11
-0.27
0.40
1.00
-0.59
-0.35
0.14
-0.20
-0.22
0.26
-0.25
-0.34
-0.02
0.62
-0.55
-0.60
-0.04
-0.68
0.17
-0.39
1.23
Table 5. The Effect of Excluding Area-Specific NCS Data on Non-Federal Salary and Pay Gap Estimates
Regression Method
Non-Federal Salary
Pay Gap
Area
Atlanta
Boston
Buffalo
Chicago
Cincinnati
Cleveland
Columbus
Dallas
Dayton
Denver
Detroit
Hartford
Houston
Huntsville
Indianapolis
Los Angeles
Miami
Milwaukee
Minneapolis
New York
Philadelphia
Phoenix
Pittsburgh
Portland
Richmond
Sacramento
San Diego
San Francisco
Seattle
Washington, DC
With Area
Excluded from
Full Sample NCS Sample
$92,229
$92,403
$96,091
$95,834
$75,175
$75,182
$96,392
$96,227
$72,366
$72,342
$82,635
$82,636
$71,755
$71,722
$89,567
$89,275
$78,096
$78,094
$101,173
$101,153
$92,410
$92,616
$86,028
$86,051
$99,231
$99,326
$102,254
$102,368
$67,748
$67,697
$90,335
$90,760
$80,670
$80,654
$76,251
$76,255
$79,052
$79,120
$96,310
$96,565
$85,459
$85,407
$75,080
$74,998
$74,414
$74,392
$87,764
$87,751
$77,644
$77,580
$87,152
$87,145
$82,816
$82,839
$108,118
$108,482
$89,432
$89,548
$126,579
$127,274
Percent
Change
0.19
-0.27
0.01
-0.17
-0.03
0.00
-0.05
-0.33
0.00
-0.02
0.22
0.03
0.10
0.11
-0.08
0.47
-0.02
0.00
0.09
0.26
-0.06
-0.11
-0.03
-0.01
-0.08
-0.01
0.03
0.34
0.13
0.55
With Area
Excluded from
Full Sample NCS Sample
45.37
45.65
62.21
61.78
40.07
40.08
58.06
57.79
38.49
38.44
40.53
40.53
38.63
38.57
48.40
47.92
38.16
38.15
58.88
58.85
60.66
61.02
55.46
55.50
53.71
53.86
44.91
45.07
33.65
33.55
63.78
64.55
45.38
45.35
43.17
43.18
49.13
49.26
67.01
67.46
52.81
52.72
39.07
38.91
34.35
34.31
42.60
42.58
45.02
44.90
49.75
49.74
56.80
56.85
80.23
80.83
58.73
58.93
73.68
74.63
Change
0.28
-0.43
0.01
-0.27
-0.05
0.00
-0.06
-0.48
-0.01
-0.03
0.36
0.04
0.15
0.16
-0.10
0.77
-0.03
0.01
0.13
0.45
-0.09
-0.16
-0.04
-0.02
-0.12
-0.01
0.05
0.60
0.20
0.95
Table 6. Estimated Non-Federal Salaries and Pay Gaps for Potential OPM Areas
Area
Albany
Albuquerque
Bakersfield
Beaumont
Harrisburg
Lansing
New Orleans
Portland, ME
Wilmington
Federal
Salary
$57,252
$53,977
$59,297
$50,303
$54,883
$57,882
$58,290
$60,362
$51,394
Current
RUS Non- Current
Federal RUS Pay
Salary
Gap
Estimate Estimate
$70,025
35.32
$70,025
35.32
$70,025
35.32
$70,025
35.32
$70,025
35.32
$70,025
35.32
$70,025
35.32
$70,025
35.32
$70,025
35.32
Regression Method
Non-Federal Salary
Current
RUS
Estimate
$66,989
$66,989
$66,989
$66,989
$66,989
$66,989
$66,989
$66,989
$66,989
AreaSpecific
%
Estimate Change
$80,331
19.92
$75,105
12.12
$94,993
41.80
$59,427 -11.29
$75,727
13.04
$83,793
25.08
$76,563
14.29
$82,341
22.92
$61,599
-8.05
Current
RUS
29.46
29.46
29.46
29.46
29.46
29.46
29.46
29.46
29.46
Pay Gap
AreaSpecific
Estimate Change
40.31
10.85
39.14
9.68
60.20
30.74
18.14
-11.32
37.98
8.52
44.77
15.31
31.35
1.89
36.41
6.95
19.86
-9.60
Figure 1. Ranks of Area Pay Gaps
35
Rest of U.S.
Indianapolis
Pittsburgh
Dayton
Cincinnati
30
Columbus
Phoenix
Buffalo
Cleveland
Regression Method Rank
25
Portland
Milwaukee
Huntsville
20
Richmond
Atlanta
Miami
Dallas
15
Minneapolis
Sacramento
Philadelphia
Houston
Hartford
San Diego
10
Chicago
Seattle
Denver
Detroit
5
Boston
Los Angeles
New York
Washington, DC
San Francisco
0
0
5
10
15
20
Current Method Rank
25
30
35
File Type | application/pdf |
File Title | Microsoft Word - Using OES Data for Federal Pay Comparability - Updated.doc |
Author | dey_m |
File Modified | 2011-11-22 |
File Created | 2009-03-26 |