GTP OMB Supporting Statement Part B

GTP OMB Supporting Statement Part B.docx

Green Technologies and Practices Survey

OMB: 1220-0184

Document [docx]
Download: docx | pdf

6/24/11

Supporting Statement for Request for OMB Approval

Occupational Employment Statistics Green Technologies and Practices

Data Collection Clearance


B. Collection of Information Employing Statistical Methods


The collection of data on Green Technologies and Practices (GTP) is new to BLS. No survey-specific historical information is available on variances, response rates, etc.


1(a) Respondent Universe


The universe for this survey includes private and government establishments in the 50 States and the District of Columbia. The sampling frame is the Quarterly Census of Employment and Wages (QCEW) Longitudinal Database (LDB) maintained by the Bureau of Labor Statistics. The QCEW program produces a comprehensive tabulation of employment and wage information for workers covered by State unemployment insurance (UI) laws and Federal workers covered by the Unemployment Compensation for Federal Employees (UCFE) program. Included are data on the number of establishments, monthly employment, and quarterly wages, by industry based on the North American Industrial Classification System (NAICS), by county, and by ownership sector, for the entire United States. This information is provided for over 9 million business establishments of which about 6.9 million are in the scope of this survey. (Out of scope establishments include private households, establishments with zero employment, and establishments in outlying areas.)


1(b) Sample


ScopeThe GTP will measure occupational employment and wages of workers engaged in green practices in establishments in the 50 States and the District of Columbia. The survey covers all NAICS industries except NAICS 814, private households.


Sample SizeThe sample size is approximately 35,000 establishments. The following table shows the estimated number of universe units, sampled units, and responding units (assuming a 75 percent response rate, see 3(a)) for all in-scope NAICS for the GTP survey:


Table 1: Universe and Sample Size Summary

Survey

NAICS Coverage

Responding Units

Sample Units

Universe Units

FY 2011

All NAICS Except 814

26,251

35,000

6,900,000


StratificationUnits on the sampling frame will be stratified by Census Region and by two-digit NAICS industry sector.



2(a) Sample Design


The GTP will have a probability-based sample aimed at satisfying data needs at both the national and Census Region/2-digit industry level. The basic sampling unit is an establishment. The most recent data from our pilot testing shows that 62.7 percent of responding establishments from the QCEW frame reported green technologies or practices at their business. Thirty two point five percent of establishments reported employees meeting our green definition. BLS will use a rare-subpopulation sampling method to include a larger number of green establishments. To accomplish this, BLS has conducted research on establishments known to be green and has compiled its own green list through web research and the use of other known green business organization lists. This list contains about 32,000 establishments and will be incorporated into the sample in separate strata and sampled with a higher probability than other establishments.


Allocation methodA power allocation based on the square root of stratum size, defined by frame employment within each stratum will be used to allocate the sample to each Census Region/2-digit NAICS stratum. This is a compromise between 1) allocating an equal sample size to each stratum which is optimum where estimates for individual strata are of prime importance and 2) a straight proportional allocation based on stratum size which would be optimal for estimation of high-level data.


Sample SelectionWithin each stratum, the sample is selected using modified probability proportional to estimated employment size (PPES). The sample is designed to represent the frame. The estimated employment size for an establishment will be the maximum employment over the last 12 months of data available from the LDB. Units with average employment of zero over the last 12 months are excluded. Each sampled establishment is assigned a sampling weight equal to the reciprocal of its probability of selection in the sample.


2(b) Estimation Procedures


GTP estimators of total employment will take the form of a Horvitz-Thompson estimator. Let a cell have c sample units. Each establishment has a weight wi that is the inverse of its probability of selection. Each establishment has a data value yi. For example, the data value could be the total number of employees who spend more than 50% of their time engaged in green practices.


 

Weighting Class Adjustment for Nonresponse–To mitigate possible bias arising from nonresponse, weighting class adjustments to the weights will be made. Initial plans are to define Census Region x industry sector cells as the weighting classes. With PPES sampling, an adjustment based on employment will be used. Let ei be an establishment’s employment size used for allocation and sample selection (not the current size). Of c sample units in a weighting class, suppose that r respond. Weights of respondents are increased to “cover” the missing data of nonrespondents. Additionally, modifications are made to account for establishments that are out-of-scope (noos) or out-of-business (noob).


 

 

Benchmarking–Benchmarking to in-scope employment for national and Census Region x industry sector is planned. Some protection would be provided against coverage shortcomings. Since “green” economic activity is anticipated to be a small part of most industries, little variance-reduction benefit would be expected from benchmarking. The classes may be structured differently, but a weight adjustment similar to weighting class adjustment for nonresponse can be computed. Instead of employment ei at the time of sampling, use the latest employment Ei; that is, control the sample weighted estimates to known population values obtained from an updated QCEW. A ratio adjustment is computed from the unweighted “known” employment total for the class (the sum of Ei over the population N) and the sample estimate that can be made from the respondents weighted by .

 

 



For totals, simple weighted estimates can be made using the weights and the responding establishment values yi for a characteristic.

The sum in the formula is restricted to the r respondents that contribute to an estimate y of an unknown population characteristic Y.


 



Wage Estimation–Mean wage and median wage estimates will be calculated for occupations within Census Region/two-digit industry cells. Wage rate data will be collected in broad wage bands instead of exact data points. To approximate median wage rates, a uniform distribution within wage intervals is assumed and a simple linear interpolation between the endpoints of the wage intervals is used.


Because wage rate data are collected in broad wage bands instead of exact data points, the standard error for each mean wage rate estimate is calculated using a components model. This model accounts for the variability of both the observed and unobserved components of wage rate data. A traditional ratio variance estimator will be used to account for the variability observed in the collected GTP wage data.


2(c) Reliability


The estimation of sample variances will use a replication methodology similar to that used by Current Employment Statistics (CES) and the Job Openings and Labor Turnover Survey (JOLTS). Balanced Half Sampling (BHS) uses half samples of the original sample and calculates estimates using those half samples. Balanced Repeated Replication (BRR) modifies the technique by using the entire sample for making replicate estimates but by perturbing the weights of half samples in a systematic fashion using a Hadamard matrix. The sample variance is calculated by measuring the variability of the estimates made from these replicates. (For a detailed mathematical presentation of this method as applied to estimates from the Current Employment Statistics Survey, see Handbook of Methods, BLS Chapter 2, pages 8-9, Bureau of Labor Statistics, 200 or http://www.bls.gov/opub/hom/homch2_i.htm.) A method with a different Hadamard matrix is planned for GTP.


The standard weight perturbation uses a factor of ½ to either increase or decrease the weight for an establishment when making an estimate for the αth replicate.



For each replicate α, an estimate can be made, for example of a total Y. If there are A replicates, each replicate can be compared to an estimate made from the entire sample (using original weights) and the following formula used to calculate an estimated variance.



As mentioned in the beginning of Part B, this is a new survey for BLS, thus at present only crude estimates of standard errors based on results from pilot tests are available. GTP sample allocation and selection based on the properties discussed above were simulated to estimate approximate cell sample sizes. Rough design effects were calculated from historical QCEW data. The resultant sample sizes by 2-digit NAICS, and estimated National and Regional CVs by 2-digit NAICS are provided below.






Industry

Estimated CV on Green Employment

NAICS2

Sector Name

Estimated National Sample Size

National

Northeast

Midwest

South

West

11

Agriculture/Forestry/Fishing/Hunting

776

7.6%

16.2%

15.4%

13.8%

16.6%

21

Mining

504

6.4%

13.7%

13.0%

11.7%

14.0%

Industry

Estimated CV on Green Employment

NAICS2

Sector Name

Estimated National Sample Size

National

Northeast

Midwest

South

West

22

Utilities

593

5.6%

12.0%

11.4%

10.2%

12.3%

23

Construction

1,912

9.8%

21.0%

20.0%

17.9%

21.5%

31

Manufacturing, Non Durable

931

5.4%

11.6%

11.0%

9.8%

11.8%

32

Manufacturing, Non Durable

1,104

5.3%

11.3%

10.8%

9.6%

11.6%

33

Manufacturing, Durable

1,693

3.8%

8.2%

7.8%

7.0%

8.4%

42

WholesaleTrade

1,605

6.7%

14.4%

13.7%

12.3%

14.7%

44

Retail Trade

2,192

3.4%

7.2%

6.9%

6.1%

7.4%

45

Retail Trade

1,509

4.0%

8.7%

8.2%

7.4%

8.8%

48

Transportation & Warehousing

1,168

5.6%

12.1%

11.5%

10.3%

12.3%

49

Transportation & Warehousing

898

4.4%

9.4%

8.9%

8.0%

9.6%

51

Information

1,092

5.0%

10.7%

10.1%

9.1%

10.9%

52

Finance & Insurance

1,686

3.3%

7.1%

6.8%

6.1%

7.3%

53

Real Estate & Rental/Leasing

1,127

4.7%

10.1%

9.6%

8.5%

10.3%

54

Prof/Scientific/Tech. Services

2,133

3.2%

6.9%

6.5%

5.8%

7.0%

55

Management of Co & Enterprises

836

7.0%

15.0%

14.2%

12.7%

15.3%

56

Admin/Waste/Remediation Services

1,894

5.9%

12.6%

12.0%

10.7%

12.9%

61

Education Services

2,186

4.3%

9.2%

8.7%

7.8%

9.4%

62

Health Care/Social Assistance

2,739

2.9%

6.2%

5.8%

5.2%

6.3%

71

Arts/Entertainment/Recreation

1,100

4.2%

9.1%

8.6%

7.7%

9.3%

72

Accommodation/Food Services

2,192

4.0%

8.6%

8.2%

7.3%

8.8%

81

Other Services

1,538

4.0%

8.6%

8.2%

7.3%

8.8%

92

Public Administration

1,632

5.5%

11.9%

11.3%

10.1%

12.2%

All

Across All Industries

35,040

1.1%

2.5%

2.3%

2.1%

2.5%


2(d) Data Collection Cycle


The Green Technologies and Practices Survey is expected to be a one-time data collection effort.


3(a) Maximizing Response


A goal of the GTP survey is to achieve a 75 percent response rate. To date, BLS has completed three of five test panels to refine the paper survey form. The overall response rate on the first test panel was about 38 percent while the response rates on the second and third panels improved to about 50 percent. These test panels were in the field for a relatively short period of time (6 weeks) and were conducted exclusively using the mail survey form, and in the third panel, a faxable survey form. Improvements to the survey form and cover letter have been made as a result of information learned from the test panels. The 2009 Green Jobs Survey conducted by the State of Washington utilizing mail collection obtained a response rate of 70 percent. By utilizing web-based collection (currently being tested), email collection, mail, fax, and telephone collection BLS hopes to achieve a higher response rate of 75 percent.

The survey protocol includes the initial mailing in August 2011, of the survey form, return envelope, cover letter, and web reporting instructions. A non-response prompt postcard will be mailed to respondents within three weeks of the initial mailing. Approximately three weeks following the postcard mailing a third survey mailing including the survey form, return envelope, and web reporting instructions will be sent. Telephone data collection for all non-respondents will begin approximately two weeks following the third and final mail out and continue until January 2012.


The GTP is a voluntary survey with no historical response rate information. Every effort will be made to maximize response rates to achieve the 75-percent goal by:


  • Primarily reporting via a web-based instrument. Research determined that web-based reporting is important to the success of this survey due to the subject matter.

  • Researching and obtaining e-mail addresses and contact names for sampled units where possible. When contact names are not available, our research suggests the following guidelines be used in addressing the forms: “Proprietor/Owner” for firms with 10 or fewer employees; “General Manager” for larger firms with less than 100 employees and; “Human Resources Manager” for firms with more than 100 employees.

  • Conducting extensive address refinement to ensure that the survey solicitation package reaches the correct establishment in a timely manner.

  • Providing each sampled unit with a cover letter explaining the importance of the survey and the need for voluntary cooperation.

  • Giving each private sector sample unit the Bureau’s pledge of confidentiality.

  • Sending each nonresponding unit two additional mailings after the initial mail-out (and contacting nonresponding units by telephone).

  • Using status reports and control files to identify cells with low response rates and targeting nonresponse follow-up activities and telephone data collection for key respondents in these cells

  • Stressing to respondents that assistance is available to help them complete the survey form.

  • Using a respondent web page that provides detailed information about responding to the GTP survey, including contact information for those needing assistance and a link to on-line data reporting.

  • Increasing the use of electronic and telephone collection in order to allow the respondent to provide information in a way that is most convenient to them.



3(b) Nonresponse Adjustment


Prior to calculating estimates, preliminary editing procedures will flag questionable data. The extent and nature of item non-response is not known. It is likely that responses that are incomplete after follow-up contact will either be made unusable (treated like unit non-response) or imputed.


Weighting class adjustments to sampling weights wi will be made to partially compensate for the bias that arises if non-response is ignored (see Estimation Procedure section). In the simplest form, the entire sample is divided into weighting classes based on strata or a simple subdivision of the population. The default for GTP will be weighting classes defined by Census Region x industry sector, and analysis of responses will determine if more complexity is needed.


Given a hypothesized response rate (75%) that is less than optimal (80 – 100%), nonresponse bias in key estimates may be assessed through multivariate models that incorporate data from, primarily, two sources. The first source of data is the sample frame data. The sampling frame for the GTP survey, the LDB, contains establishment size and industry information, as well as age of the sampled establishment. The second source, sometimes referred to as “paradata,” describes important aspects of the administration of the survey. These data include variables that may describe some of the following: the number and type of contact attempts and interview forms or reminders mailed to each establishment, the time in the field, the job title of the respondent within the sampled establishment, the number of different persons in the establishment that have been contacted, whether the form was sent to a central office, any expressions by the contacts within the establishment indicating unwillingness or refusal to complete the survey, and any interviewer comments recorded for the record. Nonresponse propensity using data from these two sources will be assessed, possibly through multivariate logistic regression models, within each mode of survey administration (telephone, internet, and mail) and across the modes of survey administration. The propensity to respond will then be examined in terms of its contribution to bias on key survey variables, such as the number and type of green activities and the number and type of green jobs.


BLS will conduct a Response Analysis Survey (RAS) on up to 1,000 GTP respondents and non-respondents to ensure accurate data collection and to assess non-response bias.  The RAS will begin immediately following the general data collection for the GTP survey.  BLS will examine general data quality issues in the RAS. To the extent possible, the analysis will of the RAS will inform the aforementioned nonresponse models.  The questions will be submitted as a non-substantive change closer to the start of the RAS.


3(c) Confidentiality


Before estimates are released to the public, they must first be screened to ensure that they do not violate the Bureau of Labor Statistics’ (BLS) confidentiality pledge. A promise is made by the Bureau to each private sector sample unit that the BLS will not release its employment data to the public in a manner that would allow others to identify the unit. If a green estimate fails confidentiality screening, the estimate is suppressed.



4. Developmental Tests


A series of five field tests is being conducted to develop data collection methodology. The first was a basic examination of data collection issues conducted under OMB Clearance 1220-0141 and completed in October 2010. Beginning in January 2011, the second and third tests were used to modify and fine tune the survey questions, the fourth was used to finalize the survey form, and the fifth will be a test of the web collection instrument. These efforts include periodic evaluations of the effectiveness of the materials mailed to each sampled unit. The second through fifth tests are covered under OMB Clearance 1220-0182.


5. Statistical and Analytical Responsibility


Ms. Shail Butani, Chief, Statistical Methods Division of the Office of Employment and Unemployment Statistics, and Mr. George Stamas, Chief, Division of Occupational Employment Statistics, Office of Employment and Unemployment Statistics are responsible for the statistical and analytical aspects of the GTP program respectively. Ms. Butani can be reached on 202-691-6347. Mr. Stamas can be reached on 202- 691-6350. Additionally, BLS seeks consultation with other outside experts on an as needed basis.


6. References


Bankier, Michael D. (1988). Power Allocations: Determining Sample Sizes for Subnational Areas. American Statistician, Vol. 42, pp. 174-177.


Bureau of Labor Statistics’ Handbook of Methods, Chapter 3, Bureau of Labor Statistics, 2008 (http://www.bls.gov/opub/hom/homch3_a.htm)


Bureau of Labor Statistics’ Handbook of Methods, Chapter 2, pages 8-9, Bureau of Labor Statistics, 2004 ( http://www.bls.gov/opub/hom/homch2_i.htm.)


Kalton, Graham and Anderson, Dallas W. (1986). Sampling Rare Populations. Journal of the Royal Statistical Society, A.  149:69-82.


Lawley, Ernest, Stetser, Marie, and Valaitis, Eduardas. (2007) Alternative Allocation Designs for a Highly Stratified Establishment Survey. 2007 Joint Statistical Meetings.


Lohr, Sharon L. and Rao, J. N. K. (2006), Estimation in Multiple-Frame Surveys, Journal of the American Statistical Association. 475:1019-1030.


OES State Operations Manual, Bureau of Labor Statistics, Internal Document, 2008.


Piccone, David and Stetser, Marie. (2009) National Sample Reallocation for the Occupational Employment Statistics Survey, 2009 Joint Statistical Meetings.


Technical Notes for May 2009 OES Estimates, Bureau of Labor Statistics, 2010 (http://www.bls.gov/oes/current/oes_tec.htm).


22


File Typeapplication/vnd.openxmlformats-officedocument.wordprocessingml.document
File TitleSupporting Statement for Request for OMB Approval
Authorfairman_k
File Modified0000-00-00
File Created2021-02-01

© 2024 OMB.report | Privacy Policy