Occupation coding, an important task in official statistics, refers to coding a respondent's text answer into one of many hundreds of occupation codes. To date, occupation coding is still at least partially conducted manually, at great expense. We propose three methods for automatic coding: combining separate models for the detailed occupation codes and for aggregate occupation codes, a hybrid method that combines a duplicate-based approach with a statistical learning algorithm, and a modified nearest neighbor approach. Using data from the German General Social Survey (ALLBUS), we show that the proposed methods improve on both the coding accuracy of the underlying statistical learning algorithm and the coding accuracy of duplicates where dupl...
This dissertation addresses the measurement of occupation in surveys. Many surveys ask respondents a...
In social surveys, although occupation coding is very important, it is also very difficult. In this ...
We develop a new automatic coding system with a three-grade confidence level corresponding to each o...
In almost all surveys, respondents are asked for details of their occupation in the context of the d...
Currently, most surveys ask for occupation with open-ended questions. The verbatim responses are cod...
As occupational data play a crucial part in many social and economic analyses, information on the re...
This article studies coding errors in occupational data, as the quality of this data is important b...
Machine learning approaches achieve high accuracy for text recognition and are therefore increasingl...
This report is a deliverable of Work Package 21 ‘Innovative tools and protocols for working conditi...
Most surveys use an open-ended question to measure occupation, followed by office coding. This is ex...
Occupational coding in multi-country surveys is mostly a black box: have national survey agencies cl...
This article studies coding errors in occupational data, as the quality of this data is important bu...
Currently, most surveys ask for occupation with open-ended questions. The verbal responses are coded...
OBJECTIVES: Automatic job coding tools were developed to reduce the laborious task of manually assig...
This paper was written for the InGRID - Inclusive Growth Infrastructure Diffusion – project, which h...
This dissertation addresses the measurement of occupation in surveys. Many surveys ask respondents a...
In social surveys, although occupation coding is very important, it is also very difficult. In this ...
We develop a new automatic coding system with a three-grade confidence level corresponding to each o...
In almost all surveys, respondents are asked for details of their occupation in the context of the d...
Currently, most surveys ask for occupation with open-ended questions. The verbatim responses are cod...
As occupational data play a crucial part in many social and economic analyses, information on the re...
This article studies coding errors in occupational data, as the quality of this data is important b...
Machine learning approaches achieve high accuracy for text recognition and are therefore increasingl...
This report is a deliverable of Work Package 21 ‘Innovative tools and protocols for working conditi...
Most surveys use an open-ended question to measure occupation, followed by office coding. This is ex...
Occupational coding in multi-country surveys is mostly a black box: have national survey agencies cl...
This article studies coding errors in occupational data, as the quality of this data is important bu...
Currently, most surveys ask for occupation with open-ended questions. The verbal responses are coded...
OBJECTIVES: Automatic job coding tools were developed to reduce the laborious task of manually assig...
This paper was written for the InGRID - Inclusive Growth Infrastructure Diffusion – project, which h...
This dissertation addresses the measurement of occupation in surveys. Many surveys ask respondents a...
In social surveys, although occupation coding is very important, it is also very difficult. In this ...
We develop a new automatic coding system with a three-grade confidence level corresponding to each o...