Software

I’ve created a few R packages for working with UK labour market datasets. All of them take the source data and compile into a single R data frame where each variable is a column, so that it’s possible to do time series analysis out of the box. In short, they turn each dataset into tidy data.

tidylfs

ONS Labour Force Survey. Stiches together individual files into one R data.frame, and adds descriptions for occupation, industry. Quality of life improvements like picking the correct weight variable from each quarterly dataset and turning it into a single column. Works for quarterly data from 1992, or the APS from 2004.

tidyusoc

Understanding Society/UK Household Longitudinal Study (UKHLS/BHPS). Joins waves together to create a single file, one per category (indresp, youth, etc.). Works on the entire dataset going back to wave 1 of the BHPS.

tidyashe

ONS Annual Survey of Hours and Earnings. Experimental. Adds descriptions to occupation and industry and stiches files together. Works for files as they are in DfE, but may not work elsewhere.