South Africa - Post Apartheid Labour Market Series 1993-2019
Reference ID | zaf-datafirst-palms-1993-2019-v3.3 |
Year | 1993 - 2019 |
Country | South Africa |
Producer(s) |
Andrew Kerr - DataFirst, University of Cape Town David Lam - University of Michigan Martin Wittenberg - DataFirst, University of Cape Town |
Created on
Sep 05, 2013
Last modified
Sep 11, 2020
Page views
87455
Overview
Identification
ID Number zaf-datafirst-palms-1993-2019-v3.3 |
Version
Version Description
v3.3: Edited, anonymised dataset for public distributionProduction Date
2019-09-02Notes
The current version of the Post Apartheid Labour Market Series (PALMS) is version 3.3 covering 1993-2019. Data from Statistics SA's Quarterly Labour Force Surveys 2018 and 2019 have been added to the data in this dataset. This version also contains the new the new calibrated weights for the data produced on the 4th of March 2020. This is a temporary solution to provision of the weights, which will be incorporated into PALMS version 3.4 currently being prepared by DataFirst.Earlier versions of this dataset were:
Version 1.0.1:
The following derived variables were added to this version of the dataset:
enrollment3, enrolled: enrolment variables
hrslstwk: Hours worked in last week
employer1, businesstype1, businesstype2: questions about wage/self employment. More detailed in LFSs.
publicemp: a dummy for whether the individual is employed in the public sector
Changes in versions 1.0.2 and 1.0.3 were not recorded.
Version 1.0.4
The following derived variables were added to this version of the dataset:
Improved psu LFS 03:1 variable.
personnum in OHSs and LFSs to allow easier merging, on advice from Nicola Branson.
Extra EA/PSU variables in some years where these were missing.
numlabels, were also added, where these were missing.
Version 1.0.5
The following derived variables were added to this version of the dataset:
EA/PSU variables for both 2000 waves, both 2006 waves and both 2007 waves. EA variables may not always be consistent ACROSS waves but are correct WITHIN waves to allow for checks on the number of hh per ea, which look right in ALL LFS waves now.
Version 1.0.6.
In this version the 1994 income data from the OHS 1994 has been corrected, and the 1994 data no longer has the imputations and fixes from Statistics South Africa.
The OHS 1994 wage employment income is included as wageempincome2 and wageempincome3 (different for gross and net responses).
Version 1.0.7
This version has label changes (data signature is the same as version 1.0.6)
Version 1.0.8
In this version the variable uqnr_orig has been added, which is a string version of the household id variable exactly as it appeared in each survey. This will assist those researchers who wish to merge in extra data from the OHSs or LFSs.
Version 1.0.9, October 2012
The following variables have been added to this version of the data:
(i) The LFS variable for hours worked in last week
(ii) An homogenised EA variable
(iii) A variable indicating formality of firm an individual owns or works for
(iv) Variables for number of workers for self-employed (worker numbers for OHS only): selfformalreg selfvatreg selfpaidemp selfunpaidemp wageformalreg formalreg
In this version the OHS 1997 industry variable has been corrected to include industry of the self-employed.
Version 2.0, August 2013
(i) Included the QLFS up until March 2012 (inclusive)
(ii) Reworked approach to labour income variable creation (see documentation for more information)
Version 2.1, September 2013 (modified September 2015)
(i) Included a separate datafile with multiple imputations for labour income. This datafile is called "palmsv2.1miincomes" and can be used for analysis of trends in labour income over time. Details of the imputation process can be found in the document titled, "Multiply Imputed Labour Income Data, PALMS v2.1 (1994-2012)"
(ii) Also included with this version of the PALMS are the cross-entropy weights
Not all the files in this dataset are version 2.1. DataFirst versions at file level, so only the files which have been updated will be re-versioned in a new release. This prevents users having to download the files which have not been changed.
Some of the data files in this version of the dataset - the ones that remain unchanged - will therefore still be version 2. The dataset will receive the version number of the latest versioned file.
The PALMS 1994-2012 version 2.1 dataset consists of the following files:
The data files (unchanged version 2 files)
The cross-entropy weights (unchanged version 1.2 files)
The incomes data (unchanged version 2)
The imputed incomes data version 2.1
document
Version 3.1 includes data from the QLFS 2013-2015 and data from the 1993 survey "Project for Statistics on Living Standards and Development", conducted by SALDRU at UCT.
Version 3.2 had data from the QLFS 2016-2017 added to version 3.1
Overview
Abstract
The Post-Apartheid Labour Market Series (PALMS) version 3.3 is a stacked cross sectional dataset created by DataFirst at the University of Cape Town. The data consists of microdata from 69 household surveys conducted by Statistics South Africa between 1994 and 2019, as well as the 1993 Project for Statistics on Living Standards and Development conducted by SALDRU at UCT. The Statistics South Africa surveys include the October Household Surveys from 1994 to 1999, the bi-annual Labour Force Surveys from 2000-2007, including the smaller LFS pilot survey from February 2000, and the Quarterly Labour Force Surveys from 2008-2019. The data is at individual level, but household level variables may be created using the household id variable uqnr. No attempt has been made to link individuals or households across waves, although there was a panel element to the earlier rounds of the LFS, as well as the QLFS.Kind of Data
Sample survey dataUnits of Analysis
Households and individualsScope
Notes
There are currently over 120 variables in the dataset and over 5.7 million observations, including concerning children and the elderly. The variables included are mainly those to do with the labour market, although some household variables, such as dwelling type and access to services, as well as access to government social grants, are also included.Topics
Topic | Vocabulary | URI |
---|---|---|
Employment | ||
income | ||
labour markets | ||
social grants. |
Coverage
Geographic Coverage
The surveys used to construct PALMS had national coverageGeographic Unit
The lowest level of geographic aggregation in PALMS is province.Universe
The target population is all households. Coverage of workers' hostels, convents/monasteries, as well as institutions such as old age homes, hospitals, prisons and military barracks varied across the surveys. Data users will need to consult the individual OHS, LFS and QLFS datasets for information on the universe for each survey.Producers and Sponsors
Primary Investigator(s)
Name | Affiliation |
---|---|
Andrew Kerr | DataFirst, University of Cape Town |
David Lam | University of Michigan |
Martin Wittenberg | DataFirst, University of Cape Town |
Other Acknowledgements
Name | Affiliation | Role |
---|---|---|
Statistics South Africa | Producer of original datasets used to create the PALMS dataset |
Metadata Production
Metadata Produced By
Name | Abbreviation | Affiliation | Role |
---|---|---|---|
DataFirst | University of Cape Town | Metadata Producer |
Date of Metadata Production
2020-05-03DDI Document Version
Version 5