The collection and analysis of the data on job advertisements with the use of big data Cover Image

Pozyskiwanie i analiza danych na temat ofert pracy z wykorzystaniem big data
The collection and analysis of the data on job advertisements with the use of big data

Author(s): Jacek Maślankowski
Subject(s): Economy, Socio-Economic Research
Published by: Główny Urząd Statystyczny
Keywords: big data; text mining; web scraping; labour market

Summary/Abstract: The goal of this paper is to present, on the one hand, the benefits for official statistics (labour market) resulting from the use of web scraping methods to gather data on job advertisements from websites belonging to big data compilations, and on the other, the challenges connected to this process. The paper introduces the results of experimental research where web-scraping and text-mining methods were adopted. The analysis was based on the data from 2017–2018 obtained from the most popular job-searching websites, which was then collated with Statistics Poland’s data obtained from Z-05 forms. The above-mentioned analysis demonstrated that web-scraping methods can be adopted by public statistics services to obtain statistical data from alternative sources complementing the already-existing databases, providing the findings of such research remain coherent with the results of the already-existing studies.

  • Issue Year: 64/2019
  • Issue No: 09
  • Page Range: 60-74
  • Page Count: 15
  • Language: Polish