Overview - Challenges
This page gives an overview of common challenges you could encounter when analyzing OJAs. Next to each challenge is the relevant section in the guide.
Data Collection
Where to find OJA data? Data Sources and OJA Landscaping
How to collect OJA data? Web Scraping
How to store OJA data? Job Posting Data Schema
Data Enrichment
How to segment a job ad? Text Segmentation
How to identify duplicates? Identifying Duplicates
How to extract occupations? Data Enrichment
How to extract skills and competences? Extracting Skills
Evaluation and Quality Control
How to evaluate extraction and classification algorithms? Evaluation Metrics
How to create a gold standard for evaluation? Gold Standard Annotation and Quality
Taxonomies and Ontologies
Which taxonomies are there (ISCO, ESCO, KLDB, etc.)? Taxonomies and Ontologies
How to develop a taxonomy? Developing a Taxonomy
How to evaluate a taxonomy? Data Standards for Taxonomies and Ontologies
Dataset Curation and Representativity Analysis
How to deal with duplicates? Deduplication
How to validate an OJA dataset/sample? Representativity Analysis
What to report? Reporting Results
Last updated