◼️
Guide To German Online Job Ad Data
Impressum
  • A Guide to Collecting, Processing and Analyzing Online Job Ad Data
  • Navigation
    • Lifecycle
    • Overview - Challenges
    • Overview - Methods
  • Steps
    • Data Collection
    • Data Enrichment
    • Extraction Methods
    • Evaluation and Quality Control
    • Taxonomies and Ontologies
    • Dataset Curation and Representativity Analysis
  • In Practice
    • Literature and Projects
Powered by GitBook
On this page
  • Data Collection
  • Data Enrichment and Methods
  • Evaluation and Quality Control
  • Taxonomies and Ontologies
  • Dataset Curation and Representativity Analysis
Edit on GitLab
  1. Navigation

Overview - Methods

This page gives an overview of methodological approaches in the field of OJA analysis. Each method is linked to its respective section in this guide.

PreviousOverview - ChallengesNextData Collection

Last updated 1 year ago

Data Collection

  • Landscaping

  • Web Scraping

  • API Data Collection

  • Data Formats

Data Enrichment and Methods

  • Text Segmentation

  • Duplicate Identification

  • Natural Language Processing and Data Pre-Processing

  • Rule-based Matching

  • Supervised Document Classification

  • Named-Entity Recognition

  • Disambiguation/ Entity Linking

Evaluation and Quality Control

Taxonomies and Ontologies

Dataset Curation and Representativity Analysis

Gold Standard Annotation

Machine Learning Evaluation

Taxonomy Development

Taxonomy Evaluation

Sampling

Deduplication

Representativity Analysis

Filtering Data
Deduplication
Representativity Analysis
Gold Standard Annotation and Quality
Evaluation
Data Sources and OJA Landscaping
Web Scraping
API Data Collection / Data Providers
Job Posting Data Schema
Pre-Processing and Embeddings
Rule-Based Matching
#supervised-classification
#statistical-named-entity-recognition-token-classification
Semantic Similarity
Developing a Taxonomy
Data Standards for Taxonomies and Ontologies
Text Segmentation
Identifying Duplicates