DBpedia Pipeline

Lando edited this page Jul 27, 2017 · 4 revisions

This pipeline describes how the DBpedia data is imported. It follows the guidelines of the Structured Data Import. All relevant jobs are listed below in execution order. The pipeline assumes that Implisense and Wikidata have already been imported.

Normalization

  1. DBpediaImport
  2. DBpediaDataLakeImport
  3. DBpediaRelationParser
  4. DBpediaRelationImport

Duplicate Detection

  1. Deduplication with config file deduplication_dbpedia.xml

Data Merge

  1. Merging
  2. MasterConnecting
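
The execution order above can be written down as a small data structure, which makes it easy to check that no job is skipped or run out of order. This is only a sketch: the stage and job names come from this page, but the dictionary layout and the helper names `execution_order` and `describe` are hypothetical, and nothing here submits a job.

```python
from typing import Dict, List

# Stage and job names are taken from this page. The dict layout is an
# assumption made for illustration; dicts keep insertion order (Python 3.7+).
PIPELINE: Dict[str, List[str]] = {
    "Normalization": [
        "DBpediaImport",
        "DBpediaDataLakeImport",
        "DBpediaRelationParser",
        "DBpediaRelationImport",
    ],
    "Duplicate Detection": ["Deduplication"],
    "Data Merge": ["Merging", "MasterConnecting"],
}

# Config files named on this page, keyed by job (hypothetical mapping).
JOB_CONFIGS: Dict[str, str] = {"Deduplication": "deduplication_dbpedia.xml"}


def execution_order(pipeline: Dict[str, List[str]]) -> List[str]:
    """Flatten the stages into one list of jobs, preserving their order."""
    return [job for jobs in pipeline.values() for job in jobs]


def describe(pipeline: Dict[str, List[str]], configs: Dict[str, str]) -> List[str]:
    """Return one numbered line per job, noting its config file if it has one."""
    lines = []
    for step, job in enumerate(execution_order(pipeline), start=1):
        config = configs.get(job)
        suffix = f" (config: {config})" if config else ""
        lines.append(f"{step}. {job}{suffix}")
    return lines


if __name__ == "__main__":
    print("\n".join(describe(PIPELINE, JOB_CONFIGS)))
```

Running the script prints the seven jobs as a single numbered checklist. How each job is actually launched is not covered here.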

Next step: Kompass
