This component reads RDF files and applies the following OPAL components:
- Catfish (data-cleaner-service)
- Civet (quality-metrics-service)
- Metadata-Refinement (opal-confirm-conversion-service)
- Download the latest release.
- Create a copy of the default.properties file and edit the copy.
At least, setio.input
andio.outputDirectory
. - Finally, run it by
java -jar opal-batch.jar default.properties
.
Data should be in a RDF serialization format and using the Data Catalog Vocabulary (DCAT) vocabulary.
You can find ready-to go input data at the Hobbitdata Server. Data from open data portals is available in the directories OPAL/processed_datasets and OPAL/SourceGraphs.
Data Science Group (DICE) at Paderborn University
This work has been supported by the German Federal Ministry of Transport and Digital Infrastructure (BMVI) in the project Open Data Portal Germany (OPAL) (funding code 19F2028A).