Kohesio and the
EU Knowledge Graph
Anne Thollard | Max De Wilde
Data Oriented Services (DORIS)
CNECT.R3
Presentation for DG ECHO
30.06.2023
The Wikimedia Foundation hosts many wikis...
Wikibase is the software behind Wikidata
Architecture
Docker image
Comparison with other solutions
A repository to store structured information
about the European Union
Can be edited by humans and by bots
Wikibase hosts Wikidata, one of the largest existing KG which contains billions of triples
Projects funded by the EU
Beneficiaries of EU funds
1. Take any structured data
2. Model the data
- We need entities like buildings, offices...
- We need properties like address, opening hours, occupant...
- Whenever possible, reuse Wikidata entities/properties or other existing ones
3. Keep identifiers
Use external identifiers so that one can use them to link to other resources!
4. Import using Wikibase APIs
We always use Pywikibot
But there are alternatives...
The data imported is understandable, aligned with existing concepts, queryable, and easy to reuse
But DG REGIO moved to another building!
How to stay in line with reality?
What is it?
Similar to WikibaseImport but...
- you can run it locally
- it can sync items and properties
- local changes are not overwritten
WikibaseUpdater
- A bot based on WikibaseSync that checks that the data is synchronised
- Refreshed every 5 minutes
Query service
SPARQL endpoint
Use case: Kohesio
Transparent communication on projects co-funded by the EU
What is Kohesio?
- Cohesion funds are managed together with national and local authorities in the 27 EU member states
- The member states have a legal obligation to publish the list of projects and beneficiaries on their national websites
- The goal of Kohesio is to aggregate this data and make it publicly available in an easy, open way
Data sources
- Dozens of Excel files describing the projects
- Additional vocabularies specific to Cohesion Policy: categories of intervention, thematic objectives, etc.
- Data about geographic entities (NUTS)
- Wikidata
Enriching the data
- Translating project labels and descriptions
- Computing location on the map (geocoding)
- Inferring the NUTS region
- Linking beneficiaries to Wikidata
Building a website on top
Mostly open source
Contributing back to Wikibase (e.g. BatchIngestion)
Memorandum of understanding with WMDE
Other use cases
- with DIGIT: linked data solutions in Europe
- with OP: linking vocabularies like Eurovoc
- with RTD: closer integration with Horizon projects
- with Eurostat: Local Administrative Units (LAUs)
- with OIB: historical archives about organisations
More ideas welcome but we need to prioritise! 😊
Acknowledgements
- Dennis Diefenbach @ The QA Company
- Knowledge Management Team @ DG REGIO
- Wikimedia Deutschland (WMDE)
Data Oriented Services (DORIS)