The EU Knowledge Graph
For structured information
about the European Union
Max De Wilde
Data Oriented Services (DORIS)
CNECT.R3
@ Co-innovation iTalks series
11.05.2023
The Wikimedia Foundation hosts many wikis...
Wikibase is the software behind Wikidata
Architecture
Docker image
Comparison with other solutions
A repository to store structured information
about the European Union
Can be edited by humans and by bots
Wikibase hosts Wikidata, one of the largest existing KG which contains billions of triples
Projects funded by the EU
Beneficiaries of EU funds
1. Take any structured data
2. Model the data
- We need entities like buildings, offices...
- We need properties like address, opening hours, occupant...
- Whenever possible, reuse Wikidata entities/properties or other existing ones
3. Keep identifiers
Use external identifiers so that one can use them to link to other resources!
4. Import using Wikibase APIs
We always use Pywikibot
But there are alternatives...
The data imported is understandable, aligned with existing concepts, queryable, and easy to reuse
But DG REGIO moved to another building!
How to stay in line with reality?
What is it?
Similar to WikibaseImport but...
- you can run it locally
- it can sync items and properties
- local changes are not overwritten
WikibaseUpdater
- A bot based on WikibaseSync that checks that the data is synchronised
- Refreshed every 5 minutes
Query service
SPARQL endpoint
Use case: Kohesio
Transparent communication on projects co-funded by the EU
What is Kohesio?
- Cohesion funds are managed together with national and local authorities in the 27 EU member states
- The member states have a legal obligation to publish the list of projects and beneficiaries on their national websites
- The goal of Kohesio is to aggregate this data and make it publicly available in an easy, open way
Data sources
- Dozens of Excel files describing the projects
- Additional vocabularies specific to Cohesion Policy: categories of intervention, thematic objectives, etc.
- Data about geographic entities (NUTS)
- Wikidata
Enriching the data
- Translating project labels and descriptions
- Computing location on the map (geocoding)
- Inferring the NUTS region
- Linking beneficiaries to Wikidata
Building a website on top
Other use cases
- with OP: linking more vocabularies like Eurovoc
- with RTD: closer integration with Horizon projects
- with Eurostat: Local Administrative Units (LAUs)
- with OIB: historical archives about organisations
- with GROW: platform for single market obstacles
More ideas welcome but we need to prioritise! 😊
Acknowledgements
- Dennis Diefenbach @ The QA Company
- Knowledge Management Team @ DG REGIO
- Wikimedia Deutschland (WMDE)