Wikidata and Wikibase
Activity report
Max De Wilde
Data Oriented Services (DORIS)
CNECT.R3
@ CRDM coordination group
30.06.2023
The Wikimedia Foundation hosts many wikis...
Wikibase is the software behind Wikidata
Architecture
Docker image
Comparison with other solutions
A repository to store structured information about the European Union
Can be edited by humans and by bots
Wikibase powers Wikidata, one of the largest existing knowledge graphs (KGs), which contains billions of triples
Projects funded by the EU
Beneficiaries of EU funds
1. Take any structured data
2. Model the data
- We need entities like buildings, offices...
- We need properties like address, opening hours, occupant...
- Whenever possible, reuse Wikidata entities/properties or other existing ones
3. Keep identifiers
Keep external identifiers so that the imported items can be linked to other resources!
4. Import using Wikibase APIs
We always use Pywikibot
But there are alternatives...
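The four steps above can be sketched in a few lines of Python. This is a minimal illustration, not the import script used for the project: the actual upload is done with Pywikibot, whereas this sketch stops before any network call and only builds the JSON document that the Wikibase `wbeditentity` API action accepts in its `data` parameter. The property IDs (`P1`, `P2`), the record values and the helper names are all hypothetical.

```python
import json

# Hypothetical property IDs: a real Wikibase instance assigns its own.
PROPS = {"address": "P1", "external_id": "P2"}


def string_claim(property_id, value):
    """Build one statement whose value is a plain string.

    External identifiers are also serialised as string values.
    """
    return {
        "mainsnak": {
            "snaktype": "value",
            "property": property_id,
            "datavalue": {"value": value, "type": "string"},
        },
        "type": "statement",
        "rank": "normal",
    }


def build_entity(label, statements, lang="en"):
    """Turn a flat record into a Wikibase entity document (steps 2 and 3)."""
    return {
        "labels": {lang: {"language": lang, "value": label}},
        "claims": [
            string_claim(PROPS[name], value)
            for name, value in statements.items()
        ],
    }


# Step 1: any structured record (made-up values, for illustration only).
record = {
    "name": "Example building",
    "address": "1 Example Street",
    "external_id": "EX-0001",
}

entity = build_entity(
    record["name"],
    {"address": record["address"], "external_id": record["external_id"]},
)

# Step 4: this string is what an API client would send as the `data` parameter.
payload = json.dumps(entity)
```

Keeping the external identifier as its own statement is what later makes it possible to link the item back to the source system.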
The imported data is understandable, aligned with existing concepts, queryable, and easy to reuse
But DG REGIO moved to another building!
How to stay in line with reality?
What is it?
Similar to WikibaseImport but...
- you can run it locally
- it can sync items and properties
- local changes are not overwritten
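The rule "local changes are not overwritten" can be illustrated with a small merge function. This is only a sketch of the idea, not how WikibaseSync is implemented: the function name, the flat dictionary representation of statements and the example values are assumptions made for illustration.

```python
def sync_statements(local, upstream, locally_edited):
    """Copy upstream values into the local item, except where a local edit exists.

    local, upstream: dicts mapping a property name to its value.
    locally_edited: set of property names changed by local editors.
    """
    merged = dict(local)
    for prop, value in upstream.items():
        if prop in locally_edited:
            # A local editor touched this property: keep the local value.
            continue
        merged[prop] = value
    return merged


# Made-up values: the upstream address changed, the opening hours were edited locally.
local = {"address": "Building A", "opening hours": "09:00-17:00"}
upstream = {"address": "Building B", "opening hours": "08:00-18:00"}

merged = sync_statements(local, upstream, locally_edited={"opening hours"})
```

Here the address follows the upstream change, while the locally edited opening hours are left alone.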
WikibaseUpdater
- A bot based on WikibaseSync that checks that the data is synchronised
- Refreshed every 5 minutes
Query service
SPARQL endpoint
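As an illustration of how the SPARQL endpoint can be used, the sketch below builds a query and the corresponding GET request URL using only the standard library. The endpoint address and the property ID `P1` are placeholders, and the query assumes the endpoint predefines the `wdt:` prefix, as the Wikibase query service normally does. No request is actually sent.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Placeholder address: replace with the real SPARQL endpoint of the instance.
ENDPOINT = "https://example.org/sparql"


def build_query(property_id, limit=10):
    """Return a SPARQL query listing items that have a value for one property."""
    return (
        "SELECT ?item ?value WHERE {\n"
        f"  ?item wdt:{property_id} ?value .\n"
        "}\n"
        f"LIMIT {limit}"
    )


def build_request_url(query):
    """Encode the query as the `query` parameter of a GET request."""
    return f"{ENDPOINT}?{urlencode({'query': query})}"


query = build_query("P1")  # P1 is a hypothetical property ID
url = build_request_url(query)

# Decoding the URL again gives back the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]
```

A client would typically send this URL with an `Accept: application/sparql-results+json` header to get the results as JSON.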
Use cases
- with OP: linking more vocabularies like EuroVoc
- with RTD: closer integration with Horizon projects
- with Eurostat: Local Administrative Units (LAUs)
- with OIB: historical archives about organisations
- with GROW: platform for single market obstacles
- with VLOCA (Flanders): open city architecture KB
More ideas welcome but we need to prioritise! 😊
Acknowledgements
- Dennis Diefenbach @ The QA Company
- SEMIC Team @ DIGIT and PwC
- Knowledge Management Team @ DG REGIO
- Wikimedia Deutschland (WMDE)