Dissecting data workflows using SHACL
The DiSHACLed project aims to increase the efficiency of the data intermediaries within the European data ecosystem. DiSHACLed will generate the framework, standards and tooling to replace today’s mostly manual processes to discover and integrate external datasets in a given business or research context with semi-automated algorithms.
The key to establishing semantic interoperability is the use of standardized data models when data are registered. To support the screening of datasets, they can be described by their ‘shape’, i.e. the applied datastructure, through the Shapes Constraints Language (SHACL).
Until now, it was not possible to search for datasets that (partially) adhere to a certain expected minimal shape of data elements and relations. Given the fast growing number of open data sets, this is a major hurdle to find machine supported, suitable datasets candidates that may enrich a given dataset in a certain business domain and allow to activate the data in the business context in an efficient way.
Flanders, through initiatives like OSLO (Open Standards for Linking Organisations), has demonstrated strong leadership in semantic interoperability and data governance, establishing over 134 semantic standards that align with European vocabularies. These efforts have positioned Flanders and its Data Sharing Service Providers (DSSPs) as pioneers in using SHACL to define application profiles. Building on this foundation, they aim to turn their expertise into business value, in line with the goals of the EU’s Data Governance Act.
The DiSHACLed project aims to enhance data discovery, tool interoperability, and automated form generation within the European data ecosystem. Aligned with the DGA, the project strengthens Flemish DSSPs by harnessing SHACL to develop scalable, efficient solutions for data governance. By bringing together industry and research partners, DiSHACLed contributes to the broader European data technology ecosystem, advancing the next generation of data governance practices.
DiSHACLed focuses on three major research goals related to data governance:
DiSHACLed has broad applications across various domains, including:
While the project focuses on technological advancements, it ensures compliance with data governance regulations. By promoting data interoperability and automation, DiSHACLed aims to reduce manual effort, increase data accessibility, and enhance trust in data sharing practices.
“DiSHACLed seeks to not only streamline manual processes but also contribute to the European datatech ecosystem by proposing efficient, scalable solutions with wide-reaching implications for data intermediaries, businesses, and citizens.”
DiSHACLed addresses critical challenges in data discovery, tool interoperability, and form automation, aiming to set a new standard for efficient data workflows.
DiSHACLed is an imec.icon research project funded by imec and Agentschap Innoveren & Ondernemen (VLAIO).
The project started on 01.03.2025 and is set to run until 30.02.2027.