Noticia

Gaia-X represents an innovative paradigm for linking data more closely to the technological infrastructure underneath, so as to ensure the transparency, origin and functioning of these resources. This model allows us to deploy a sovereign and transparent data economy, which respects European fundamental rights, and which in Spain will take shape around the sectoral data spaces (C12.I1 and C14.I2 of the Recovery, Transformation and Resilience Plan). These data spaces will be aligned with the European regulatory framework, as well as with governance and instruments designed to ensure interoperability, and on which to articulate the sought-after single data market.

In this sense, Gaia-X interoperability nodes, or Gaia-X Digital Clearing House (GXDCH), aim to offer automatic validation services of interoperability rules to developers and participants of data spaces. The creation of such nodes was announced at the Gaia-X Summit 2022 in Paris last November. The Gaia-X architecture, promoted by the Gaia-X European Association for Data & Cloud AISBL, has established itself as a promising technological alternative for the creation of open and transparent ecosystems of data sets and services.

These ecosystems, federated by nature, will serve to develop the data economy at scale. But in order to do so, a set of minimum rules must be complied with to ensure interoperability between participants. Compliance with these rules is precisely the function of the GXDCH, serving as an "anchor" to deploy certified market services. Therefore, the creation of such a node in Spain is a crucial element for the deployment of federated data spaces at national level, which will stimulate development and innovation around data in an environment of respect for data sovereignty, privacy, transparency and fair competition.

The GXDCH is defined as a node where operational services of an ecosystem compliant with the Gaia-X interoperability rules are provided. Operational services" should be understood as services that are necessary for the operation of a data space, but are not in themselves data sharing services, data exploitation applications or cloud infrastructures. Gaia-X defines six operational services, of which at least two must be part of the mandatory nodes hosting the GXDCHs:

Mandatory services

  • Gaia-X Registry: Defined as an immutable, non-repudiable, distributed database with code execution capabilities. Typically it would be a blockchain infrastructure supporting a decentralised identity service ('Self Sovereign Identity') in which, among others, the list of Trust Anchors or other data necessary for the operation of identity management in Gaia-X is stored.
  • Gaia-X Compliance Service or Gaia-X Compliance Service: Belongs to the so-called Gaia-X Federation Services and its function is to verify compliance with the minimum interoperability rules defined by the Gaia-X Association (e.g. the Trust Framework).

Optional services

  • Self-Descriptions (SDs) or Wizard Edition Service: SDs are verifiable credentials according to the standard defined by the W3C by means of which both the participants of a Gaia-X ecosystem and the products made available by the providers describe themselves. The aforementioned compliance service consists of validating that the SDs comply with the interoperability standards. The Wizard is a convenience service for the creation of Self-Descriptions according to pre-defined schemas.
  • Catalogue: Storage service of the service offer available in the ecosystem for consultation.
  • e-Wallet: For the management of verifiable credentials (SDs) by participants in a system based on distributed identities.
  • Notary Service: Service for issuing verifiable credentials signed by accreditation authorities (Trust Anchors).

What is the Gaia-X Compliance Service (i.e. Compliance Service)?

The Gaia-X Compliance Service belongs to the so-called Gaia-X Federation Services and its function is to verify compliance with the minimum interoperability rules defined by the Gaia-X Association. Gaia-X calls these minimum interoperability rules (Trust Framework). It should be noted that the establishment of the Trust Framework is one of the differentiating contributions of the Gaia-X technology framework compared to other solutions on the market. But the objective is not just to establish interoperability standards, but to create a service that is operable and, as far as possible, automated, that validates compliance with the Trust Framework. This service is the Gaia-X Compliance Service.

The key element of these rules are the so-called "Self-Descriptions" (SDs). SDs are verifiable credentials according to the standard defined by the W3C by which both the participants of a data space and the products made available by the providers describe themselves. The Gaia-X Compliance service validates compliance with the Trust Framework by checking the SDs from the following points of view:

  • Format and syntax of the SDs
  • Validation of the SDs schemas (vocabulary and ontology)
  • Validation of the cryptography of the signatures of the issuers of the SDs
  • Attribute consistency
  • Attribute value veracity.

Once the Self-Descriptions have been validated, the compliance service operator issues a verifiable credential that attests to compliance with interoperability standards, providing confidence to ecosystem participants. Gaia-X AISBL provides the necessary code to implement the Compliance Service and authorises the provision of the service to trusted entities, but does not directly operate the service and therefore requires the existence of partners to carry out this task.

 

 

calendar icon
Blog

The Spanish Hub of Gaia-X (Gaia-X Hub Spain), a non-profit association whose aim is to accelerate Europe's capacity in data sharing and digital sovereignty, seeks to create a community around data for different sectors of the economy, thus promoting an environment conducive to the creation of sectoral data spaces. Framed within the Spain Digital 2026 strategy and with the Recovery, Transformation and Resilience Plan as a roadmap for Spain's digital transformation, the objective of the hub is to promote the development of innovative solutions based on data and artificial intelligence, while contributing to boosting the competitiveness of our country's companies.

The hub is organized into different working groups, with a specific one dedicated to analyzing the challenges and opportunities of data sharing and exploitation spaces in the tourism sector. Tourism is one of the key productive sectors in the Spanish economy, reaching a volume of 12.2% of the national GDP.

Tourism, given its ecosystem of public and private participants of different sizes and levels of technological maturity, constitutes an optimal environment to contrast the benefits of these federated data ecosystems. Thanks to them, the extraction of value from non-traditional data sources is facilitated, with high scalability, and ensuring robust conditions of security, privacy, and thus data sovereignty.

Thus, with the aim of producing the first X-ray of this dataspace in Spain, the Data Office, in collaboration with the Spanish Hub of Gaia-X, has developed the report 'X-ray of the Tourism Dataspace in Spain', a document that seeks to summarize and highlight the current status of the design of this dataspace, the different opportunities for the sector, and the main challenges that must be overcome to achieve its deployment, offering a roadmap for its construction and deployment.

Why is a tourism data space necessary?

If something became clear after the outbreak of the COVID-19 pandemic, it is that tourism is an interdependent activity with other industries, so when it was paused, sectors such as mobility, logistics, health, agriculture, automotive, or food, among others, were also affected.

Situations like the one mentioned above highlight the possibilities offered by data sharing between sectors, as they can help improve decision-making. However, achieving this in the tourism sector is not an easy task since deploying a data space for this sector requires coordinated efforts among the different parts of society involved.

Thus, the objective and challenge is to create intelligent "spaces" capable of providing a context of security and trust that promotes the exchange and combination of data. In this way, and based on the added value generated by data, it would be possible to solve some of the existing problems in the sector and create new strategies focused on better understanding the tourist and, therefore, improving their travel experience.

The creation of these data sharing and exploitation spaces will bring significant benefits to the sector, as it will facilitate the creation of more personalized offers, products, and services that provide an enhanced and tailored experience to meet the needs of customers, thus improving the capacity to attract tourists. In addition, it will promote a better understanding of the sector and informed decision-making by both public and private organizations, which can more easily detect new business opportunities.

Challenges of security and data governance to take advantage of digital tourism market opportunities

One of the main obstacles to developing a sectoral data space is the lack of trust in data sharing, the absence of shared data models, or the insufficient interoperability standards for efficient data exchange between different existing platforms and actors in the value chain.

Moving to more specific challenges, the tourism sector also faces the need to combine B2B data spaces (sharing between private companies and organizations) with C2B and G2B spaces (sharing between users and companies, and between the public sector and companies, respectively). If we add to this the ideal need to land the tourism sector's datasets at the national, regional, and local levels, the challenge becomes even greater.

To design a sector data space, it is also important to take into account the differences in data quality among the aforementioned actors. Due to the lack of specific standards, there are differences in the level of granularity and quality of data, semantics, as well as disparity in formats and licenses, resulting in a disconnected data landscape.

Furthermore, it is essential to understand the demands of the different actors in the industry, which can only be achieved by listening and taking notes on the needs present at the different levels of the industry. Therefore, it is important to remember that tourism is a social activity whose focus should not be solely on the destination. The success of a tourism data space will also rely on the ability to better understand the customer and, consequently, offer services tailored to their demands to improve their experience and incentivize them to continue traveling.

Thus, as stated in the report prepared by the Data Office, in collaboration with the Spanish hub of Gaia-X, it is interesting to redirect the focus and shift it from the destination to the tourist, in line with the discovery and generation of use cases by SEGITTUR. While it is true that focusing on the destination has helped develop digital platforms that have driven competitiveness, efficiency, and tourism strategy, a strategy that pays the same attention to the tourist would allow for expanding and improving the available data catalogs.

Measuring the factors that condition tourists' experience during their visit to our country allows for optimizing their satisfaction throughout the entire travel circuit, while also contributing to creating increasingly personalized marketing campaigns, based on the analysis of the interests of different market segments.

Current status of the construction of the Spanish Tourism data space and next steps

The lack of maturity of the market in the creation of data spaces as a solution makes an experimental approach necessary, both for the consolidation of the technological components and for the validation of the different facets (soft infrastructure) present in the data spaces.

Currently, the Tourism Working Group of the Spanish Gaia-X Hub is working on the definition of the key elements of the tourism data space, based on use cases aligned with the sector's challenges. The objective is to answer some key questions, using existing knowledge in the field of data spaces:

  • What are the key characteristics of the tourism environment and what business problems can be addressed?
  • What data-oriented models can be worked on in different use cases?
  • What requirements exist and what governance model is necessary? What types of participants should be considered?
  • What business, legal, operational, functional, and technological components are necessary?
  • What reference technology architecture can be used?
  • What development, integration, testing, and technology deployment processes can be employed?

 

 

calendar icon
Blog

The promotion of the so-called data economy is one of the main priorities on which the European Union, in general, and Spain, in particular, are currently working. Having a single digital market for data exchange is one of the keys to achieving this momentum among the Member States, and data spaces come into play for this purpose.

Data Spaces Business Alliance (DSBA)

A data space is an ecosystem capable of realising the voluntary sharing of data among its participants, while respecting their sovereignty over it, i.e. being able to set the conditions for its access and use. The DSBA, founded in 2021, is composed of the main actors in the definition of standards, models and technological frameworks for the construction and operation of data spaces. Specifically, the alliance is composed of the Big Data Value Association (BDVA), the FIWARE Foundation and the Gaia-X European Association for Data and Cloud AISBL and the International Data Spaces Association (IDSA). The purpose of the alliance is to agree on a common technological framework that avoids technological fragmentation of the activity, as well as harmonisation in messaging and dissemination activities.

Technical Convergence Discussion Document

On the technological side, the DSBA published in September 2022 a first approximation of the desired technological convergence.

The document technically analyses the anchor points for creating trusted data spaces, federated catalogues and shared markets, and the ability to define data use policies (based on the use of a common language). This ability to share while respecting the sovereignty of the data owner is what makes these spaces novel and truly disruptive, offering for the first time technical elements with which to control the risks associated with information sharing.

The document explains, step by step and from a purely technical perspective, the actions to be addressed by each of the identified roles, with the purpose of guiding potential scenarios that could occur in reality.

To achieve technical convergence, the partnership agreed on the development of a minimum viable framework (MVF) based on three pillars:

  • Interoperability in data exchange through the use of the standard NGSI-LD data exchange protocol/API and the extended Smart Data Models for the adoption of the information model defined by the IDS architecture.
  • Sovereignty and trust in data exchange through the adoption of a decentralised model (Self-Sovereign Identity) as proposed by Gaia-X (with its Trust Framework) that would use the DLT (Distributed Ledger Technologies) promoted by the European Commission (EBSI). The result will be a trust environment compatible with the EU's eIDAS 2.0 regulation.
  • Value creation (brokering) services consisting of a decentralised data catalogue and trading services based on TM Forum standards.

The alliance believes that this MVF would be a good starting point from which to work towards the desired technological convergence, counting and reusing parts of the current solutions provided by the different suppliers.

Example of a public data marketplace

The paper gives the example of a data service provider offering its service in a public data marketplace, so that consumers can easily access this offer. In addition, providers can also delegate access to their users to modify attributes of the service they contract.

This is an example that can be seen in detail in the document, which is interesting because different authentication systems, security and access policies and, in short, different systems that must interoperate with each other come into play.

Additionally, an example of integration between the Data Marketplace and a data catalogue is presented using the approach followed by the European Horizon 2020 project 'Digital Open Marketplace Ecosystem' (DOME). In this way, offers are created in the shared catalogue and can be subsequently consulted following the defined access policies.

The future of the DSBA

The DSBA considers that the aforementioned MVF is only the first step towards the convergence of the different existing architectures and technologies in the construction of data spaces. The next steps of the alliance will take into account the roles assigned to each of the participants. More specifically:

  • IDSA: Develops data space architectures and standards. In particular, a model for connectors to ensure sovereign data sharing in a scalable way.

  • Gaia-X: Develops and deploys an architecture, a governance model according to business specifications for sectoral data spaces, as well as a toolkit (Gaia-X Federation Services toolkit) to instantiate interoperability, composability and transparency of infrastructure and cloud data services.

  • FIWARE: With a technology stack that comes from the world of Digital Twins, the community develops software components that allow to implement the construction of data spaces.

The DSBA has also set itself the following priority objectives:

  • The compatibility of the IDS architecture with an identity management mechanism based on decentralised identifiers.
  • The integration of a federated catalogue such as the one proposed with the metadata broker proposed in the IDS architecture.
  • The definition of a common vocabulary.
  • Advancing jointly with the work of the Data Space Support Center (a programme funded by the European Commission, where these associations play a leading role), as well as with the standardisation efforts based on the EC's Smart Middleware Platform (SIMPL) project.

Content prepared by Juan Mañes, expert in Data Governance.

The contents and views expressed in this publication are the sole responsibility of the author.

calendar icon
Noticia

On 20 October, Madrid hosted a new edition of the Data Management Summit Spain, an international summit that this year also took place in Italy (7 July) and Latam (20 September). The event brought together CiOs, CTOs, CDOs, Business Intelligence Officers and Data Scientists in charge of implementing emerging technologies in order to solve new technological challenges aligned with new business opportunities.

This event was preceded by a prologue held the previous evening, in collaboration with DAMA Spain and the Data Office. This was a session aimed exclusively at representatives of different levels of public administration and focused on open data and information sharing between administrations. During the day, participants discussed the transformative role of data and how its intensive use and enhancement are essential to achieve the digital transformation of public administrations.

As was mentioned in the session, data plays an essential role in the development of disruptive technologies such as Artificial Intelligence, and is a differential factor when it comes to launching an industrial and technological revolution that allows for the consolidation of a fairer, more inclusive digital economy in line with the SDGs and the 2030 Agenda. A true data economy with the vocation to nurture the development of two key and strategic processes for the reconstruction of our country: the digital transformation and the ecological transition.

Data spaces and open data, key to achieving data-driven government

The institutional opening was given by Carlos Alonso, Director of the Data Office Division. His speech focused on highlighting how the achievement of a data-oriented administration, an inseparable part of its digital transformation process, depends on the development of public sector data spaces, which enable data sharing with sovereignty and its large-scale exploitation. Data is a public good, to be preserved and processed in order to implement quality public services and policies. The aim is to achieve a data-driven, citizen-centred, open, transparent, inclusive, participatory and egalitarian administration, ensuring ethical, secure and responsible use of data.

In this process of designing public and private sector data spaces, open data is fundamental, as Carlos Alonso highlighted during his speech: "Data spaces are major consumers and generators of open data, and their availability must be ensured. That is, it is necessary to establish certain service level agreements to ensure access".

Sharing experiences between administrations

After this institutional opening, the conference addressed the opportunity provided by the creation of spaces for sharing and exploiting data in public administrations, and allowed for the dissemination of different data-related projects by representatives of the different administrative levels, including autonomous communities and local entities.

Andreu Francisco, Director General of the Localret consortium, formed by the local administrations of Catalonia for the development of telecommunications services and networks and the application of ICT, presented a digital metamodel, which aims to structure the technological architecture and services required in a digital city. It is a comprehensive solution that can be implemented in different territories and personalised according to the singularities of each city, making it easier for the inhabitants of the 877 Catalan municipalities to develop professionally and personally.

César Priol, Director General of Digitalisation and Citizen Services, of Bizkaiko Foru Aldundia (Basque Country) shared his experience in the creation of the Data Office, highlighting the importance of self-organising on an organisational, regulatory and legal level in order to have the capacity to transform not only the organisation, but also the territory with data.

Magda Lorente, Head of the Local Information Systems Assistance Section, and Sara Aguilar, Head of the Service of the Official Gazette of the Province of Barcelona and other Official Publications of the Barcelona Provincial Council, spoke of good practices in data management. Magda Lorente highlighted the importance of the Diputacions becoming data-oriented in order to assume their relevant role in the promotion of municipal data governance. According to a study carried out by the Diputació de Barcelona, which will be published at the end of November, 85% of municipalities could be left out of artificial intelligence and intelligent administration because their technical capacities do not allow them to materialise the necessary data orientation.

Sara Aguilar, for her part, presented an example of how the administrations are consolidating the way in decision-making based on quality data: the CIDO, a search engine for official information and documentation. This tool was created in 2000 with the aim of bringing government information closer to citizens in a user-friendly way. It provides access, for example, to more than 2,600 selection processes with open calls for applications and 1,600 open subsidies, thanks to the open data offered by the different municipalities of Catalonia. CIDO is based on a tag reader model and the use of artificial intelligence algorithms, which classify the information collected from the municipalities. They have more than 2 million ads, structured and documented open data that they serve through an API that can be integrated into any platform.

Roundtables and group dynamics to promote debate

During the course of the day, attendees were able to participate in different dynamics for the exchange of experiences. The first dynamic focused on open data and the second on interoperability.

In addition, two round tables were held, which allowed the subject to be approached from different points of view:

  • The first round table, moderated by Carlos Alonso, focused on the challenges and barriers to data exchange in the public sector. Current methodologies, specifications and practices related to the processing of information, in order to achieve a fluid and continuous exchange of data between administrations, industrial sectors and citizens, were projected on a larger scale. The round table was attended by: Carlos Alonso, Jose Antonio Eusamio (General Secretariat for Digital Government), César Priol (Vizcaya Provincial Council), Miguel Angel Martinez Vidal (INE) and Magda Lorente.
  • The second round table focused on how to accelerate the adoption of Open Linked Data in the public administration domain, moderated in this case by Oscar Alonso (IBM Consulting & DAMA Spain). Participants included Sara Aguilar (Barcelona Provincial Council), Oscar Alonso (DAMA Spain & IBM), Sonia Castro (datos.gob.es), Juan José Alonso (Orange) and Olga Quiros (ASEDIE). The conversation revolved around EU initiatives, such as the Data Governance Act, which are acting as a turning point in data policies. The act seeks to establish robust mechanisms to facilitate the re-use of certain categories of protected public sector data, increase trust in data brokering services and promote data altruism across the EU. This highlights how the EU is working to strengthen various data sharing mechanisms to promote the availability of data that can be used to drive advanced applications and solutions in artificial intelligence, personalised medicine, green mobility, smart manufacturing and many other areas. The importance of data ethics was also highlighted during the debate.

Materials available on the day

If you missed the session, the video is available on Youtube. The recording of the summit on the 20th has also been made public, a session that had a more business-oriented approach, with expert presentations and group dynamics focused on data governance, data quality, master data and data architecture, among other topics. Photos from the event are also available.

In addition, interviews with some of the speakers have been published on the summit's website, allowing a deeper insight into the projects they are carrying out.

calendar icon
Noticia

On 25 October, the workshop "Use cases of the Spanish Tourism data space" was held, with the aim of starting the design and deployment of the first interoperable Tourism dataspace at European level, built under the Gaia-X seal.

This activity, organised by the Data Office in collaboration with the Secretary of State for Tourism, through SEGITTUR, is part of the public-private collaboration model for the constitution of sectoral data spaces promoted by the Secretary of State for Digitalisation and Artificial Intelligence of the Ministry of Economic Affairs and Digital Transformation. The event is a continuation of the process of revitalisation of national data spaces that began with the event "Governmental impulse to a data-driven reindustrialisation - Spanish Tourism Data Space", and is part of the actions carried out by the Government of Spain to create the legal, political, technological and funding environment conducive to the deployment of the data economy, as detailed in the Digital Spain 2026 strategy. This action is aligned with the European scene.

Why a tourism data space?

Data is a vital element in the development and sustainability of the tourism sector. Enabling its proper valorisation reinforces the already significant contribution of the sector to the national economy. The place to realise this latent value is the data space.

According to their European meaning, data spaces are ecosystems where different actors share data voluntarily and securely. In this way, the sharing of data between participants is realised while respecting the principles of self-determination in data sharing (sovereignty), privacy, transparency, security and fair competition.

Tourism, given its ecosystem of public and private participants of different sizes and technological maturity, is an optimal environment in which to test the benefits of these federated data ecosystems. They facilitate the extraction of value from non-traditional data sources, with high scalability, and guaranteeing in any case robust conditions of security, privacy and therefore data sovereignty.

The high availability of tourism data brings great benefits to the sector, as it facilitates the creation of more personalised offers, products and services that provide an improved experience tailored to the needs of customers, thus improving the ability to attract tourists. It also fosters greater knowledge of the sector and informed decision-making by both public and private organisations, which can more easily identify new business opportunities.

An eminently practical workshop

During the workshop, the more than one hundred participants, from both the tourism and technology sectors, tackled the detailed characterisation of different data sharing use cases that could satisfy specific business needs in the tourism sector. The participants were organised around different working groups considering the maturity of the use cases proposed, seeking to encourage the variety of profiles and the grouping of interests. The different work dynamics were carried out in accordance with the methodology for the discovery and design of use cases developed by the Data Office, allowing the validation of their content and their scalability to the rest of the sectoral data spaces to be created.

During the final conclusions of the conference, the Data Office emphasised the central role of data spaces as a place to create new business opportunities within the tourism sector. It was also highlighted that the synergies derived from data sharing go beyond the sum of individual data, and investments, duly supported by public-private collaboration, must be seen beyond the short term and need to be backed by management. The problem to be solved is a business problem and the solution is not exclusively technological. Simple solutions must be sought that are easy to adopt and use, with robust, agile and simple deployment that do not generate barriers to entry for small participants. Taking advantage of the current technological, economic, political and social situation, it is possible to develop a tourism sector that is data-oriented, sustainable, generates social value and is inclusive.

In this link you can see the presentation of the workshop, which includes, among others, methodological aspects for the creation of data spaces, such as the characteristics or the elements to be considered.Below you can also access several infographics:

  • Tourism Data Spaces and the Gaia-X Model
  • Use case sheet for data sharing
  •  Identification and proposal of use cases in tourism
  • Elements involved in the data sharing process
calendar icon
Blog

Many organisations and administrations have found in open data a transformational pillar on which to exercise the strategy towards the data culture. Having access to data in a structured way is the basis for new business models, as well as for new initiatives aimed at citizens in different fields of action.

However, realising the full potential of open data requires a platform capable of making this data available to third parties while ensuring its quality, understanding, privacy and security.

In this context, the book Designing Data Spaces”, includes a chapter by Fabian Kirstein and Vincent Bohlen, which proposes the use of the IDS-RAM architecture proposed by the International Data Spaces (IDS) for the development of open data ecosystems. It provides a proof of concept on the feasibility of the IDS architecture for public data spaces with the aim of achieving a solid foundation for building and maintaining interoperable open data ecosystems capable of addressing existing challenges.

The following is a summary of the views gathered in the chapter.

Open data ecosystems

Data spaces are ecosystems where different actors share data voluntarily and securely, following common governance, organisational, regulatory and technical mechanisms.

IDSA (International Data Spaces Association) was created in 2016, with the aim of boosting the global digital economy, through a secure and sovereign system of data exchange in which all participants can obtain the maximum value from their data. It is a coalition of more than 130 international companies with representation in more than 20 countries around the world.

Among other initiatives, it promotes an architectural reference model called IDS-RAM, which aims to facilitate the exchange of data to optimise its value, but without losing control. It offers several approaches whose applicability can be understood in the context of both private and open data, as it is based on metadata repositories for sharing information. That is, the data remain under the control of its owners and the standardised metadata are centrally managed for sharing.

The creation of data spaces brings with it a number of risks to be addressed, both from the consumer's and the provider's point of view. Data providers focus on legal compliance, through issues such as data ownership. Although common standards exist for aspects such as metadata description - the World Wide Web Consortium is no stranger to the problem and therefore proposed several years ago its Data Catalog Vocabulary (DCAT), a standard for describing data catalogues - the truth is that interoperability is sometimes far from its greatest potential. This is because there is sometimes incomplete metadata, the quality is poor, data is outdated, there are difficulties in accessing data and interoperating, and so on.

The applicability of IDS-RAM in open data environments

IDS offers an approach based on guaranteeing data sovereignty to providers, facilitating data exchange and addressing the concerns of both consumers and providers.

The concepts and technologies underlying open data and IDS-RAM are very similar. Both initiatives rely on metadata repositories to share information about the availability and accessibility of data. These repositories store metadata, without the need to transfer the actual data. Therefore, both concepts follow the principles of decentralisation and transfer of metadata to and from central information access points. The actual data remains under the control of the data publisher's infrastructure until a user requests it. In addition, the IDS information model is based on the principles of Linked Data and DCAT. This makes it a system that is easily compatible with open data portals, driving interoperability between data spaces and open data portals.

The architecture proposed by IDS is mainly based on two artefacts, a connector to data sources (Open Data Connector) and a metadata store (Open Data Broker), as shown in the following image extracted from the book "Designing Data Spaces":

Figure illustrating the IDS-RAM architecture, explained below.

  • Open Data Connector: adopts the role of open data provider. Each publishing entity applies an instance of the connector to announce availability and grant access to the data. As it is open data, and therefore public, it is not necessary to apply usage policies or restrictions as strict as in the case of other private data connectors based on this architecture, which allows for easier configuration and management a priori.
  • Open Data Broker: the centralised metadata repository fulfils a similar function to that of an open data portal. From this metadata, the portal interface offers functionalities to locate and download the data from the connectors.

This management allows grouping by different application domains, i.e. centralised metadata repositories can be created for sectors such as health or tourism, as well as at municipal, regional, national or international level.

In a data ecosystem such as the one proposed by IDS, the connector informs about available or updated data, and in the metadata repository these are updated accordingly. For this purpose, communication mechanisms based on the IDS information model and the IDS Communication Protocol (IDS Communication Protocol or IDSCP) are used to announce possible changes in data availability. In this way, the availability of up-to-date data is guaranteed.

In open data portals that collect a large number of data sources, accessibility and overall usability depend on the metadata provided by the original data providers. Standards such as DCAT provide a common basis, but IDS offers more stringent specifications in the communication process.

Although it is an interesting proposal, in the context of open data, this approach has not yet been implemented in any space. However, proofs of concept have already been made, as can be seen in the Public Data Space, a showcase available since December 2020 that reproduces how the solution works. Here, connectors display the open data offerings of different data portals in Germany and are registered in a metadata repository.

The following image shows the workflow of an IDS-RAM-based model versus a more traditional approach:

Picture showing graphically the difference between the traditional data flow and the IDS proposal, explained below.

Conclusions

Open data portals provide access to open data from a variety of providers. The overall usability of these portals depends to some extent on the discoverability of the data, which in turn depends on the quality of their metadata.

To counteract the problems of unavailable data or dead links that sometimes occur in open data environments, portals periodically collect the publisher's data catalogues and perform availability checks. In the IDS-RAM-based open data ecosystem, the connector informs the broker about available or updated datasets. The 'pull' accountability approach that is common in the typical open data environments is reversed to a 'push' approach in the IDS ecosystem. This approach focuses on the responsibility of the publisher to maintain the data supply and also presents new possibilities to control its dissemination. Using IDS-RAM, the publisher chooses which metadata broker it signs up to, giving it greater sovereignty over their data.

For data consumers, this approach can lead to improvements in terms of the ability to find data in a timely manner and reduces fragmentation. Moreover, if open data can be acquired, managed and processed with the same tools and applications that are already applied in industry, the possibilities for integration and reuse multiply.


Content prepared by Juan Mañes, expert in Data Governance, with contributions from the Data Office.

The contents and views expressed in this publication are the sole responsibility of the author.

 

calendar icon
Blog

For a data space to function properly, it is necessary to have sufficient actors to cover a set of roles and a set of technological components. These elements enable a common governance framework to be established for secure data sharing, ensuring the sovereignty of the participants over their own data. This concept, data sovereignty, can be defined as the ability of the data owner to set the policies for the use and access of the data to be exchanged, and is the core element of a data space.

In this sense, the EU-funded report "Design Principles for Data Spaces" (April 2021) provides the fundamentals that data spaces should follow in order to act in accordance with EU values: decentralisation, openness, transparency, sovereignty and interoperability. The report was prepared by experts from 25 different companies, thus giving a consensus view with industry.

The following is a summary of some of the main contributions of the document, taking as a reference the article "Elements of a data space" published in the Boletic magazine of the Professional Association of Information Systems and Technologies. In this article, the elements of a data space are divided into two categories:

  • Roles and domains
  • Fundamental components

Roles

In a data space we can find different participants, each of them focused on a specific field of action. This is known as roles:

Visual showing the roles involved in a data space, defined below.

  1. Data providers and data consumers

These are the participants who provide and interact with the data. Within this category there are several roles:

  • Data Producer: Generates the data.
  • Data Owner: The holder of the rights to access and use the data.
  • Data Acquirer or Data Provider: Captures the data and offers it through the data space catalogue.
  • Data Consumer: Accesses the data from the catalogue.
  • Application Provider: Provides applications that allow working with the data offering added value (e.g. machine learning models, visualisations, cleansing processes, etc.).Intermediaries
  1. Intermediaries

In this case we are talking about third parties that offer the services necessary for publishing, searching for resources and registering transactions. Some examples of services offered by intermediaries are:

  • Vocabularies and ontologies, which allow information to be systematically organised, categorised or labelled, improving interoperability.
  • Application stores, which list the tools offered by application providers, ensuring that they have passed a quality control process.
  • Metadata broker services for the publication of a catalogue of resource offerings (data and applications) with as much information as possible.
  • Orchestration services, allowing the automation of various activities.
  • Clearing house, which allow to keep control of the operations carried out.
  1. Technology providers

They provide components for the data space to operate correctly, making it a secure and trusted environment. Examples of these components are the connector - a fundamental element that we will see below -, user management systems or monitoring systems.

Brokerage services and applications do not fall into this category.

  1. Data space operators

Focused on the management of the space, they carry out tasks such as the processing of requests or incidents, change control, software maintenance, etc. Among other things, they certify participants, exercise the governance of the data space and define the roadmap of functionalities.

All these roles are not exclusive, and the same user can adopt several roles.

Components of a data space

There are different approaches to the components that a data space should have. One can refer to Gaia-X and/or take as a reference the IDS-RAM architecture model (Reference Architecture model), characterised by an open, reliable and federated architecture for cross-sector data exchange.

In any case, at least the following blocks are necessary for  carrying out the activity in a secure and controlled manner:

  1. Components for accessing the data space: Connector

One of the main elements of data spaces is the connector, through which participants access the data space and the data. It is responsible for handling the data according to the usage policies defined by the owner of the access and usage rights, guaranteeing its sovereignty. To prevent malicious manipulation, connectors can be signed by a certificate provided by the data space governance, so as to guarantee their integrity and compliance with the usage rights established by the data owner.

  1. Components for intermediation

They allow for the intermediation services mentioned above, the metadata broker, the app store, etc. Of all of them, the most fundamental is the resource catalogue. In addition to a list of the available offer, it is also the tool that allows the resource provider, its characteristics and conditions of use to be located.

  1. Components for identity management and secure data exchange

These components ensure the identity of participants and the security of transactions. For this reason, participants are often required to present credentials (e.g. via X.509 certificates).

  1. Data space management components

These are tools that allow the data space to operate normally, facilitating daily operations, management of participants (registration, deregistration, revocation, suspension), monitoring of the activity, etc.

How do all these elements interact?

All these roles and components interact with each other. First, the data provider registers its data offering in the catalogue, including relevant metadata such as usage policies. The data consumer searches the catalogue for datasets and applications of interest. Once located, he/she contacts the provider, communicating which resources he wants to acquire. In this process, further negotiation of terms and conditions may take place. Once an agreement is reached, the consumer can download the data.

The transaction must be registered by both the supplier and the consumer.

Graph showing the scenario of data exchange in a data space, described above.

All these elements (roles, components and processes) allow data sharing in a secure and controlled manner, in a managed environment of trust. The aim is to enable European companies and organisations to exchange information, generating a European data market that will give rise to new products and services of value, boosting the European economy.


Content prepared by the datos.gob.es team.

calendar icon
Noticia

The Data Spaces Business Alliance (DSBA) was born in September 2021, a collaboration of four major organisations with much to contribute to the data economy: the Big Data Value Association (BDVA), FIWARE, Gaia-X and the International Data Spaces Association (IDSA). Its goal: to drive the adoption of data spaces across Europe by leveraging synergies.

How does the DSBA work?

The DSBA brings together diverse actors to realise a data-driven future, where public and private organisations can share data and thus unlock its full value, ensuring sovereignty, interoperability, security and reliability. To achieve this goal, DSBA offers support to organisations, as well as tools, resources and expertise. For example, it is working on the development of a common framework of technology agnostic blocks that are reusable across different domains to ensure the interoperability of different data spaces.

The four founding organisations, BDVA, FIWARE, Gaia-X and IDSA, have a number of international networks of national or regional hubs, with more than 90 initiatives in 34 countries. These initiatives, although very heterogeneous in focus, legal form, level of maturity, etc., have commonalities and great potential to collaborate, complement each other and create impact. Moreover, by operating at local, regional and/or national level, these initiatives provide regular feedback to European associations on the different regional policies, cultures and entrepreneurial ecosystems within the EU.

In addition, DSBA's application has been successful in the European Commission's call for the creation of a Support Centre, which will promote and coordinate actions related to sectoral data spaces. This centre will make available technologies, processes, standards and tools to support the deployment of common data spaces, thus enabling the re-use of data across sectors.

The DSBA hubs

The DSBA hubs refer to the global network combining the existing BDVA, FIWARE, Gaia-X and IDSA initiatives, as shown in the figure below.

Map showing the different organisations that are part of the DSBA

The main characteristics of each of these groups are as follows:

BDVA i-Spaces

BDVA i-Spaces are cross-sector and cross-organisational data incubators and innovation hubs, aimed at accelerating data-driven innovation and artificial intelligence in the public and private sectors. They provide secure experimentation environments, bringing together all the technical and non-technical aspects necessary for organisations, especially SMEs, to rapidly test, pilot and exploit their services, products and applications.

i-Spaces offer access to data sources, data management tools and artificial intelligence technologies, among others. They host closed and open data from corporate and public sources, such as language resources, geospatial data, health data, economic statistics, transport data, weather data, etc. The i-spaces have their own Big Data infrastructure with ad hoc processing power, online storage and state-of-the-art accelerators, all within European borders.

To become an i-Space, organisations must go through an assessment process, using a system of 5 categories, which are ranked according to gold, silver and bronze levels.  These hubs must renew their labels every two years, and these certifications allow them to join a pan-European federation to foster cross-border data innovation, through the EUHubs4Data project.

FIWARE iHubs

FIWARE is an open software community promoted by the ICT industry, which - with the support of the European Commission - provides tools and an innovation ecosystem for entrepreneurs to create new Smart applications and services. FIWARE iHubs are innovation hubs focused on creating communities and collaborative environments that drive the advancement of digital businesses in this area. These centres provide private companies, public administrations, academic institutions and developers with access to knowledge and a worldwide network of suppliers and integrators of this technology, which has also been endorsed by international standardisation bodies.

There are 5 types of iHubs:

  • iHub School: An environment focused on learning FIWARE, from a business and technical perspective, taking advantage of practical use cases.
  • iHub Lab: Laboratory where you can run tests and pilots, as well as obtain FIWARE certifications.
  • iHub Business Mentor: Space to learn how to build a viable business model.
  • iHub Community Creator: Physical meeting point for the local community to bring together all stakeholders, acting as a gateway to the local and global FIWARE ecosystem.

Gaia-X Hubs

The Gaia-X Hubs are the national contact points for the Gaia-X initiative. It should be noted that they are not as such part of Gaia-X AISBL (the European non-profit association), but act as independent think tanks, which cooperate with the association in project deployment, communication tasks, and generation of business requirements for the definition of the architecture of the initiative (as the hubs are close to the industrial projects in each country).

Through them, specific data spaces are developed based on national needs, as well as the identification of funding opportunities to implement Gaia-X services and technology. They also seek to interact with other regions to build transnational data spaces, facilitating the exchange of information and the scaling up of national use cases internationally. To this end, the AISBL provides access to a collaborative platform, as well as support to the respective hubs in the distribution and communication of the use cases.

IDSA Hubs

The IDSA Hubs enable the exchange of knowledge around the reference architecture (known as the IDS-RAM) at country level. By bringing together research organisations, innovation promotion organisations, non-profit organisations, and companies that use IDS concepts and standards in the region, they seek to foster their adoption, and thus promote a sovereign data economy with greater capillarity.

These centres are driven in each country by a university, research organisation, or non-profit entity, working with IDSA to raise awareness of data sovereignty, transfer knowledge, recruit new members, and disseminate IDS-RAM-based use cases. To this end, they develop activities ranging from training sessions to meetings with decision-makers from different public administrations. They also promote and coordinate research and development projects with international organisations and companies, as well as with governments and other public entities.

Conclusion

As we said at the beginning, there is a great potential for synergies between these groups, which should be explored, discussed and articulated in concrete actions and projects. We are facing a promising opportunity to join forces and make further progress in the development and expansion of data spaces, in order to generate a significant impact on the Data Economy.

To stimulate the initial debate, the Data Spaces Business Alliance has prepared the document "Data Spaces Business Alliance Hubs: potential for synergies and impact", which explores the situation described above.

calendar icon
Noticia

The Constituent Assembly of the Gaia-X Spain Association, whose headquarters are located in Talavera de la Reina, was held on 18 March. The event brought together more than 150 entities of all kinds, including companies, public bodies, universities and innovation institutions, with the aim of joining forces to create an open, federated and interoperable data infrastructure, following the values of digital sovereignty and data availability. The ultimate goal is to drive the transformation of strategic sectors within the framework of the Data Economy.

To advance in this area, a number of working groups have been set up. These groups seek to leverage the advantages of Gaia-X to develop sectoral data spaces where different actors share data in a voluntary and secure manner.

The Gaia-X national hub working groups

Among the various working groups that have been set up, we find both sectoral groups, as well as horizontal groups focused on enabling technologies, ethics and legislation. The aim is to develop the vision of the data space in each sector of activity, as well as to obtain common and guiding principles for all of them.

The working groups of the Gaia-X national hub facilitate the connection between companies, administrations and other entities that offer and demand data services and technologies, with the aim of obtaining an adequate development and interoperability of the sectorial data spaces. They also act as a meeting point to disseminate experiences, success stories and lessons learned.

These groups also participate in various events. One of them was Global Mobility Call, organised by IFEMA MADRID and Smobhub on 16 June to discuss sustainable mobility. It was attended by members of the mobility working group of the national Gaia-X Hub. During the session, several challenges related to the creation of data spaces came up, which, although focused on the field of mobility, can be applicable to other sectors.

4 challenges to consider

1. Moving from Theory to Practice

The creation of a data space must be done from a clear bottom-up approach, starting from concrete use cases, and making the most of the technological building blocks already available.

To this end, the first step is to identify the needs and opportunities that can be addressed by the exchange and exploitation of data within each sector. Use cases must address a specific business need. An economic model must be agreed and the responsibilities to be acquired by the actors involved, as well as an incentive scheme, must be established. Beyond the technical components, certainty is needed about the operational, legal and commercial considerations that will govern the data exchanges.

In developing the use cases, it is essential to establish the data catalogues to be exchanged and exploited, as well as the semantics of the data. It is also important that the solution created seeks interoperability with analogous solutions, as well as establishing mechanisms for continuous improvement of the quality of the data exchanged.

The use cases must demonstrate the value derived from data sharing. This can be done by taking advantage of the experimentation spaces available within the Data Spaces Business Alliance (DSBA) hubs, as well as by using agile methodologies that allow results to be visualised early on.

2. Standardisation

The recently proposed EU Data Act provides for the development of interoperability standards for the re-use of data across sectors, in an attempt to remove barriers to data exchange. In the absence of applicable standards, the Commission will adopt implementing orders in this respect.

In this regard, there is a need to change the way in which standards are generated in order to gain agility. Working group members can identify specific standardisation needs and even propose solutions that become de facto standards.

On the other hand, it is necessary to focus on the convergence of the different perspectives and approaches to the creation of data spaces that currently exist. Their development should not be conceived in isolation, or from the exclusive vision of a single association, but as holistically as possible.

3.Awareness raising and change management actions

In order to share data externally, there must be a mindset of sharing internally. This change of mindset involves complex change management where top management support is essential. A data culture needs to be embedded in organisations to maximise the value of data.

4.Communication and dynamisation actions to disseminate the model

It is also essential to develop workshops and congresses to address common problems, best practices and seek synergies of action. In order to raise awareness, it is necessary to evangelise about the profitability that data sharing generates for the sector's agents. This profitability is not only derived from its sale, but also from the generation of new products or the enrichment of existing ones, which adds value to the business.

 

These four general considerations must be nuanced by taking into account the specific needs of each sector to drive the development of effective sectoral data spaces. These spaces will facilitate the creation of innovative solutions based on data and disruptive technologies, such as artificial intelligence, helping to boost the competitiveness of companies and the advancement of society in general.

calendar icon
Noticia

The Tourism Data Space event took place on 9 June, organised by Gaia-X, the European private sector initiative for the creation of an open, federated and interoperable data infrastructure to drive the Data Economy while respecting digital sovereignty. During the event, which was held online, international experts from the public and private sector discussed "How can data spaces contribute to the development of tourism in Europe through citizen-centric offerings?”. The event was a success with more than 250 attendees from 21 countries.

The tourism sector has a strong economic weight in Europe, although it has been affected by the COVID-19 pandemic and the drop in international tourist arrivals, which exceeded 70% worldwide. In this context, Gaia-X and data spaces are positioned as a great opportunity for companies in the sector. Gaia-X aims to make European data available to improve the ability to attract tourists by creating more personalised offers, products and services, resulting in an enhanced experience tailored to customers' needs. It was with this premise in mind that the event kicked off, focusing on the requirements and need for a secure, decentralised and citizen-oriented European tourism data space.

The opening speech of the event was given by Carme Artigas, Secretary of State for Digitalisation and Artificial Intelligence of the Spanish Ministry of Economy and Digital Transformation, who highlighted the importance of the tourism sector: "“Finally, we are giving the tourism the importance it deserves also in the data economy. At the EU level, the tourism sector directly contributes to almost 4% of GDP with 2.3 million businesses, majority of which are SMEs”. This sector also employs 22.4% of the service sector workforce, as Francesco Bonfiglio, Director General of Gaia-X AISBL, commented: "This market is worth billions of euros, and is one of the areas with the greatest impact if we decide to invest in a common European data space".

Artigas also stressed that "Before the end of the year we will have a new digital space for tourism at European level, and this is great news", always respecting the basic principles of data sovereignty, privacy, security and interoperability.

Yvo Volman, Chief Data Officer at DG-CNECT (European Commission), explained that in order to achieve the set objectives, empowerment and data sharing also across sectors is essential. This is the only way to establish better services and promote sustainability. The importance of education was also stressed by Natalia Bayona, Director of Innovation, Education and Investment at the World Tourism Organisation (UNWTO): "Tourism is the main employer of women and young people. However, 50% of people working in tourism have only secondary skills. If we want to develop a high-level economic sector, we have to develop education". In her speech, she also focused on the need for a public-private relationship, with projects such as Gaia-X as a spearhead to drive innovation.

This was followed by several presentations focused on providing an overview of the landscape of the Gaia-X Tourism data space in Europe, with experts from different countries. From Spain, Ana Moniche, Senior Analyst at Turismo Andaluz and NECSTourR, and Cristina Núñez, Director of Necstour, spoke about European regional practices for competitive and sustainable tourism, highlighting how European data sharing is fundamental to develop strategies based on quality information. Data sharing also offers companies with fewer resources the possibility of accessing large amounts of data, which they would not be able to access through their own mechanisms.

Dolores Ordóñez, Director of AnySolution and Vice President of the Spanish Gaia-X Hub, also spoke in this section. In her speech, she highlighted the need for collaboration between companies of different sizes and sectors, especially in four major areas: tourism, health, industry 4.0 and mobility. In the section dedicated to the pillars of tourism data spaces, among other speakers, Alberto Palomo, CDO of the Government of Spain, pointed out the importance of generating scalability in data sharing, as well as the creation of a common framework that shapes governance mechanisms that are useful and accepted by industry players. He also warned that the paradigm we are facing is that of an "innovative decentralised infrastructure", something that all participants must be clear about, because of the cultural change it implies.

To conclude, the event was divided into 3 thematic sessions, designed to create an atmosphere of cross-border collaboration and help create a sustainable data infrastructure for the tourism industry. These sessions focused on smart destinations, the tourism value chain and its technological enablers. More information about the event can be found in the video teaser. This event is part of a series of meetings organised by Gaia-X around data spaces. Two previous events have focused on mobility and health. Gaia-X will continue to hold such activities in the coming months, as can be seen in its calendar. In addition, it has launched a magazine and a podcast series to keep up to date with the latest trends related to the data space.

calendar icon