Noticia

The concept of high-value datasets was introduced by the European Parliament and the Council of the European Union four years ago, in Directive (EU) 2019/1024. There, they were defined as datasets with a high potential to generate "benefits for society, the environment and the economy". Member states were therefore required to make them available free of charge, in machine-readable formats, via APIs and as bulk downloads, and comprehensively described by metadata.

Initially, the directive proposed in its annex six thematic categories to be considered as high value: geospatial, earth observation and environmental, meteorological, statistical, business records and transport network data. These categories were subsequently detailed in an implementing regulation published in December 2022. In addition, to facilitate their openness, a document with guidelines on how to use DCAT-AP for publication was published in June 2023. 
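As an illustration of what such publication might look like, here is a minimal, hypothetical DCAT-AP-style dataset description sketched in Python. All titles, URLs and values below are invented, and a real DCAT-AP record would carry many more mandatory properties:

```python
import json

# A minimal, hypothetical DCAT-AP-style dataset description.
# Property names follow the DCAT vocabulary (dct:*, dcat:*); all
# values below are invented for illustration only.
dataset = {
    "@type": "dcat:Dataset",
    "dct:title": "Meteorological observations (example)",
    "dct:description": "Hourly weather station readings.",
    "dct:license": "http://creativecommons.org/licenses/by/4.0/",
    "dcat:theme": "http://publications.europa.eu/resource/authority/data-theme/ENVI",
    "dcat:distribution": [
        {
            "@type": "dcat:Distribution",
            "dct:format": "CSV",  # a machine-readable format
            "dcat:accessURL": "https://example.org/api/observations",
        }
    ],
}

# Serialize the record for publication or exchange
print(json.dumps(dataset, indent=2))
```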

New categories of data to be considered of high value  

These initial categories were always open to extension. In this regard, the European Commission has just published the report "Identification of data themes for the extensions of public sector High-Value Datasets", which identifies seven new categories to be considered high-value data:

  • Climate loss: This refers to data related to approaches and actions needed to avoid, minimize and address damage associated with climate change. Examples of datasets in this category are economic and non-economic losses from extreme weather events or slow-onset changes such as sea level rise or desertification. It also includes data related to early warning systems for natural disasters, the impact of mitigation measures, or research data on the attribution of extreme events to climate change.

  • Energy: This category includes comprehensive statistics on the production, transport, trade and final consumption of primary and secondary energy sources, both renewable and non-renewable. Examples of data sets to consider are price and consumption indicators or information on energy security.   

  • Finance: This is information on the situation of private companies and public administrations, which can be used to assess business performance or economic sustainability, as well as to define spending and investment strategies. It includes datasets on company registers, financial statements, mergers and acquisitions, as well as annual financial reports.  

  • Government and public administration: This theme includes data that public services and companies collect to inform and improve the governance and administration of a specific territorial unit, be it a state, a region or a municipality. It includes data relating to government (e.g. minutes of meetings), citizens (census or registration in public services) and government infrastructures. These data are then reused to inform policy development, deliver public services, optimize resources and budget allocation, and provide actionable and transparent information to citizens and businesses. 

  • Health: This concept identifies data sets covering the physical and mental well-being of the population, referring to both objective and subjective aspects of people's health. It also includes key indicators on the functioning of health care systems and occupational safety. Examples include data relating to Covid-19, health equity or the list of services provided by health centers.  

  • Justice and legal affairs: Identifies datasets to strengthen the responsiveness, accountability and interoperability of EU justice systems, covering areas such as the application of justice, the legal system or public security, i.e. that which ensures the protection of citizens. The data sets on justice and legal matters include documentation of national or international jurisprudence, decisions of courts and prosecutors general, as well as legal acts and their content. 

  • Linguistic data: Refers to written or spoken expressions that are at the basis of artificial intelligence, natural language processing and the development of related services. The Commission provides a fairly broad definition of this category of data, all of which are grouped under the term "multimodal linguistic data". They may include repositories of text collections, corpora of spoken languages, audio resources, or video recordings.  

To make this selection, the authors of the report conducted desk research as well as consultations with public administrations, data experts and private companies through a series of workshops and surveys. In addition to this assessment, the study team mapped and analyzed the regulatory ecosystem around each category, as well as policy initiatives related to their harmonization and sharing, especially in relation to the creation of European Common Data Spaces. 

Potential for SMEs and digital platforms   

In addition to defining these categories, the study also provides a high-level estimate of the impact of the new categories on small and medium-sized companies, as well as on large digital platforms. One of the conclusions of the study is that the cost-benefit ratio of data openness is similar across all new topics, with those relating to the categories "Finance" and "Government and public administration" standing out in particular. 

Based on publicly available datasets, the study also estimated the current maturity of the data in the new categories, according to their territorial coverage and degree of openness (whether they were published in machine-readable formats, with adequate metadata, etc.). To maximize the overall cost-benefit ratio, the study suggests a different approach for each thematic category: depending on its level of maturity, a higher or lower number of mandatory publication criteria would apply, while avoiding overlaps between the new themes and existing high-value data.

You can read the full study at this link. 

Blog

The European Open Science Cloud (EOSC) is a European Union initiative that aims to promote open science through the creation of an open, collaborative and sustainable digital research infrastructure. EOSC's main objective is to provide European researchers with easier access to the data, tools and resources they need to conduct quality research.

EOSC on the European Research and Data Agenda

EOSC is part of the 20 actions of the European Research Area (ERA) agenda 2022-2024 and is recognised as the European data space for science, research and innovation, to be integrated with other sectoral data spaces defined in the European data strategy. Among the expected benefits of the platform are the following:

  • An improvement in the confidence, quality and productivity of European science.
  • The development of new innovative products and services.
  • Improving the impact of research in tackling major societal challenges.

The EOSC platform

EOSC is in fact an ongoing process that sets out a roadmap in which all European states participate, based on the central idea that research data is a public good that should be available to all researchers, regardless of their location or affiliation. This model aims to ensure that scientific results comply with the FAIR (Findable, Accessible, Interoperable, Reusable) Principles to facilitate reuse, as in any other data space.

However, the most visible part of EOSC is its platform that gives access to millions of resources contributed by hundreds of content providers. This platform is designed to facilitate the search, discovery and interoperability of data and other content such as training resources, security, analysis, tools, etc. To this end, the key elements of the architecture envisaged in EOSC include two main components:

  • EOSC Core: which provides all the basic elements needed to discover, share, access and reuse resources - authentication, metadata management, metrics, persistent identifiers, etc.
  • EOSC Exchange: to ensure that common and thematic services for data management and exploitation are available to the scientific community.

In addition, the EOSC Interoperability Framework (EOSC-IF) is a set of policies and guidelines that enable interoperability between different resources and services and facilitate their subsequent combination.

The platform is currently available in 24 languages and is continuously updated to add new data and services. Over the next seven years, a joint investment by the EU partners of at least EUR 1 billion is foreseen for its further development.

Participation in EOSC

The evolution of EOSC is being guided by a tripartite coordinating body consisting of the European Commission itself, the participating countries represented on the EOSC Steering Board and the research community represented through the EOSC Association. In addition, to be part of the EOSC community, one only has to follow a series of minimum rules of participation:

  1. The whole EOSC concept is based on the general principle of openness.
  2. Existing EOSC resources must comply with the FAIR principles.
  3. Services must comply with the EOSC architecture and interoperability guidelines.
  4. EOSC follows the principles of ethical behaviour and integrity in research.
  5. EOSC users are also expected to contribute to EOSC.
  6. Users must comply with the terms and conditions associated with the data they use.
  7. EOSC users always cite the sources of the resources they use in their work.
  8. Participation in EOSC is subject to applicable policies and legislation.

EOSC in Spain

The Consejo Superior de Investigaciones Científicas (CSIC) of Spain was one of the four founding members of the EOSC Association and is currently its mandated member, in charge of coordination at national level.

CSIC has been working for years on its open access repository DIGITAL.CSIC as a step towards its future integration into EOSC. Within its work in open science, we can highlight, for example, the adoption of Current Research Information Systems (CRIS), information systems designed to help research institutions collect, organise and manage data on their research activity (researchers, projects, publications, patents, collaborations, funding, etc.).

CRIS are already important tools in helping institutions track and manage their scientific output, promoting transparency and open access to research. But they can also play an important role as sources of information feeding into the EOSC, as data collected in CRIS can also be easily shared and used through the EOSC.

The road to open science

Collaboration between CRIS and EOSC has the potential to significantly improve the accessibility and re-use of research data, but there are also other transitional actions that can be taken on the road to producing increasingly open science:

  1. Ensure the quality of metadata to facilitate open data exchange.
  2. Disseminate the FAIR principles among the research community.
  3. Promote and develop common standards to facilitate interoperability.
  4. Encourage the use of open repositories.
  5. Contribute by sharing resources with the rest of the community.
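The first of these actions, metadata quality, can be illustrated with a small sketch: checking that a dataset record carries the minimal fields that support the FAIR principles. The required field names below are our own illustrative choice, not a formal FAIR assessment:

```python
# Illustrative set of metadata fields loosely mapped to FAIR principles.
REQUIRED_FIELDS = {
    "identifier",   # Findable: a persistent identifier
    "title",        # Findable: rich descriptive metadata
    "access_url",   # Accessible: retrievable by a standard protocol
    "format",       # Interoperable: a shared, machine-readable format
    "license",      # Reusable: clear usage conditions
}

def missing_fair_fields(record: dict) -> set:
    """Return the required metadata fields that are absent or empty in `record`."""
    return {f for f in REQUIRED_FIELDS if not record.get(f)}

# Invented example record: the empty license should be flagged
record = {
    "identifier": "doi:10.1234/example",
    "title": "Example survey data",
    "access_url": "https://example.org/data.csv",
    "format": "text/csv",
    "license": "",
}
print(missing_fair_fields(record))
```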

This will help to boost open science, increasing the efficiency, transparency and replicability of research.

Content prepared by Carlos Iglesias, Open data Researcher and consultant, World Wide Web Foundation.

The contents and points of view reflected in this publication are the sole responsibility of its author.

Noticia

Under the Spanish Presidency of the Council of the European Union, the Government of Spain has led the Gaia-X Summit 2023, held in Alicante on November 9 and 10. The event aimed to review the latest advances of Gaia-X in promoting data sovereignty in Europe. As presented on datos.gob.es, Gaia-X is a European private sector initiative for the creation of a federated, open, interoperable, and reversible data infrastructure, fostering digital sovereignty and data availability.

The summit has also served as a space for the exchange of ideas among the leading voices in the European data spaces community, culminating in the presentation of a statement to boost strategic autonomy in cloud computing, data, and artificial intelligence—considered crucial for EU competitiveness. The document, promoted by the State Secretariat for Digitization and Artificial Intelligence, constitutes a joint call for a "more coherent and coordinated" response in the development of programs and projects, both at the European and member state levels, related to data and sector technologies.

To achieve this, the statement advocates for interoperability supported by a robust cloud services infrastructure and the development of high-quality data-based artificial intelligence with a robust governance framework in compliance with European regulatory frameworks. Specifically, it highlights the possibilities offered by Deep Neural Networks, where success relies on three main factors: algorithms, computing capacity, and access to large amounts of data. In this regard, the document emphasizes the need to invest in the latter factor, promoting a neural network paradigm based on high-quality, well-parameterized data in shared infrastructures, not only saving valuable time for researchers but also mitigating environmental degradation by reducing computing needs beyond the brute force paradigm.

For this reason, another aspect addressed in the document is the stimulation of access to data sources from different complementary domains. This would enable a "flexible, dynamic, and highly scalable" data economy to optimize processes, innovate, and/or create new business models.

The call is optimistic about existing European initiatives and programs, starting with the Gaia-X project itself. Other projects highlighted include IPCEI-CIS or the Simpl European project. It also emphasizes the need for "broader and more effective coordination to drive industrial projects, advance the standardization of cloud and reliable data tags, ensuring high levels of cybersecurity, data protection, algorithmic transparency, and portability."

The statement underscores the importance of achieving a single data market that includes data exchange processes under a common governance framework. It values the innovative set of digital and data legislation, such as the Data Act, with the goal of promoting data availability across the Union. The statement is open to new members seeking to advance the promotion of a flexible, dynamic, and highly scalable data economy.

You can read the full document here: The Trinity of Trusted Cloud Data and AI as a Gateway to EU's Competitiveness

Blog

On September 8, the webinar "Geospatial Trends 2023: Opportunities for data.europa.eu" was held, organized by the Data Europa Academy and focused on emerging trends in the geospatial field. Specifically, the online conference addressed the concept of GeoAI (Geospatial Artificial Intelligence), which involves the application of artificial intelligence (AI) combined with geospatial data.

Next, we will analyze the most cutting-edge technological developments of 2023 in this field, based on the knowledge provided by the experts participating in the aforementioned webinar.

What is GeoAI?

The term GeoAI, as defined by Kyoung-Sook Kim, co-chair of the GeoAI Working Group of the Open Geospatial Consortium (OGC), refers to "a set of methods or automated entities that use geospatial data to perceive, construct (automate), and optimize spaces in which humans, as well as everything else, can safely and efficiently carry out their geographically referenced activities."

GeoAI allows us to create unprecedented opportunities, such as:

  • Extracting geospatial data enriched with deep learning: Automating the extraction, classification, and detection of information from data such as images, videos, point clouds, and text.
  • Conducting predictive analysis with machine learning: Facilitating the creation of more accurate prediction models, pattern detection, and automation of spatial algorithms.
  • Improving the quality, uniformity, and accuracy of data: Streamlining manual data generation workflows through automation to enhance efficiency and reduce costs.
  • Accelerating the time to gain situational knowledge: Assisting in responding more rapidly to environmental needs and making more proactive, data-driven decisions in real-time.
  • Incorporating location intelligence into decision-making: Offering new possibilities in decision-making based on data from the current state of the area that needs governance or planning.
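As a toy illustration of spatial prediction of the kind mentioned above, here is a classic inverse-distance-weighting interpolation in plain Python. Real GeoAI pipelines use learned models rather than this fixed formula, and the station data below are invented:

```python
import math

def idw_interpolate(points, query, power=2):
    """Inverse-distance-weighted estimate at `query` from known (x, y, value) points.

    A classic, simple spatial prediction technique: nearby observations
    are weighted more heavily than distant ones.
    """
    num = den = 0.0
    qx, qy = query
    for x, y, value in points:
        d = math.hypot(x - qx, y - qy)
        if d == 0:
            return value  # query coincides with a known point
        w = 1.0 / d ** power
        num += w * value
        den += w
    return num / den

# Example: temperature readings at three stations (invented data)
stations = [(0.0, 0.0, 10.0), (1.0, 0.0, 20.0), (0.0, 1.0, 30.0)]
print(idw_interpolate(stations, (0.1, 0.1)))
```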

Although this technology gained prominence in 2023, it was already discussed in the 2022 geospatial trends report, where it was indicated that integrating artificial intelligence into spatial data represents a great opportunity in the world of open data and the geospatial sector.

Use Cases of GeoAI

During the Geospatial Trends 2023 conference, companies in the GIS sector, con terra and 52°North, shared practical examples highlighting the use of GeoAI in various geospatial applications.

Examples presented by con terra included:

  • KINoPro: A research project using GeoAI to predict the activity of the "black arches" moth and its impact on German forests.
  • Anomaly detection in cell towers: Using a neural network to detect causes of anomalies in towers that can affect the location in emergency calls.
  • Automated analysis of construction areas: Aiming to detect building areas for industrial zones using OpenData and satellite imagery.

On the other hand, 52°North presented use cases such as MariData, which seeks to reduce emissions from maritime transport by using GeoAI to calculate optimal routes, considering ship position, environmental data, and maritime traffic regulations. They also presented KI:STE, which applies artificial intelligence technologies in environmental sciences for various projects, including classifying Sentinel-2 images into (un)protected areas.

These projects highlight the importance of GeoAI in various applications, from predicting environmental events to optimizing maritime transport routes. They all emphasize that this technology is a crucial tool for addressing complex problems in the geospatial community.

GeoAI not only represents a significant opportunity for the spatial sector but also tests the importance of having open data that adheres to FAIR principles (Findable, Accessible, Interoperable, Reusable). These principles are essential for GeoAI projects as they ensure transparent, efficient, and ethical access to information. By adhering to FAIR principles, datasets become more accessible to researchers and developers, fostering collaboration and continuous improvement of models. Additionally, transparency and the ability to reuse open data contribute to building trust in results obtained through GeoAI projects.

Reference

Reference video: https://www.youtube.com/watch?v=YYiMQOQpk8A

 

Blog

 The active participation of young people in civic and political life is one of the keys to strengthening democracy in Europe. Analyzing and understanding the voice of young people provides insight into their attitudes and opinions, something that helps to foresee future trends in society with sufficient room for maneuver to address their needs and concerns towards a more prosperous and comfortable future for all.   

 In the mission to gain a clearer perspective on how they participate in Europe, open data has become a valuable tool. In this post, we will explore how young people in Europe actively engage in society and politics through relevant European Union (EU) open data published on the European open data portal.   

Youth commitment in the European elections  

 The European Union has as one of its objectives to promote the active participation of young people in democracy and society. Their participation in elections and civic activities enriches European democracy. Young people bring diverse ideas and perspectives, which contributes to decision-making and ensures that policies are tailored to their needs and challenges. In addition, their participation contributes to a political system that reflects the interests of all citizens, which in turn fosters an inclusive and peaceful society.  

 In the last European Parliament elections, the highest turnout in the last 20 years was achieved, with more than 50% of the European population voting, as corroborated by the EU's Eurobarometer post-election survey. This increase in turnout was largely due to an increase in youth participation. 

The data show that the youngest voters (under 25) increased their turnout by 14 percentage points, to 42%, while turnout among 25-39 year olds rose by 12 points, to 47%, comparing the 2014 and 2019 European elections. This growth in youth participation raises a question: what motivated young people to participate more? According to the 2021 Eurobarometer Youth Survey, a sense of duty as a citizen (32%) and a willingness to take responsibility for the future (32%) were the main factors motivating young people to vote in the European elections.

Why do young people want to participate in the EU?

In addition to voting in elections, there are other ways in which young people demonstrate that they are an active part of citizenship. The Youth Survey 2021 reveals interesting data about their interest in politics and civic life.

In general, politics is a topic that interests them. The majority of participants in the Youth Survey 2021 claimed to discuss politics with friends or family (85%). In addition, many said they understand how their country's national government works (58%). However, most young people feel they have little influence on important decisions, laws, and policies that affect them. Young people feel they have more say in their local government (47%), than in the national government (40%) or at the EU level (30%).

The next step, after understanding the policy, is action. Young people believe that certain political and civic activities are more effective than others in getting their voice to decision-makers. In order, voting in elections (41%), participating in protests or street demonstrations (33%) and participating in petitions (30%) were considered the three most effective activities by respondents. Many young people had voted in their last local, national or European elections (46%) and had created or signed a petition (42%).

However, the survey reveals an interesting divergence between young people's perceptions and their actions. On some occasions, youth get involved in activities even though they are not what they consider to be the most effective, as in the case of online signature petitions. On the other hand, they do not always participate in activities that they perceive to be effective, such as street protests or contact with politicians.

The youth impulse for European democracy

Young people want the issues they see as priorities to be on the political agenda of the next European elections. A more recent special Eurobarometer on democracy in action in 2023 revealed that young people aged 15-24 are the age group most satisfied with the functioning of democracy in the EU (61%, compared to the EU average of 54%).

Climate change is a particularly prominent concern among young people, with 40% of respondents aged 15-24 considering this issue a priority, compared to 31% of the general EU population.

To encourage youth participation in the European political agenda, initiatives have been developed that use open data to bring politics closer to citizens. Examples such as TrackmyEU and Democracy Game seek to engage young people in politics and enable them to access information on EU policies and participate in debates and civic activities.

In general, open data provides valuable insights into many realities, for example, that affecting youth and their interaction in society and politics. This analysis enables governments and public administrations to make informed decisions on issues affecting this social group. Young Europeans are interested in politics, actively participate in elections and get involved in youth organizations; they are concerned about issues such as inequality and climate change. Open data is also used in initiatives that promote the participation of young people in political and civic life, further strengthening European democracy.

In an increasingly digital and data-driven society, access to open data is essential to understand the concerns and interests of youth and their participation in civic and political decision-making. As a part of an active and engaged citizenry, youth have an important role to play in Europe's future, and open data is an essential tool to support their participation.

Content based on the post from the European open data portal Understanding youth engagement in Europe through open data.

Blog

IATE, which stands for Interactive Terminology for Europe, is a dynamic database designed to support the multilingual drafting of European Union texts. It aims to provide relevant, reliable and easily accessible data with a distinctive added value compared to other sources of lexical information such as electronic archives, translation memories or the Internet.  

This tool has been used by the EU institutions since 2004 and is also of interest to language professionals, academics, public administrations, companies and the general public. The project, launched in 1999 by the Translation Centre, is available to any organization or individual who needs to draft, translate or interpret a text on the EU.

Origin and usability of the platform  

IATE was created in 2004 by merging different EU terminology databases. The original Eurodicautom, TIS, Euterpe, Euroterms and CDCTERM databases were imported into IATE. This process resulted in a large number of duplicate entries, with the consequence that many concepts are covered by several entries instead of just one. To solve this problem, a cleaning working group was formed, which since 2015 has been responsible for organizing analyses and data cleaning initiatives to consolidate duplicate entries into a single entry. This explains why statistics on the number of entries and terms show a downward trend, as more content is deleted and updated than is created.

In addition to performing queries, it is possible to download IATE's datasets, together with the IATExtract extraction tool, which allows you to generate filtered exports.

This inter-institutional terminology base was initially designed to manage and standardize the terminology of EU agencies. Subsequently, however, it also began to be used as a support tool in the multilingual drafting of EU texts, and has now become a complex and dynamic terminology management system. Although its main purpose is to facilitate the work of translators working for the EU, it is also of great use to the general public.

IATE has been available to the public since 2007 and brings together the terminology resources of all EU translation services. The Translation Centre manages the technical aspects of the project on behalf of the partners involved: European Parliament (EP), Council of the European Union (Consilium), European Commission (COM), Court of Justice (CJEU), European Central Bank (ECB), European Court of Auditors (ECA), European Economic and Social Committee (EESC), European Committee of the Regions (CoR), European Investment Bank (EIB) and the Translation Centre for the Bodies of the European Union (CdT).

The IATE data structure is based on a concept-oriented approach, which means that each entry corresponds to a concept (terms are grouped by their meaning), and each concept should ideally be covered by a single entry. Each IATE entry is divided into three levels:  

  • Language Independent Level (LIL)  

  • Language Level (LL)  

  • Term Level (TL)

Reference source for professionals and useful for the general public

IATE reflects the needs of translators in the European Union, so that any field that has appeared or may appear in the texts of the publications of the EU environment, its agencies and bodies can be covered. The financial crisis, the environment, fisheries and migration are areas that have been worked on intensively in recent years. To achieve the best result, IATE uses the EuroVoc thesaurus as a classification system for thematic fields.  

As we have already pointed out, this database can be used by anyone who is looking for the right term about the European Union. IATE allows exploration in fields other than that of the term consulted and filtering of the domains in all EuroVoc fields and descriptors. The technologies used mean that the results obtained are highly accurate and are displayed as an enriched list that also includes a clear distinction between exact and fuzzy matches of the term. 
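The exact/fuzzy distinction can be sketched with Python's standard difflib module. This is only a toy illustration (IATE's actual matching technology is more sophisticated, and the threshold below is an arbitrary choice):

```python
from difflib import SequenceMatcher

def match_kind(query: str, term: str, threshold: float = 0.8):
    """Classify a candidate term as an 'exact' or 'fuzzy' match for a query.

    Returns None when the similarity falls below `threshold`.
    A toy sketch of the exact/fuzzy distinction, not IATE's real matcher.
    """
    if query.lower() == term.lower():
        return "exact"
    ratio = SequenceMatcher(None, query.lower(), term.lower()).ratio()
    return "fuzzy" if ratio >= threshold else None

print(match_kind("data space", "Data Space"))   # case difference only
print(match_kind("data space", "data spaces"))  # near-identical string
```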

The public version of IATE includes the official languages of the European Union, as defined in Regulation No. 1 of 1958. In addition, a systematic feed is carried out through proactive projects: if it is known that a certain topic is to be covered in EU texts, files relating to this topic are created or improved so that, when the texts arrive, the translators already have the required terminology in IATE.  

How to use IATE  

To search in IATE, simply type in a keyword or part of a collection name. You can define further filters for your search, such as institution, type or date of creation. Once the search has been performed, a collection and at least one display language are selected.  

To download subsets of IATE data you need to be registered, a completely free option that allows you to store some user preferences in addition to downloading. Downloading is also a simple process and can be done in csv or tbx format. 

The IATE download file, whose information can also be accessed in other ways, contains the following fields:

  • Language independent level:
      • Token number: the unique identifier of each concept.
      • Subject field: the concepts are linked to the fields of knowledge in which they are used. The conceptual structure is organized around twenty-one thematic fields with various subfields, and concepts can be linked to more than one thematic field.
  • Language level:
      • Language code: each language has its own ISO code.
  • Term level:
      • Term: the term designating the concept.
      • Type of term: term, abbreviation, phrase, formula or short formula.
      • Reliability code: IATE uses four codes to indicate the reliability of terms: untested, minimal, reliable or very reliable.
      • Evaluation: when several terms are stored in a language, specific evaluations can be assigned: preferable, admissible, discarded, obsolete and proposed.
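The three-level structure described above can be sketched with Python dataclasses. Class and attribute names here are our own illustrative choice, and the real export uses TBX/CSV fields rather than this exact shape:

```python
from dataclasses import dataclass, field

@dataclass
class Term:                        # Term Level (TL)
    term: str
    term_type: str                 # term, abbreviation, phrase, formula...
    reliability: str               # untested, minimal, reliable, very reliable
    evaluation: str = ""           # preferable, admissible, discarded...

@dataclass
class LanguageEntry:               # Language Level (LL)
    language_code: str             # ISO language code
    terms: list = field(default_factory=list)

@dataclass
class Concept:                     # Language Independent Level (LIL)
    token_number: int              # unique identifier of the concept
    subject_fields: list           # thematic fields (EuroVoc-style)
    languages: list = field(default_factory=list)

# Invented example entry: one concept, one language, one term
entry = Concept(
    token_number=1234567,
    subject_fields=["ENVIRONMENT"],
    languages=[LanguageEntry("en", [Term("greenhouse gas", "term", "very reliable")])],
)
print(entry.languages[0].terms[0].term)
```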

A continuously updated terminology database  

The IATE database is in constant growth and open to public participation: anyone can contribute by proposing new terms to be added to existing entries or by suggesting new entries. You can send your proposal to iate@cdt.europa.eu, or use the 'Comments' link that appears at the bottom right of the entry for the term in question. You can provide as much relevant information as you wish to justify the reliability of the proposed term. A terminologist for the language in question will study each proposal and evaluate its inclusion in IATE.

In August 2023, IATE announced the availability of version 2.30.0 of this data system, adding new fields to its platform and improving functions, such as the export of enriched files to optimize data filtering. As we have seen, this EU inter-institutional terminology database will continue to evolve continuously to meet the needs of EU translators and IATE users in general. 

Another important aspect is that this database is used for the development of computer-assisted translation (CAT) tools, which helps to ensure the quality of the translation work of the EU translation services. The results of translators' terminology work are stored in IATE and translators, in turn, use this database for interactive searches and to feed domain- or document-specific terminology databases for use in their CAT tools.  

IATE, with more than 7 million terms in over 700,000 entries, is a reference in the field of terminology and is considered the largest multilingual terminology database in the world. More than 55 million queries are made to IATE each year from more than 200 countries, which is a testament to its usefulness. 

News

The Instituto de Estadística y Cartografía de Andalucía (IECA), in collaboration with the Andalusian Agency for International Development Cooperation (AACID), has incorporated new indicators at the municipal level into its Sustainable Development Indicators System for Andalusia for the Agenda 2030. This effort aims to integrate statistical and geographical information while enhancing the efficiency of the Andalusian public administration and the information services provided to society.

Thanks to these efforts, Andalusia has been selected as one of the participating regions in the European project "REGIONS 2030: Monitoring the SDGs in EU regions," along with nine other regions in the European Union. All of these regions share a strong commitment to the analysis and fulfillment of the Sustainable Development Goals (SDGs), recognizing the importance of this work in decision-making and sustainable regional development.

The "REGIONS 2030" project, funded by the European Parliament and developed by the Joint Research Centre (JRC) of the European Commission in collaboration with the Directorate-General for Regional and Urban Policy (DG REGIO) and EUROSTAT, aims to fill data gaps in monitoring the SDGs in EU regions.

Image 1: "REGIONS 2030" Project: Monitoring the SDGs in EU regions.

Source: Andalusian Institute of Statistics and Cartography (IECA)

 

The new indicators incorporated are essential for measuring the progress of the SDGs

The Andalusian Institute of Statistics and Cartography, in collaboration with AACID, has created a set of indicators, available on its website, for evaluating progress on the Sustainable Development Goals at the regional level. All the new municipal-level indicators for Andalusia are identified with the (municipal) label, and they address 9 of the 17 Sustainable Development Goals.

The methodology used for most of the indicators is based on georeferenced information from the Andalusian Institute of Statistics and Cartography, using publications on the Spatial Distribution of the Population in Andalusia and the Characterization and Distribution of Built Space in Andalusia as reference points.

One of the indicators provides information on Goal 1: No Poverty and measures the risks of poverty by assessing the percentage of people residing at an address where none of their members are affiliated with Social Security. This indicator reveals more unfavorable conditions in urban municipalities compared to rural ones, consistent with previous studies that identify cities as having more acute poverty situations than rural areas.

Similarly, the per capita Built-up Area indicator for Goal 11: Sustainable Cities and Communities has been calculated using cadastral data and geospatial processes in geographic information systems.

Visualization and query of the new municipal indicators

These visualizations allow users to obtain municipal-level information on the value of each indicator and its variation from the previous year, both for Andalusia as a whole and by degree of urbanization.

 

Image 2: Data visualization of the indicator.

Source: Andalusian Institute of Statistics and Cartography (IECA)

 

Moreover, the applied filter enables an analysis of the temporal and geographical evolution of the indicators in each of the considered areas, providing a temporal and territorial perspective.

Image 3: Visualization of the indicator's evolution by area.

Source: Andalusian Institute of Statistics and Cartography (IECA)

 

These results are presented through an interactive map at the municipal level, displaying the distribution of the indicator in the territory.

Image 4: Interactive map of the indicator.

Source: Andalusian Institute of Statistics and Cartography (IECA)

The data for the indicators are also available in downloadable structured formats (XLS, CSV, and JSON). Methodological information regarding the calculations for each indicator is provided as well.
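As a sketch of how such a download could be consumed, the snippet below computes an indicator's variation with respect to the previous year (the figure shown in the viewer) from a small CSV extract. The column names and values are invented for the example and do not reflect the actual IECA schema.

```python
# Sketch: year-on-year variation of a municipal indicator from a CSV extract.
# Column names and figures are hypothetical, not the real IECA schema.
import csv
import io

SAMPLE = """municipality,year,value
Sevilla,2021,12.4
Sevilla,2022,11.9
Granada,2021,9.8
Granada,2022,10.1
"""

def yearly_variation(csv_text):
    """Variation of the latest year versus the previous one, per municipality."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    by_muni = {}
    for r in rows:
        by_muni.setdefault(r["municipality"], {})[int(r["year"])] = float(r["value"])
    out = {}
    for muni, series in by_muni.items():
        years = sorted(series)
        out[muni] = round(series[years[-1]] - series[years[-2]], 2)
    return out

print(yearly_variation(SAMPLE))  # -> {'Sevilla': -0.5, 'Granada': 0.3}
```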

The inclusion of Andalusia in the "REGIONS 2030" project

This inclusion has integrated all of this work with the existing Sustainable Development Indicators System for Andalusia for the Agenda 2030, which the IECA has calculated and published to date. This collective effort among different regions will serve to establish a methodology and select the most relevant regional indicators in Europe (NUTS2 level) so that the methodology can be applied to all European regions in the future.

The "REGIONS 2030" project, after completing its initial work in Andalusia, has disseminated its results in the article "Monitoring the SDGs in Andalusia region, Spain," published by the European Commission in July 2023, and in an event held at the Three Cultures Foundation of the Mediterranean on September 27, under the title 'SDG Localisation and Monitoring Framework for 2030 Agenda Governance: Milestones & Challenges in Andalusia.' In this event, each selected region presented their results and discussed the needs, deficiencies, or lessons learned in generating their reports.

The "REGIONS 2030" project will conclude in December 2023 with the presentation and publication of a final report. This report will consolidate the ten regional reports generated during the monitoring of the Sustainable Development Goals at the regional level in Europe, contributing to their effective monitoring as part of the proper implementation of the Agenda 2030.

Blog

The INSPIRE (Infrastructure for Spatial Information in Europe) Directive sets out the general rules for the establishment of an Infrastructure for Spatial Information in the European Community based on the Infrastructures of the Member States. Adopted by the European Parliament and the Council on 14 March 2007 (Directive 2007/2/EC), it entered into force on 25 April 2007.

INSPIRE makes it easier to find, share and use spatial data from different countries. The information is available through an online portal, where it can be browsed by format and topic of interest.

To ensure that these data are compatible and interoperable in a Community and cross-border context, the Directive requires the adoption of common Implementing Rules specific to the following areas:

  • Metadata
  • Data sets
  • Network services
  • Data sharing and services
  • Spatial data services
  • Monitoring and reporting

The technical implementation of these standards is done through Technical Guidelines, technical documents based on international standards and norms.

INSPIRE and semantic interoperability

These rules are considered Commission decisions or regulations and are therefore binding in each EU country. The transposition of this Directive into Spanish law is developed through Law 14/2010 of 5 July, which refers to the infrastructures and geographic information services of Spain (LISIGE) and the IDEE portal, both of which are the result of the implementation of the INSPIRE Directive in Spain.

Semantic interoperability plays a decisive role in INSPIRE. Thanks to this, there is a common language in spatial data, as the integration of knowledge is only possible when a homogenisation or common understanding of the concepts that constitute a domain or area of knowledge is achieved. Thus, in INSPIRE, semantic interoperability is responsible for ensuring that the content of the information exchanged is understood in the same way by any system.

Thus, the INSPIRE spatial data models, implemented in the GML exchange format, make extensive use of code lists, which are an important part of the INSPIRE data specifications and contribute substantially to interoperability.

In general, a codelist (or code list) contains several terms whose definitions are universally accepted and understood. Code lists promote data interoperability and constitute a shared vocabulary for a community. They can even be multilingual.

INSPIRE code lists are commonly managed and maintained in the central INSPIRE Registry, federated through the Register of Registers (RoR), which provides search capabilities so that both end users and client applications can easily access code list values for reference.
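By way of illustration, the snippet below extracts the values of a code list from a registry-style JSON description, of the kind a client application might retrieve. The JSON structure shown is a simplified assumption for the example, not the exact payload served by the INSPIRE Registry.

```python
# Sketch: reading code list values from a registry-style JSON description.
# The JSON layout below is a simplified assumption, not the exact payload
# returned by the INSPIRE Registry.
import json

CODELIST_JSON = """
{
  "codelist": {
    "id": "https://inspire.ec.europa.eu/codelist/CountryCode",
    "label": {"text": "Country code", "lang": "en"},
    "containeditems": [
      {"value": {"id": "https://inspire.ec.europa.eu/codelist/CountryCode/es",
                 "label": {"text": "Spain", "lang": "en"}}},
      {"value": {"id": "https://inspire.ec.europa.eu/codelist/CountryCode/fr",
                 "label": {"text": "France", "lang": "en"}}}
    ]
  }
}
"""

def codelist_labels(payload):
    """Map each code list value URI to its human-readable label."""
    data = json.loads(payload)
    return {item["value"]["id"]: item["value"]["label"]["text"]
            for item in data["codelist"]["containeditems"]}

labels = codelist_labels(CODELIST_JSON)
print(labels["https://inspire.ec.europa.eu/codelist/CountryCode/es"])  # -> Spain
```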

 

Registers are necessary because they:

  • Provide the codes defined in the Technical Guidelines, Regulations and Technical Specifications needed to implement the Directive.
  • Allow unambiguous references to elements.
  • Provide unique and persistent identifiers for resources.
  • Enable consistent management and version control of the different elements.

The code lists used in INSPIRE are maintained in:

  • The central INSPIRE registry federation (RoR).
  • The code list register of a member state.
  • The register of a recognised external third party that maintains a domain-specific code list.

To add a new code list, you will need to set up your own registry or work with the administrators of one of the existing registries to publish it. This can be quite a complicated process, but a new tool helps with the task.

Re3gistry is a reusable open-source solution, released under the EUPL, that allows companies and organisations to manage and share "reference codes" through persistent URIs, ensuring that concepts are unambiguously referenced in any domain and facilitating the management of these resources graphically throughout their lifecycle.

Funded by the ELISE action of the ISA² programme, it is a solution recognised in the European Interoperability Framework as a supporting tool.

 

Illustration 3: Image of the Re3gistry interface

Re3gistry is available for both Windows and Linux and offers an easy-to-use web interface for adding, editing and managing registers and reference codes. In addition, it allows management of the complete lifecycle of reference codes (based on the procedures for item registration defined in ISO 19135:2005).

The editing interface also provides a flag that tells the system to expose a reference code in the format required for integration with the RoR, so that it can eventually be imported into the INSPIRE registry federation. For this integration, Re3gistry produces an export based on the following specifications:

  • The W3C Data Catalog Vocabulary (DCAT), used to model the register entity (dcat:Catalog).
  • The W3C Simple Knowledge Organization System (SKOS), used to model the register (skos:ConceptScheme) and its items (skos:Concept).
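A minimal sketch of the shape of that export, with invented identifiers: the register is typed as a skos:ConceptScheme (and dcat:Catalog) and each reference code as a skos:Concept linked back to its scheme. The triples are assembled by hand here purely to illustrate the mapping.

```python
# Sketch: the shape of a register exported as SKOS/DCAT triples.
# Identifiers and labels are invented for illustration.
SKOS = "http://www.w3.org/2004/02/skos/core#"
DCAT = "http://www.w3.org/ns/dcat#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

register = "https://example.org/registry/codelist/MyCodeList"
codes = {
    "https://example.org/registry/codelist/MyCodeList/a": "Code A",
    "https://example.org/registry/codelist/MyCodeList/b": "Code B",
}

# The register itself is both a skos:ConceptScheme and a dcat:Catalog.
triples = [
    (register, RDF_TYPE, SKOS + "ConceptScheme"),
    (register, RDF_TYPE, DCAT + "Catalog"),
]
# Each reference code becomes a skos:Concept in the scheme.
for uri, label in codes.items():
    triples.append((uri, RDF_TYPE, SKOS + "Concept"))
    triples.append((uri, SKOS + "inScheme", register))
    triples.append((uri, SKOS + "prefLabel", label))

print(len(triples))  # -> 8
```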

Other notable features of Re3gistry

  • Highly flexible and customisable data models
  • Multi-language content support
  • Support for version control
  • RESTful API with content negotiation (including OpenAPI 3 descriptor)
  • Free-text search
  • Supported formats: HTML, ISO 19135 XML, JSON
  • Service formats can be easily added or customised (default formats): JSON and ISO 19135 XML
  • Multiple authentication options
  • Externally governed elements referenced through URIs
  • INSPIRE record federation format support (option to automatically create RoR format)
  • Easy data export and re-indexing (SOLR)
  • Guides for users, administrators, and developers
  • RSS feed

Ultimately, Re3gistry provides a central access point where reference code labels and descriptions are easily accessible to both humans and machines, while fostering semantic interoperability between organisations by:

  • Avoiding common mistakes, such as misspellings or the entry of synonyms, when filling in online forms.
  • Facilitating the internationalisation of user interfaces by providing multilingual labels.
  • Ensuring semantic interoperability in the exchange of data between systems and applications.
  • Tracking changes over time through a well-documented version control system.
  • Increasing the value of reference codes as they are widely reused and referenced.

More about Re3gistry:

  • Repository: https://github.com/ec-jrc/re3gistry
  • User manual: https://github.com/ec-jrc/re3gistry/blob/master/documentation/user-manual.md
  • Administrator's manual: https://github.com/ec-jrc/re3gistry/blob/master/documentation/administrator-manual.md
  • Developer's manual: https://github.com/ec-jrc/re3gistry/blob/master/documentation/developer-manual.md

 


Content prepared by Mayte Toscano, Senior Consultant in Technologies linked to the data economy.

The contents and views expressed in this publication are the sole responsibility of the author.

 

Blog

A data space is a development framework that enables the creation of a complete ecosystem by providing an organisational, regulatory, technical and governance structure. Its objective is to facilitate the reliable and secure exchange of data assets for the common benefit of all actors involved, while ensuring compliance with all applicable laws and regulations. Data spaces are also a key element of the European Union's new data strategy and an essential building block in realising the goal of a European single data market.

As part of this strategy, the EU is currently exploring the creation of several data space pilots in a number of strategic sectors and domains: health, industry, agriculture, finance, mobility, the Green Deal, energy, public administration and skills. These data spaces offer great potential to help organisations improve decision-making, increase innovation, develop new products, services and business models, reduce costs and avoid duplication of effort. However, creating a successful data space is not a trivial activity: it requires first carefully analysing the use cases and then facing major business, legal, operational, functional, technological and governance challenges.

This is why, as a support measure, the Data Spaces Support Centre (DSSC) has also been created to provide guidance, tools and resources to organisations interested in creating or participating in new data spaces. One of the first resources developed by the DSSC was the Data Spaces Starter Kit, the final version of which has recently been published and which provides a basic initial guide to understanding the basic elements of a data space and how to deal with the different challenges that arise when building them. We review below some of the main guidelines and recommendations offered by this starter kit.

The value of data spaces and their business models

Data spaces can be a real alternative to today's unidirectional platforms, generating business models based on network effects that respond to both the supply of and demand for data. A variety of business model patterns can be found among existing data spaces.

The legal aspects

The legal side of data spaces can be a major challenge as they necessarily move between multiple legal frameworks and regulations, both national and European. To address this challenge, the Data Spaces Support Centre proposes the elaboration of a reference framework composed of three main instruments:

  • The cross-cutting legal frameworks that will apply to all data spaces, such as contract law, data protection, intellectual property, competition or cybersecurity laws.
  • The organisational aspects to consider when establishing models and mechanisms for data governance in each specific case.
  • The contractual dimension to be taken into account when exchanging data and the agreements and terms of use to be established to make this possible.

Operational activities

The design of operational activities should address the arrangements that enable the organisational functioning of the data space, such as guidelines for onboarding new participants, decision-making and conflict resolution.

In addition, consideration should also be given to business operations, such as process streamlining and automation, marketing tasks and awareness-raising activities, which are also important components of operational activities.

Functionality of data spaces

Data spaces share a number of basic components (or building blocks) that provide the minimum functionality expected of them, including at least the following elements:

  • Interoperability: data models and formats, data exchange interfaces and origin and traceability.
  • Trust: identity management, access and usage control and secure data exchanges.
  • Data value: metadata and location protocols, data usage accounting, publishing and commercial services.
  • Governance: cooperation and service level agreements and continuity models.
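To make the taxonomy concrete, the sketch below records the four categories listed above and checks a hypothetical data space design against them. The check itself is an invented illustration, not part of the DSSC starter kit.

```python
# Sketch: checking a data space design against the minimum building blocks.
# Category contents mirror the list above; the draft design is hypothetical.
MINIMUM_BLOCKS = {
    "interoperability": {"data models and formats", "data exchange interfaces",
                         "origin and traceability"},
    "trust": {"identity management", "access and usage control",
              "secure data exchanges"},
    "data value": {"metadata and location protocols", "data usage accounting",
                   "publishing and commercial services"},
    "governance": {"cooperation and service level agreements",
                   "continuity models"},
}

def missing_blocks(design):
    """Return, per category, the building blocks a design still lacks."""
    return {cat: sorted(required - design.get(cat, set()))
            for cat, required in MINIMUM_BLOCKS.items()
            if required - design.get(cat, set())}

draft = {"trust": {"identity management", "secure data exchanges"},
         "governance": set(MINIMUM_BLOCKS["governance"])}
gaps = missing_blocks(draft)
print(sorted(gaps))  # -> ['data value', 'interoperability', 'trust']
```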

Building blocks

 

While these components can be expected to be common to all data spaces and provide similar functionality, each individual data space can make its own design choices in implementing and realising them.

Technological aspects

Data spaces are designed to be technology agnostic, i.e., defined solely in terms of functionality and with freedom in the choice of specific technologies for implementation. In this scenario it will be important to establish clear references in terms of:

  • A formal basis of de facto standards to be followed.
  • Specifications to serve as a reference for the different implementations.
  • Open source implementations of the basic components carried out by other actors.

Governance of data spaces

Designing, implementing and maintaining a data space requires multiple organisations to collaborate together across different functions. This requires these entities to build a common vision of the key aspects of such collaboration through a governance framework.

This will require a joint design exercise through which stakeholders formalise a set of agreements defining key strategic and operational aspects, such as legal issues, description of the network of participants, code of conduct, terms and conditions of use, data space incorporation and membership agreements, and governance model.

In the near future the DSSC support centre will identify the core components of each of the dimensions described above and provide additional guidance for each of them through the development of a common blueprint for data spaces. So, if you are considering participating in any of the data spaces initiatives that are being launched, but are not quite sure where to start, then this basic starter kit will certainly be a valuable resource in understanding the basic concepts - along with the glossary that explains all the related terminology. Also, don't forget to subscribe to the support centre's newsletter to keep up to date with all the latest news, documentation and support services on offer.

Content prepared by Carlos Iglesias, Open data Researcher and consultant, World Wide Web Foundation. The contents and views reflected in this publication are the sole responsibility of the author.

Blog

We live in the era of data, a lever of digital transformation and a strategic asset for innovation and the development of new technologies and services. Beyond the capabilities it brings to its generator and/or owner, data also has the peculiarity of being a non-rival asset. This means that it can be reused without detriment to the holder of the original rights, which makes it a resource with a high degree of scalability in its sharing and exploitation.

This possibility of non-rival sharing, in addition to opening potential new lines of business for the original owners, also carries enormous latent value for the development of new business models. And although sharing is not new, it is still largely limited to niche contexts of sectoral specialisation, mediated either by trust between the parties (usually forged in advance) or by tedious and rigid contractual conditions. This is why the innovative concept of the data space has emerged, which in its simplest sense is nothing more than the modelling of the general conditions under which to deploy voluntary, sovereign and secure data sharing. Once modelled, the prescription of considerations and methodologies (technological, organisational and operational) makes such sharing tangible through peer-to-peer interactions, which together shape federated ecosystems of data sets and services.

Therefore, and given the distributed nature of data spaces (they are not a monolithic computer system, nor a centralised platform), an optimal way to approach their construction is through the creation and deployment of use cases. 

The Data Office has created this infographic of a 'Model of use case development within data spaces', with the objective of synthetically defining the phases of this iterative journey, which progressively shapes a data space. This model also serves as a general framework for other technical and methodological deliverables to come, such as the 'Use Case Feasibility Assessment Guide', or the 'Use Case Design Guide', elements with which to facilitate the implementation of practical (and scalable by design) data sharing experiences, a sine qua non condition to articulate the longed-for European single data market.

The challenge of building a data space

To make the process of developing a data space more accessible, we can compare the definition and construction of a use case to a construction project, in which, starting from an initial business problem (needs or challenges, desires, or problems to be solved), a goal is reached in which value is added to the business by providing a solution to those initial needs. This infographic offers a synthesis of that journey.

These are the phases of the model:

PHASE 1: Definition of the business problem. In this phase a group of potential participants detects an opportunity around the sharing of their data (hitherto siloed) and its corresponding exploitation. This opportunity can be new products or services (innovation), efficiency improvements, or the resolution of a business problem. In other words, there is a business objective that the group can solve jointly, by sharing data.

PHASE 2: Data-driven modelling. In this phase, those elements that serve to structure and organise the data for strategic decision-making based on its exploitation will be identified. It involves defining a model that possibly uses multidisciplinary tools to achieve business results. This is the part traditionally associated with data science tasks.

PHASE 3: Consensus on requirements specification. Here, the actors sponsoring the use case must establish the relationship model to have during this collaborative project around the data. Such a formula must: (i) define and establish the rules of engagement, (ii) define a common set of policies and governance model, and (iii) define a trust model that acts as the root of the relationship.

PHASES 4 and 5: Use case mapping. As in a construction project, the blueprint is the means of expressing the ideas of those who have defined and agreed the use case, and should explicitly capture the solutions proposed for each part of the use case development. This plan is unique for each use case, and phase 5 corresponds to its construction. However, it is not created from scratch, but there are multiple references that allow the use of previously identified materials and techniques. For example, models, methodologies, artefacts, templates, technological components or solutions as a service. Thus, just as an architect designing a building can reuse recognised standards, in the world of data spaces there are also models on which to paint the components and processes of a use case. The analysis and synthesis of these references is phase 4.

PHASE 6: Technology selection, parameterisation and/or development. The technology enables the deployment of the transformation and exploitation of the data, favouring the entire life cycle, from its collection to its valorisation. In this phase, the infrastructure that supports the use case is implemented, understood as the collection of tools, platforms, applications and/or pieces of software necessary for the operation of the application.

PHASE 7: Integration, testing and deployment. Like any technological construction process, the use case will go through the phases of integration, testing and deployment. The integration work and the functional, usability, exploratory and acceptance tests, etc. will help us to achieve the desired configuration for the operational deployment of the use case. In the case of wanting to incorporate a use case into a pre-existing data space, the integration would seek to fit within its structure, which means modelling the requirements of the use case within the processes and building blocks of the data space.

PHASE 8: Operational data space. The end point of this journey is the operational use case, which will employ digital services deployed on top of the data space structure, and whose architecture supports different resources and functionalities federated by design. This implies that the value creation lifecycle would have been efficiently articulated based on the shared data, and business returns are achieved according to the original approach. However, this does not prevent the data space from continuing to evolve a posteriori, as its vocation is to grow either with the entry of new challenges, or actors to existing use cases. In fact, the scalability of the model is one of its unique strengths.

In essence, the data shared through a federated and interoperable ecosystem is the input that feeds a layer of services that will generate value and solve the original needs and challenges posed, in a journey that goes from the definition of a business problem to its resolution.

 
