Sectores
calendar icon
Blog

Today's industry is facing one of the biggest challenges in its recent history. Market demands, pressure to meet climate targets, consumer demand for transparency and technological acceleration are converging in a profound transformation of the production model. This transformation is not only aimed at greater competitiveness, but also at more resilient, flexible, efficient and sustainable production.

In this context, industrial digitisation - driven by technologies such as the Internet of Things (IoT), artificial intelligence, edge computing, or cyber-physical systems - is generating massive amounts of operational, environmental and logistical data. However, the mere existence of this data does not in itself guarantee value. The key is to govern it properly, i.e. to establish principles, processes, roles and technologies that ensure that this data is reliable, accessible, useful and secure. In other words, that the data is fit to be harnessed to improve industrial processes.

This is why industrial data governance is positioned as a strategic factor. It is not just a matter of ‘having data’, but of turning it into a critical infrastructure for decision-making, resource optimisation, intelligent automation and ecological transition. Without data governance, there is no Industry 4.0. And without Industry 4.0, the challenges of sustainability, energy efficiency or full traceability are intractable.

In this article we explore why data governance is essential in industry, what concrete benefits it brings to production processes and how initiatives such as the National Industry Data Space can accelerate this transformation.
 We then analyse its impact at different levels, from the most relevant use cases to the collaborative frameworks that are emerging in Spain.

Why is data governance key in industry?

Industrial data comes from a multitude of distributed sources: IoT sensors, SCADA systems, automated assembly lines, maintenance platforms, ERP or Manufacturing Execution Systems (MES), among others. This heterogeneity, if not properly managed, can become a barrier to the integration and useful analysis of information.

Data governance overcomes these barriers by establishing the rules of the game for data capture, storage, quality, interoperability, use, protection and disposal. This enables not only operational efficiency but also long-term sustainability. How?

  • Reducing operational inefficiencies: by having accurate, up-to-date and well-integrated data between systems, tasks can be automated, rework avoided, and unplanned downtime reduced. For example, a plant can adjust the speed of its production lines in real time based on the analysis of performance and demand data.
  • Improving sustainability: robust data management can identify patterns of energy consumption, materials or emissions. With this information, processes can be redesigned to be more sustainable, eco-design can be applied and the environmental footprint can be reduced. Data, in this case, acts as a compass towards decarbonisation.
  • Ensuring regulatory compliance and traceability: from ISO 9001 to the new circular economy regulations or the Digital Product Passport, industries must demonstrate compliance. This is only possible with reliable, traceable and auditable data.
  • Facilitating interoperability between systems: data governance acts as the ‘glue’ that binds together the different technological silos of an organisation: quality, production, logistics, maintenance, purchasing, etc. The standardisation and semantic alignment of data allows for more agile flows and better informed decisions.
  • Boosting the circular economy: thanks to the full traceability of a product's life cycle, from design to recycling, it is possible to identify opportunities for reuse, material recovery and sustainable design. This is supported by data that follows the product throughout its life.

What should data governance look like in the industrial environment?

A data governance model adapted to this context should include:

Specific roles: it is necessary to have a defined team, where everyone's responsibility and tasks are clear. Some of the roles that cannot be missing are:

  • Data owners: responsible for the use of the data in their area (production, quality, maintenance...).
  • Data stewards: ensure the consistency, completeness and accuracy of the information.
  • Data governance team: coordinates the strategy, defines common policies and evaluates compliance.

Structured processes: Like the roles, it is necessary to define the various phases and operations to be carried out. These include the following:

  • Classification and cataloguing of data assets (by type, criticality, use).
  • Data quality control: definition of validation rules, cleaning of duplicates, exception management.
  • Data life cycle: from its creation on the machine to its archiving or destruction.
  • Access and security: privilege management, usage audits, traceability.

Organisational policies: to ensure interoperability and data quality it is necessary to have standards, norms and guidelines to guide users. Some examples are:

  • Standards for nomenclature, formats, encoding and synchronisation.
  • Standards for interoperability between systems (e.g. use of standards such as OPC UA or ISA-95).
  • Guidelines for ethical and legally compliant use (such as Data Regulation, GDPR or environmental legislation).

This approach makes industrial data an asset managed with the same rigour as any physical infrastructure.

Industrial use cases enabled by data governance

The benefits of data governance in industry are realised in multiple practical applications. Some of the most representative use cases are:

1.Predictive maintenance

One of the great classics of Industry 4.0. By combining historical maintenance data with real-time sensors, organisations can anticipate machine failures and avoid unexpected downtime. But this is only possible if the data is governed: if its capture frequency, format, responsible parties, quality and availability have been defined.

2. Complete product traceability

From raw material to end customer, every event in the value chain is recorded and accessible. This is vital for sectors such as food, automotive or pharmaceuticals, where traceability is both an added value and a regulatory obligation. Data governance ensures that this traceability is not lost, is verifiable and meets the required interoperability standards.

3. Digital twins and process simulation

For a digital twin - a virtual replica of a physical process or system - to work, it needs to be fed with accurate, up-to-date and consistent data. Data governance ensures synchronisation between the physical and virtual worlds, and allows the generation of reliable simulation scenarios, from the design of a new production line to the optimisation of the factory layout, i.e. of the different elements within the plant.

4. Energy monitoring and emission control

Real-time monitoring of energy, water or gas consumption can reveal hidden inefficiencies and opportunities for savings. Through intelligent dashboards and KPIs defined on governed data, industrial plants can reduce their costs and advance their environmental sustainability goals.

5. Automation and intelligent quality control

Machine vision systems and machine learning algorithms trained with production data allow to detect defects in real time, adjust parameters automatically and improve final quality. Without good data quality (accuracy, completeness, consistency), these algorithms can fail or generate unreliable results.

The National Industry Data Space: key to collaboration and competitiveness

For industrial data governance to transcend the scope of each company and become a real lever for sectoral transformation, it is necessary to have infrastructures that facilitate the secure, reliable and efficient sharing of data between organisations. The National Data Space for Industry, framed within the Plan for the Promotion of Sectoral Data Spaces promoted by the Ministry for Digital Transformation and the Civil Service, is in this line.

This space aims to create an environment of trust where companies, associations, technology centres and administrations can share and reuse industrial data in an interoperable manner, in accordance with ethical, legal and technical principles. Through this framework, the aim is to enable new forms of collaboration, accelerate innovation and reinforce the strategic autonomy of the national productive fabric.

The industrial sector in Spain is enormously diverse, with an ecosystem made up of large corporations, SMEs, suppliers, subcontractors, clusters and R&D centres. This diversity can become a strength if it is articulated through a common data infrastructure that facilitates the integration and exchange of information in an orderly and secure manner. Moreover, these industrial data can be complemented with open data published by public bodies, such as those available in the National Catalogue of Open Data, thus extending the value and possibilities of reuse for the sector as a whole.

The strengths of this common infrastructure allow:

  • Detect synergies along the value chain, such as industrial recycling opportunities between different sectors (e.g. plastic waste from one chemical industry as raw material in another).
  • Reducing entry barriers to digitisation, especially for SMEs that do not have the resources to deploy advanced data analytics solutions, but could access shared services or data within the space.
  • Encourage open innovation models where companies share data in a controlled way for the joint development of solutions based on artificial intelligence or predictive maintenance.
  • Promote sectoral aggregate indicators, such as shared carbon footprints, energy efficiency levels or industrial circularity indices, which allow the country as a whole to make more coordinated progress towards sustainability and competitiveness objectives.

The creation of the National Industrial Data Space can be a true lever for modernization for the Spanish industrial fabric:

  • Increased international competitiveness, by facilitating compliance with European market requirements, such as the Data Regulation, the Digital Product Passport, and sustainability standards.
  • Regulatory agility and improved traceability, allowing industries to respond quickly to audits, certifications, or regulatory changes.
  • Proactive capacity, thanks to the joint analysis of production, consumption, or market data that allows for the prediction of disruptions in supply chains or the demand for critical resources.
  • Creation of new business models, based on the provision of products as a service, the reuse of materials, or the shared leasing of industrial capacities.

The deployment of this national data space not only seeks to improve the efficiency of industrial processes. It also aims to strengthen the country's technological and data sovereignty, enabling a model where the value generated by data remains within the companies, regions, and sectors themselves. In this sense, the National Industrial Data Space aligns with European initiatives such as GAIA-X and Manufacturing-X, but with an approach adapted to the context and needs of the Spanish industrial ecosystem.

Conclusions

Data governance is a fundamental pillar for the industry to move toward more efficient, sustainable, and resilient models. Having large volumes of information is not enough: it must be managed properly to generate real value.

The benefits are clear: operational optimization, improved traceability, a boost to the circular economy, and support for technologies such as artificial intelligence and digital twins. But the real leap forward comes when data is no longer managed in isolation and becomes part of a shared ecosystem.

The National Industrial Data Space offers this framework for collaboration and trust, facilitating innovation, competitiveness, and technological sovereignty. Investing in its development means investing in a more connected, intelligent industry that is prepared for the challenges of the future.


Content prepared by Dr. Fernando Gualo, Professor at UCLM and Data Governance and Quality Consultant. The content and point of view reflected in this publication are the sole responsibility of its author.

calendar icon
Blog

The value of open satellite data in Europe

Satellites have become essential tools for understanding the planet and managing resources efficiently. The European Union (EU) has developed an advanced space infrastructure with the aim of providing real-time data on the environment, navigation and meteorology.

This satellite network is driven by four key programmes:.

  • Copernicus: Earth observation, environmental monitoring and climate change.
  • Galileo: high-precision satellite navigation, alternative to GPS.
  • EGNOS: improved positioning accuracy, key to aviation and navigation.
  • Meteosat: padvanced meteorological prediction and atmospheric monitoring.

Through these programmes, Europe not only ensures its technological independence, but also obtains data that is made available to citizens to drive strategic applications in agriculture, security, disaster management and urban planning.

In this article we will explore each programme, its satellites and their impact on society, including Spain''s role in each of them.

Copernicus: Europe''s Earth observation network

Copernicus is the EU Earth observation programme, managed by the European Commission with the technical support of the European Space Agency (ESA) and the European Organisation for the Exploitation of Meteorological Satellites (EUMETSAT).. It aims to provide free and open data about the planet to monitor climate change, manage natural resources and respond to emergencies.

The programme is structured into three main components:

  1. Space component: consists of a series of satellites called Sentinel, developed specifically for the needs of Copernicus. These satellites provide high quality data for various applications, such as land, sea and atmospheric monitoring.
  2. Component in situ: includes data collected through ground, air and sea stations. These data are essential to calibrate and validate the information obtained by the satellites, ensuring its accuracy and reliability.
  3. Operational Services: offers six thematic services that transform collected data into useful information for users:
    • Atmospheric monitoring
    • Marine monitoring
    • Terrestrial monitoring
    • Climate change
    • Emergency management
    • Safety

These services provide information in areas such as air quality, ocean status, land use, climate trends, disaster response and security, supporting informed decision-making in Europe.

Spain has played a key role in the manufacture of components for the Sentinel satellites. Spanish companies have developed critical structures and sensors, and have contributed to the development of data processing software.  Spain is also leading projects such as the Atlantic Constellation, which will develop small satellites for climate and oceanic monitoring.

Sentinel satellite

Satellite Technical characteristics Resolution Coverage (capture frequency) Uses
Sentinel-1 C-band SAR radar, resolution up to 5m Up to 5m Every 6 days Land and ocean monitoring, natural disasters
Sentinel-2 Multispectral camera (13 bands), resolution up to 10m  10m, 20m, 60m Every 5 days Agricultural management, forestry monitoring, water quality
Sentinel-3 Radiometer SLSTR, Spectrometer OLCI, Altimeter SRAL 300m (OLCI), 500m (SLSTR) Every 1-2 days Oceanic, climatic and terrestrial observation
Sentinel-5P Tropomi spectrometer, resolution 7x3.5 km². 7x3.5 km² Daily global coverage Air quality monitoring, trace gases
Sentinel-6 Altimeter Poseidon-4, vertical resolution 1 cm 1cm Every 10 days Sea level measurement, climate change

Figure 1. Table satellites Sentinel. Source: own elaboration

Galileo: the european GPS

Galileo is the global navigation satellite system developed by the European Union, managed by the European Space Agency (ESA) and operated by the European Union Space Programme Agency (EUSPA). It aims to provide a reliable and highly accurate global positioning service, independent of other systems such as the US GPS, China''s Beidou or Russia''s GLONASS. Galileo is designed for civilian use and offers free and paid services for various sectors, including transport, telecommunications, energy and finance.

Spain has played a leading role in the Galileo programme. The European GNSS Service Centre (GSC), located in Torrejón de Ardoz, Madrid, acts as the main contact point for users of the Galileo system. In addition, Spanish industry has contributed to the development and manufacture of components for satellites and ground infrastructure, strengthening Spain''s position in the European aerospace sector.

Satellite Technical characteristics Resolution Coverage (capture frequency) Uses
Galileo FOC Medium Earth Orbit (MEO), 24 operatives N/A Continuous Precise positioning, land and maritime navigation
Galileo IOV First test satellites of the Galileo system  N/A Continuous Initial testing of Galileo before FOC

Figure 2. Satellite Galileo. Source: own elaboration

EGNOS: improving the accuracy of GPS and Galileo

 The European Geostationary Navigation Overlay Service (EGNOS) is the European satellite-based augmentation system (Satellite Based Augmentation System or SBAS) designed to improve the accuracy and reliability of global navigation satellite systems ( Global Navigation Satellite System, GNSS), such as GPS and, in the future, Galileo. EGNOS provides corrections and integrity data that allow users in Europe to determine their position with an accuracy of up to 1.5 metres, making it suitable for safety-critical applications such as aviation and maritime navigation.

Spain has played a leading role in the development and operation of EGNOS. Through ENAIRE, Spain hosts five RIMS Reference Stations located in Santiago, Palma, Malaga, Gran Canaria and La Palma. In addition, the Madrid Air Traffic Control Centre, located in Torrejón de Ardoz, hosts one of the EGNOS Mission Control Centres (MCC), operated by ENAIRE. The Spanish space industry has contributed significantly to the development of the system, with companies participating in studies for the next generation of EGNOS.

Satellite Technical characteristics Resolution Coverage (capture frequency) Uses
EGNOS Geo Geostationary GNSS correction satellites N/A Real-time GNSS correction GNSS signal correction for aviation and transportation

Figure 3. Table satellite EGNOS. Source: own elaboration

Meteosat: high precision weather forecasting

The Meteosat programme consists of a series of geostationary meteorological satellites initially developed by the European Space Agency (ESA) and currently operated by the European Organisation for the Exploitation of Meteorological Satellites (EUMETSAT). These satellites are positioned in geostationary orbit above the Earth''s equator, allowing continuous monitoring of weather conditions over Europe, Africa and the Atlantic Ocean. Its main function is to provide images and data to facilitate weather prediction and climate monitoring.

Spain has been an active participant in the Meteosat programme since its inception. Through the Agencia Estatal de Meteorología (AEMET), Spain contributes financially to EUMETSAT and participates in the programme''s decision-making and operations. In addition, the Spanish space industry has played a key role in the development of the Meteosat satellites. Spanish companies have been responsible for the design and supply of critical components for third-generation satellites, including scanning and calibration mechanisms.

Satellite Technical characteristics Resolution Cobertura (frecuencia de captura) Usos
Meteosat Primera Gen.  Initial weather satellites, low resolution Low resolution Every 30min Basic weather forecast, images every 30 min.
Meteosat Segunda Gen. Higher spectral and temporal resolution, data every 15 min. High resolution Every 15min Improved accuracy, early detection of weather events
Meteosat Tercera Gen. High-precision weather imaging, lightning detection High resolution High frequency High-precision weather imaging, lightning detection

Figure 4. Metosat satellite. Source: own elaboration

Access to the data of each programme

Each programme has different conditions and distribution platforms in terms of access to data:

  • Copernicus: provides free and open data through various platforms.  Users can access satellite imagery and products through the Copernicus Data Space Ecosystem, which offers search, download and processing tools. Data can also be obtained through APIs for integration into automated systems.
  • Galileo: its open service (Open Service - OS) allows free use of the navigation signals for any user with a compatible receiver, free of charge. However, direct access to raw satellite data is not provided. For information on services and documentation, access is via the European GNSS Services Centre (GSC):
    • Galileo Portal.
    • Registration for access to the High Accuracy Service (HAS) (registration required).
  • EGNOS: This system improves navigation accuracy with GNSS correction signals.  Data on service availability and status can be found on the EGNOS User Support platform..
  • Meteosat: Meteosat satellite data are available through the EUMETSAT platform. There are different levels of access, including some free data and some subject to registration or payment.  For imagery and meteorological products you can access the EUMETSAT Data Centre..

In terms of open access, Copernicus is the only programme that offers open and unrestricted data. In contrast, Galileo and EGNOS provide free services, but not access to raw satellite data, while Meteosat requires registration and in some cases payment for access to specific data.

Conclusions

The Copernicus, Galileo, EGNOS and Meteosat programmes not only reinforce Europe''s space sovereignty, but also ensure access to strategic data essential for the management of the planet. Through them, Europe can monitor climate change, optimise global navigation, improve the accuracy of its positioning systems and strengthen its weather predictioncapabilities, ensuring more effective responses to environmental crises and emergencies.

Spain plays a fundamental role in this space infrastructure, not only with its aerospace industry, but also with its control centres and reference stations, consolidating itself as a key player in the development and operation of these systems.

Satellite imagery and data have evolved from scientific tools to become essential resources for security, environmental management and sustainable growth. In a world increasingly dependent on real-time information, access to this data is critical for climate resilience, spatial planning, sustainable agriculture and ecosystem protection.

The future of Earth observation and satellite navigation is constantly evolving, and Europe, with its advanced space programmes, is positioning itself as a leader in the exploration, analysis and management of the planet from space.

Access to this data allows researchers, businesses and governments to make more informed and effective decisions. With these systems, Europe and Spain guarantee their technological independence and strengthen their leadership in the space sector.

Ready to explore more? Access the links for each programme and discover how this data can transform our world.

Copernicus https://dataspace.copernicus.eu/ Download centre
Meteosat https://user.eumetsat.int/data-access/data-centre/  Download centre
Galileo  https://www.gsc-europa.eu/galileo/services/galileo-high-accuracy-servic…/   Download centre, after registration
EGNOS https://egnos-user-support.essp-sas.eu/ Project

Figure 5. Source: own elaboration


Content prepared by Mayte Toscano, Senior Consultant in Data Economy Technologies. The contents and points of view reflected in this publication are the sole responsibility of the author.

calendar icon
Blog

Digital transformation has reached almost every aspect and sector of our lives, and the world of products and services is no exception. In this context, the Digital Product Passport (DPP) concept is emerging as a revolutionary tool to foster sustainability and the circular economy. Accompanied by initiatives such as CIRPASS (Circular Product Information System for Sustainability), the DPP promises to change the way we interact with products throughout their life cycle. In this article, we will explore what DPP is, its origins, applications, risks and how it can affect our daily lives and the protection of our personal data.

What is the Digital Product Passport (DPP)? Origin and importance

The Digital Product Passport is a digital collection of key information about a product, from manufacturing to recycling. This passport allows products to be tracked and managed more efficiently, improving transparency and facilitating sustainable practices. The information contained in a DPP may include details on the materials used, the manufacturing process, the supply chain, instructions for use and how to recycle the product at the end of its life.

The DPP has been developed in response to the growing need to promote the circular economy and reduce the environmental impact of products. The European Union (EU) has been a pioneer in promoting policies and regulations that support sustainability. Initiatives such as the EU's Circular Economy Action Plan have been instrumental in driving the DPP forward. The objectives of this plan are as follows:

  • Greater Transparency: Consumers no longer have to guess about the origin of their products and how to dispose of them correctly. With a machine-readable DPP (e.g. QR code or NFC tag) attached to end products, consumers can make informed purchasing decisions and brands can eliminate greenwashing with confidence.
  • Simplified Compliance: By creating an audit of events and transactions in a product's value chain, the DPP provides the brand and its suppliers with the necessary data to address compliance demands efficiently.
  • Sustainable Production: By tracking and reporting the social and environmental impacts of a product from source to disposal, brands can make data-driven decisions to optimise sustainability in product development.
  • Circular Economy: The DPP facilitates a circular economy by promoting eco-design and the responsible production of durable products that can be reused, remanufactured and disposed of correctly.

The following image summarises the main advantages of the digital passport at each stage of the digital product manufacturing process:

CIRPASS as a facilitator of DPP implementation

CIRPASS is a platform that supports the implementation of the DPP. This European initiative aims to standardise the collection and exchange of data on products, facilitating their traceability and management throughout their life cycle. CIRPASS plays a crucial role in creating an interoperable digital framework that connects manufacturers, consumers and recyclers.

DPP applications in various sectors

On 5 March 2024, CIRPASS, in collaboration with the European Commission, organised an event on the future development of the Digital Product Passport. The event brought together various stakeholders from different industries and organisations, who, with an eminently practical approach presented and discussed various aspects of the upcoming regulation and its requirements, possible solutions, examples of use cases, and the obstacles and opportunities for the affected industries and businesses.

The following are the applications of DPP in various sectors as explained at the event:

  1. Textile industry: It allows consumers to know the origin of the garments, the materials used and the working conditions in the factories.
  2. Electronics: Facilitates recycling and reuse of components, reducing electronic waste.
  3. Automotive: It assists in tracking parts and materials, promoting the repair and recycling of vehicles.
  4. Power supply: It provides information on food traceability, ensuring safety and sustainability in the supply chain.

 The impact of the DPP on citizens' lives

But what impact will the use of this kind of novel paradigm have on our daily lives? And how does this impact on us as end users of multiple products and services such as those mentioned above? We will focus on four base cases: informed consumers in any field, ease of product repair, trust and transparency, and efficient recycling.

The DPP provides consumers with access to detailed information about the products they buy, such as their origin, materials and production practices. This allows consumers to make more informed choices and opt for products that are sustainable and ethical. For example, a consumer can choose a garment made from organic materials and produced under fair labour conditions, thus promoting responsible and conscious consumption.

Similarly, one of the great benefits of the DPP is the inclusion of repair guides within the digital passport. This means that consumers can easily access detailed instructions on how to repair a product instead of discarding it when it breaks down. For example, if an appliance stops working, the DPP can provide a step-by-step repair manual, allowing the user to fix it himself or take it to a technician with the necessary information. This not only extends the lifetime of products, but also reduces e-waste and promotes sustainability.

Also, access to detailed and transparent product information through the DPP can increase consumers' trust in brands. Companies that provide a complete and accurate DPP demonstrate their commitment to transparency and accountability, which can enhance their reputation and build customer loyalty. In addition, consumers who have access to this information are better able to make responsible purchasing decisions, thus encouraging more ethical and sustainable consumption habits.

Finally, the DPP facilitates effective recycling by providing clear information on how to break down and reuse the materials in a product. For example, a citizen who wishes to recycle an electronic device can consult the DPP to find out which parts can be recycled and how to separate them properly. This improves the efficiency of the recycling process and ensures that more materials are recovered and reused instead of ending up in landfill, contributing to a circular economy.

Risks and challenges of the DPP

Similarly, as a novel technology and as part of the digital transformation that is taking place in the product sectors, the DPP also presents certain challenges, risks and challenges such as:

  1. Data Protection: The collection and storage of large amounts of data can put consumers' privacy at risk if not properly managed.
  2. Security: Digital data is vulnerable to cyber-attacks, which requires robust security measures.
  3. Interoperability: Standardisation of data across different industries and countries can be complex, making it difficult to implement the DPP on a large scale.
  4. Costs: Creating and maintaining digital passports can be costly, especially for small and medium-sized enterprises.

Data protection implications

The implementation of the DPP and systems such as CIRPASS implies careful management of personal data. It is essential that companies and digital platforms comply with data protection regulations, such as the EU's General Data Protection Regulation (GDPR). Organisations must ensure that the data collected is used in a transparent manner and with the explicit consent of consumers. In addition, advanced security measures must be implemented to protect the integrity and confidentiality of the data.

Relationship with European Data Spaces

The European Data Spaces are an EU initiative to create a single market for data, promoting innovation and the digital economy. The DPP and CIRPASS are aligned with this vision, as they encourage the exchange of information between different actors in the economy. Data interoperability is essential for the success of the European Data Spaces, and the DPP can contribute significantly to this goal by providing structured and accessible product data.

Conclusion

In conclusion, the Digital Product Passport and the CIRPASS initiative represent a significant step towards a more circular and sustainable economy. Through the collection and exchange of detailed product data, these systems can improve transparency, encourage responsible consumption practices and reduce environmental impact. However, their implementation requires overcoming challenges related to data protection, security and interoperability. As we move towards a more digitised future, the DPP and CIRPASS have the potential to transform the way we interact with products and contribute to a more sustainable world.


Content prepared by Dr. Fernando Gualo, Professor at UCLM and Data Governance and Quality Consultant The content and the point of view reflected in this publication are the sole responsibility of its author.

calendar icon
Documentación

1. Introduction

In the information age, artificial intelligence has proven to be an invaluable tool for a variety of applications. One of the most incredible manifestations of this technology is GPT (Generative Pre-trained Transformer), developed by OpenAI. GPT is a natural language model that can understand and generate text, providing coherent and contextually relevant responses. With the recent introduction of Chat GPT-4, the capabilities of this model have been further expanded, allowing for greater customisation and adaptability to different themes.

In this post, we will show you how to set up and customise a specialised critical minerals wizard using GPT-4 and open data sources. As we have shown in previous publications critical minerals are fundamental to numerous industries, including technology, energy and defence, due to their unique properties and strategic importance. However, information on these materials can be complex and scattered, making a specialised assistant particularly useful.

The aim of this post is to guide you step by step from the initial configuration to the implementation of a GPT wizard that can help you to solve doubts and provide valuable information about critical minerals in your day to day life. In addition, we will explore how to customise aspects of the assistant, such as the tone and style of responses, to perfectly suit your needs. At the end of this journey, you will have a powerful, customised tool that will transform the way you access and use critical open mineral information.

Access the data lab repository on Github.

2. Context

The transition to a sustainable future involves not only changes in energy sources, but also in the material resources we use. The success of sectors such as energy storage batteries, wind turbines, solar panels, electrolysers, drones, robots, data transmission networks, electronic devices or space satellites depends heavily on access to the raw materials critical to their development. We understand that a mineral is critical when the following factors are met:

  • Its global reserves are scarce
  • There are no alternative materials that can perform their function (their properties are unique or very unique)
  • They are indispensable materials for key economic sectors of the future, and/or their supply chain is high risk

You can learn more about critical minerals in the post mentioned above.

3. Target

This exercise focuses on showing the reader how to customise a specialised GPT model for a specific use case. We will adopt a "learning-by-doing" approach, so that the reader can understand how to set up and adjust the model to solve a real and relevant problem, such as critical mineral expert advice. This hands-on approach not only improves understanding of language model customisation techniques, but also prepares readers to apply this knowledge to real-world problem solving, providing a rich learning experience directly applicable to their own projects.

The GPT assistant specialised in critical minerals will be designed to become an essential tool for professionals, researchers and students. Its main objective will be to facilitate access to accurate and up-to-date information on these materials, to support strategic decision-making and to promote education in this field. The following are the specific objectives we seek to achieve with this assistant:

  • Provide accurate and up-to-date information:
    • The assistant should provide detailed and accurate information on various critical minerals, including their composition, properties, industrial uses and availability.
    • Keep up to date with the latest research and market trends in the field of critical minerals.
  • Assist in decision-making:
    • To provide data and analysis that can assist strategic decision making in industry and critical minerals research.
    • Provide comparisons and evaluations of different minerals in terms of performance, cost and availability.
  • Promote education and awareness of the issue:
    • Act as an educational tool for students, researchers and practitioners, helping to improve their knowledge of critical minerals.
    • Raise awareness of the importance of these materials and the challenges related to their supply and sustainability.

4. Resources

To configure and customise our GPT wizard specialising in critical minerals, it is essential to have a number of resources to facilitate implementation and ensure the accuracy and relevance of the model''s responses. In this section, we will detail the necessary resources that include both the technological tools and the sources of information that will be integrated into the assistant''s knowledge base.

Tools and Technologies

The key tools and technologies to develop this exercise are:

  • OpenAI account: required to access the platform and use the GPT-4 model. In this post, we will use ChatGPT''s Plus subscription to show you how to create and publish a custom GPT. However, you can develop this exercise in a similar way by using a free OpenAI account and performing the same set of instructions through a standard ChatGPT conversation.
  • Microsoft Excel: we have designed this exercise so that anyone without technical knowledge can work through it from start to finish. We will only use office tools such as Microsoft Excel to make some adjustments to the downloaded data.

In a complementary way, we will use another set of tools that will allow us to automate some actions without their use being strictly necessary:

  • Google Colab: is a Python Notebooks environment that runs in the cloud, allowing users to write and run Python code directly in the browser. Google Colab is particularly useful for machine learning, data analysis and experimentation with language models, offering free access to powerful computational resources and facilitating collaboration and project sharing.
  • Markmap: is a tool that visualises Markdown mind maps in real time. Users write ideas in Markdown and the tool renders them as an interactive mind map in the browser. Markmap is useful for project planning, note taking and organising complex information visually. It facilitates understanding and the exchange of ideas in teams and presentations.

Sources of information

With these resources, you will be well equipped to develop a specialised GPT assistant that can provide accurate and relevant answers on critical minerals, facilitating informed decision-making in the field.

5. Development of the exercise

5.1. Building the knowledge base

For our specialised critical minerals GPT assistant to be truly useful and accurate, it is essential to build a solid and structured knowledge base. This knowledge base will be the set of data and information that the assistant will use to answer queries. The quality and relevance of this information will determine the effectiveness of the assistant in providing accurate and useful answers.

Search for Data Sources

We start with the collection of information sources that will feed our knowledge base. Not all sources of information are equally reliable. It is essential to assess the quality of the sources identified, ensuring that:

  • Information is up to date: the relevance of data can change rapidly, especially in dynamic fields such as critical minerals.
  • The source is reliable and recognised: it is necessary to use sources from recognised and respected academic and professional institutions.
  • Data is complete and accessible: it is crucial that data is detailed and accessible for integration into our wizard.

 In our case, we developed an online search in different platforms and information repositories trying to select information belonging to different recognised entities:

Selection and preparation of information

We will now focus on the selection and preparation of existing information from these sources to ensure that our GPT assistant can access accurate and useful data.

RMIS of the Joint Research Center of the European Union:

  • Selected information:

We selected the report "Supply chain analysis and material demand forecast in strategic technologies and sectors in the EU - A foresight study". This is an analysis of the supply chain and demand for minerals in strategic technologies and sectors in the EU. It presents a detailed study of the supply chains of critical raw materials and forecasts the demand for minerals up to 2050.

  • Necessary preparation: 

The format of the document, PDF, allows the direct ingestion of the information by our assistant. However, as can be seen in Figure 1, there is a particularly relevant table on pages 238-240 which analyses, for each mineral, its supply risk, typology (strategic, critical or non-critical) and the key technologies that employ it. We therefore decided to extract this table into a structured format (CSV), so that we have two pieces of information that will become part of our knowledge base.

Table of minerals contained in the JRC PDF

Figure 1: Table of minerals contained in the JRC PDF

To programmatically extract the data contained in this table and transform it into a more easily processable format, such as CSV(comma separated values), we will use a Python script that we can use through the platform Google Colab platform (Figure 2).

Python script for the extraction of data from the JRC PDF developed on the Google Colab platform.

Figure 2: Script Python para la extracción de datos del PDF de JRC desarrollado en plataforma Google Colab.

To summarise, this script:

  1. It is based on the open source library PyPDF2capable of interpreting information contained in PDF files.
  2. First, it extracts in text format (string) the content of the pages of the PDF where the mineral table is located, removing all the content that does not correspond to the table itself.
  3. It then goes through the string line by line, converting the values into columns of a data table. We will know that a mineral is used in a key technology if in the corresponding column of that mineral we find a number 1 (otherwise it will contain a 0).
  4. Finally, it exports the table to a CSV file for further use.

International Energy Agency (IEA):

  • Selected information:

We selected the report "Global Critical Minerals Outlook 2024". It provides an overview of industrial developments in 2023 and early 2024, and offers medium- and long-term prospects for the demand and supply of key minerals for the energy transition. It also assesses risks to the reliability, sustainability and diversity of critical mineral supply chains.

  • Necessary preparation:

The format of the document, PDF, allows us to ingest the information directly by our virtual assistant. In this case, we will not make any adjustments to the selected information.

Spanish Geological and Mining Institute''s Minerals Database (BDMIN)

  • Selected information:

In this case, we use the form to select the existing data in this database for indications and deposits in the field of metallogeny, in particular those with lithium content.

Dataset selection in BDMIN.

Figure 3: Dataset selection in BDMIN.

  • Necessary preparation:

We note how the web tool allows online visualisation and also the export of this data in various formats. Select all the data to be exported and click on this option to download an Excel file with the desired information.

BDMIN Visualization and Download Tool

Figure 4: Visualization and download tool in BDMIN

Data downloaded BDMIN

Figure 5: BDMIN Downloaded Data.

All the files that make up our knowledge base can be found at GitHub, so that the reader can skip the downloading and preparation phase of the information.

5.2. GPT configuration and customisation for critical minerals

When we talk about "creating a GPT," we are actually referring to the configuration and customisation of a GPT (Generative Pre-trained Transformer) based language model to suit a specific use case. In this context, we are not creating the model from scratch, but adjusting how the pre-existing model (such as OpenAI''s GPT-4) interacts and responds within a specific domain, in this case, on critical minerals.

First of all, we access the application through our browser and, if we do not have an account, we follow the registration and login process on the ChatGPT platform. As mentioned above, in order to create a GPT step-by-step, you will need to have a Plus account. However, readers who do not have such an account can work with a free account by interacting with ChatGPT through a standard conversation.

Screenshot of the ChatGPT login and registration page.

Figure 6: ChatGPT login and registration page.

Once logged in, select the "Explore GPT" option, and then click on "Create" to begin the process of creating your GPT.

Screenshot of the creation page of a new GPT.

Figure 7: Creation of new GPT.

The screen will display the split screen for creating a new GPT: on the left, we will be able to talk to the system to indicate the characteristics that our GPT should have, while on the left we will be able to interact with our GPT to validate that its behaviour is adequate as we go through the configuration process.

Screenshot of the new GPT creation screen.

Figure 8: Screen of creating new GPT.

In the GitHub of this project, we can find all the prompts or instructions that we will use to configure and customise our GPT and that we will have to introduce sequentially in the "Create" tab, located on the left tab of our screens, to complete the steps detailed below.

The steps we will follow for the creation of the GPT are as follows:

  1. First, we will outline the purpose and basic considerations for our GPT so that you can understand how to use it.

Capture the basic instructions of GPT again.

Figure 9: Basic instructions for new GPT.

2. We will then create a name and an image to represent our GPT and make it easily identifiable. In our case, we will call it MateriaGuru.

Screenshot for name selection for new GPT.

Figure 10: Name selection for new GPT.

Screenshot for image creation for GPT.

Figure 11: Image creation for GPT.

3.We will then build the knowledge base from the information previously selected and prepared to feed the knowledge of our GPT.

Capturing the information upload to the knowledge base of new GPT

Capturing the Knowledge Base Load of New GPT II

Figure 12: Uploading of information to the new GPT knowledge base.

4. Now, we can customise conversational aspects such as their tone, the level of technical complexity of their response or whether we expect brief or elaborate answers.

5. Lastly, from the "Configure" tab, we can indicate the  conversation starters desired so that users interacting with our GPT have some ideas to start the conversation in a predefined way.

Screenshot of the Configure GPT tab.

Figure 13: Configure GPT tab.

In Figure 13 we can also see the final result of our training, where key elements such as their image, name, instructions, conversation starters or documents that are part of their knowledge base appear.

5.3. Validation and publication of GPT

Before we sign off our new GPT-based assistant, we will proceed with a brief validation of its correct configuration and learning with respect to the subject matter around which we have trained it. For this purpose, we prepared a battery of questions that we will ask MateriaGuru to check that it responds appropriately to a real scenario of use.

# Question Answer
1 Which critical minerals have experienced a significant drop in prices in 2023? Battery mineral prices saw particularly large drops with lithium prices falling by 75% and cobalt, nickel and graphite prices falling by between 30% and 45%.
2 What percentage of global solar photovoltaic (PV) capacity was added by China in 2023? China accounted for 62% of the increase in global solar PV capacity in 2023.
3 What is the scenario that projects electric car (EV) sales to reach 65% by 2030? The Net Zero Emissions (NZE) scenario for 2050 projects that electric car sales will reach 65% by 2030.
4 What was the growth in lithium demand in 2023? Lithium demand increased by 30% in 2023.
5 Which country was the largest electric car market in 2023? China was the largest electric car market in 2023 with 8.1 million electric car sales representing 60% of the global total.
6 What is the main risk associated with market concentration in the battery graphite supply chain? More than 90% of battery-grade graphite and 77% of refined rare earths in 2030 originate in China, posing a significant risk to market concentration.
7 What proportion of global battery cell production capacity was in China in 2023? China owned 85% of battery cell production capacity in 2023.
8 How much did investment in critical minerals mining increase in 2023? Investment in critical minerals mining grew by 10% in 2023.
9 What percentage of battery storage capacity in 2023 was composed of lithium iron phosphate (LFP) batteries? By 2023, LFP batteries would constitute approximately 80% of the total battery storage market.
10 What is the forecast for copper demand in a net zero emissions (NZE) scenario for 2040? In the net zero emissions (NZE) scenario for 2040, copper demand is expected to have the largest increase in terms of production volume.

Figure 14: Table with battery of questions for the validation of our GPT.

Using the preview section on the right-hand side of our screens, we launch the battery of questions and validate that the answers correspond to those expected.

Capture of the GPT response validation process.

Figure 15: Validation of GPT responses.

Finally, click on the "Create" button to finalise the process. We will be able to select between different alternatives to restrict its use by other users.

Screenshot for publication of our GPT.

Figure 16: Publication of our GPT.

6. Scenarios of use

In this section we show several scenarios in which we can take advantage of MateriaGuru in our daily life. On the GitHub of the project you can find the prompts used to replicate each of them.

6.1. Consultation of critical minerals information

The most typical scenario for the use of this type of GPTs is assistance in resolving doubts related to the topic in question, in this case, critical minerals. As an example, we have prepared a set of questions that the reader can pose to the GPT created to understand in more detail the relevance and current status of a critical material such as graphite from the reports provided to our GPT.

Capture of the process of resolving critical mineral doubts. 

Figure 17: Resolution of critical mineral queries.

We can also ask you specific questions about the tabulated information provided on existing sites and evidence on Spanish territory.

Screenshot of the answer to the question about lithium reserves in Extremadura.

Figure 18: Lithium reserves in Extremadura.

6.2. Representation of quantitative data visualisations

Another common scenario is the need to consult quantitative information and make visual representations for better understanding. In this scenario, we can see how MateriaGuru is able to generate an interactive visualisation of graphite production in tonnes for the main producing countries.

Capture of the interactive visualization generated with our GPT.

Figure 19: Interactive visualisation generation with our GPT.

6.3. Generating mind maps to facilitate understanding

Finally, in line with the search for alternatives for a better access and understanding of the existing knowledge in our GPT, we will propose to MateriaGuru the construction of a mind map that allows us to understand in a visual way key concepts of critical minerals. For this purpose, we use the open Markmap notation (Markdown Mindmap), which allows us to define mind maps using markdown notation.

Capture of the process for generating mind maps from our GPT.

Figure 20: Generation of mind maps from our GPT

We will need to copy the generated code and enter it in a  markmapviewer in order to generate the desired mind map. We facilitate here a version of this code generated by MateriaGuru.

Capturing Mind Map Visualization

Figure 21: Visualisation of mind maps.

7. Results and conclusions

In the exercise of building an expert assistant using GPT-4, we have succeeded in creating a specialised model for critical minerals. This wizard provides detailed and up-to-date information on critical minerals, supporting strategic decision making and promoting education in this field. We first gathered information from reliable sources such as the RMIS, the International Energy Agency (IEA), and the Spanish Geological and Mining Institute (BDMIN). We then process and structure the data appropriately for integration into the model. Validations showed that the wizard accurately answers domain-relevant questions, facilitating access to your information.

In this way, the development of the specialised critical minerals assistant has proven to be an effective solution for centralising and facilitating access to complex and dispersed information.

The use of tools such as Google Colab and Markmap has enabled better organisation and visualisation of data, increasing efficiency in knowledge management. This approach not only improves the understanding and use of critical mineral information, but also prepares users to apply this knowledge in real-world contexts.

The practical experience gained in this exercise is directly applicable to other projects that require customisation of language models for specific use cases.

8. Do you want to do the exercise?

If you want to replicate this exercise, access this this repository where you will find more information (the prompts used, the code generated by MateriaGuru, etc.)

Also, remember that you have at your disposal more exercises in the section "Step-by-step visualisations".


Content elaborated by Juan Benavente, industrial engineer and expert in technologies linked to the data economy. The contents and points of view reflected in this publication are the sole responsibility of the author.

calendar icon
Blog

The energy transition is also a transition of raw materials. When we imagine a sustainable future, we conceive it based on a series of strategic sectors such as renewable energies or electric mobility. Similarly, we imagine a connected and digital future, where new innovations and business models related to the fourth industrial revolution allow us to solve global challenges such as food shortages or access to education. In short, we focus on technologies that help us improve our quality of life.

Why are critical minerals important?

These sectors depend on a series of key technologies, such as energy storage batteries, wind turbines, solar panels, electrolyzers, drones, robots, data transmission networks, electronic devices and space satellites. These are technologies that in recent years have undergone a great technological evolution and an enormous growth in demand worldwide. If we analyze the development forecasts to 2030, we can expect annual growth of at least double digits for many of them, as shown in Figure 1.

 10% CAGR U.S. CPD demand (2022-2030); 16% CAGR Renewable Energy investments (wind and solar PV 2021-2030); 27% CAGR Li-Ion Battery Demand (2022-2030)

Figure 1: Expected growth up to 2030 of some of the key technologies for strategic sectors. Source: McKinsey (image 1, image 2, image 3)

However, as can be seen in Figure 2, many of these future technologies are highly dependent on a set of critical raw materials necessary for their development. Indium and gallium are key to the manufacture of energy-efficient LED lighting, silicon is indispensable for the manufacture of microchips and semiconductors, and the platinum group of metals (such as iridium, palladium, platinum rhodium or ruthenium) are used in catalysts for hydrogen electrolyzers.

Semi-quantitative representation of raw material flows to the fifteen key technologies and five strategic sectors

Figure 2: Semi-quantitative representation of raw material flows to the fifteen key technologies and five strategic sectors. Source: JRC Study

So, when does a material become critical? There are several factors that allow us to determine whether a raw material is considered critical:

  • Its world reserves are scarce
  • There are no alternative materials that can perform their function (their properties are unique or very unique).
  • They are indispensable materials for key economic sectors of the future, and/or their supply chain is high-risk.

In the words of Margrethe Vestager, Executive Vice-President of the European Commission, "without a secure and sustainable supply of critical raw materials, there will be no green (sustainable) and industrial transition".

Research into sources of critical minerals data

In order to know in detail the situation of public minerals in Europe, we need to locate quality data. A task for which we will have to look into several sources.

First of all, we go to the European open data portal. From its search engine, in a first iteration, we see that there are more than 46,000 datasets for the query "critical raw materials" (Figure 3).

Screenshot of the search for critical raw materials in the European data portal.

After a first analysis of the available data categories, we adjusted the filters until we narrowed down the datasets of interest to 190 (Figure 4). Particular attention is drawn to the data published by the JRC (European Commission Joint Research Center) and, in particular, to the dataset entitled Critical Raw Materials (CRM), 2020 assessment.

Screenshot of a second search for critical raw materials in the European data portal

Figure 4: Second search for critical raw materials in the European data portal.

This dataset contains a direct link to a web portal, the RMIS (Raw Material Information System), which is actually the European Commission's reference knowledge base on raw materials through which we can access very relevant data and analysis.

Capture of RMIS (Raw Material Information System), the European Commission's reference knowledge base on raw materials.

Figure 5: RMIS - European Commission's knowledge base for raw materials

Through the RMIS, we find a very interesting publication for any study on the subject. Although this publication is in PDF format, it allows us to access the list of strategic, critical and non-critical materials identified by the European Commission indicating their level of criticality and their use in different key technologies as shown in Figure 6.

Table of strategic, critical and non-critical raw materials used in different key technologies contained in PDF file

Figure 6: Table of strategic, critical and non-critical raw materials used different key technologies contained in the PDF file. Source: Supply chain analysis and material demand forecast in strategic technologies and sectors in the EU - A foresight study, JRC 2023.

Continuing our exploration, in this case in search of data on mineral reserves in the European continent, we found the European Gelological Data Infrastructure (EDGI) platform, which has an extensive catalog with more than 5,700 datasets and geological services. In our case, after performing a search in its data catalog, we selected three datasets containing interesting information in terms of findings of critical lithium, cobalt and graphite minerals (Figure 7).

Screenshot of EDGI catalog dataset search

Figure 7: Searching for datasets in the EDGI catalog

From the EDGI viewer, we can view the contents of these three datasets before downloading them in GeoJSON format (Figure 8). The three datasets have been originated from the  FRAME project (Forecasting And Assessing Europe's Strategic Raw Materials Needs), in which multiple European entities participate, including the Geological and Mining Institute of Spain (IGME).

Capture of selected datasets query through EDGI visualization platform

Figure 8: Querying selected datasets through EDGI visualization platform. Source: Map of cobalt occurrences in Europe, Map of graphite occurrences in Europe, Map of lithium occurrences in Europe, FRAME project.

Lastly, we went to the data portal of the International Energy Agency (IEA) (Figure 9). In this case, we found, among its more than 70 datasets, one directly related to our field of research, entitled Critical Minerals Demand Dataset, which we proceeded to download for further analysis in excel format.

Capture from the IEA website

Figure 9: Capture of the International Energy Agency (IEA) data portal.

After this search, we have located some interesting data that can help us to carry out different analyses.

Although this exercise has been carried out under the theme of critical minerals, European open data portals provide a large amount of information and diverse data sets on many areas of interest that can help us understand the challenges we face as a society, from the energy transition to the fight against poverty or food waste. Data that will allow us to carry out analyses aimed at making better decisions to move towards a more prosperous and sustainable future.


Content elaborated by Juan Benavente, industrial engineer and expert in technologies linked to the data economy. The contents and points of view reflected in this publication are the sole responsibility of the author.

calendar icon
Noticia

The European Commission has published the first Report of the state of the Digital Decade. It takes stock of the progress of the European Union on its way to the success of the digital transformation, as set out in the Digital Decade Policy Agenda 2030. The document builds on the agreements and recommendations adopted at European level on digitization towards 2030, including the Council Recommendation (EU) 2021/2122 of 26 November 2021 on a Pact for Research and Innovation in Europe.  

Various studies and statistical sources, such as the Eurobarometer on the Digital Decade, the Strategic Foresight Report for 2021 and 2022, as well as different statistical sets from Eurostat on the state of technology, have been taken into account in compiling the report. It also includes the monitoring of the European Declaration on Digital Rights and Principles for the Digital Decade, which translates the EU's vision on digital transformation into principles and commitments. 

According to the report, the success of the Digital Decade will be crucial for the future prosperity of the EU. Achieving the agenda could unlock more than €2.8 trillion in economic value, equivalent to 21% of the EU's current economy. To achieve this end, the text highlights the need to incentivize policies and investments in digital technologies, skills and infrastructure.   

On this basis, the report includes concrete recommendations to Member States prior to the adoption of their national strategic roadmaps and for their future adjustments. 

Spain, a major player  

The Commission has prepared a report for each member country in which it exhaustively analyzes its strengths and future duties. About Spain it points out that the country is "in an ambitious digital transformation of the economy" and high rates in connectivity: Spain is, according to the report, one of the EU states with the best results in digital infrastructure.  

The three key aspects analyzed in the report are: 

  1. Connectivity and Public Administration  

In this regard, the report points out that Spain is "well above the EU average" in terms of very high capacity fixed networks, with 93% availability compared to 73% of the EU average. In fiber coverage "far exceeds" the EU average, with 91% connectivity compared to 56% of the EU average.   

In terms of other technologies such as 5G, Spain is at the European average, with 82% improved broadband access. In addition, the document reveals that 98% of all pioneering 5G bands have already been allocated in the country and that "appropriate measures are being implemented to achieve the objectives of the Digital Decade" such as the Telecommunications Law of 2022. 

The report also points out that Spain "is at the forefront of e-Government and digital public services in the EU". It also highlights the effort to update services and infrastructures to adapt them to the "rapid technological evolution" and the needs of companies. In fact, Spain is "well above the EU average" in indicators that measure the number of Internet users who use electronic administration services, with 84% compared to 74% in the EU or in digital public services for citizens, with 86% access, and companies, with 91%. In this sense, it highlights the work that has been done in the interoperability of digital public services at national, regional and local level.  

The report includes some recommendations on healthcare digitization and notes that the country should continue to implement its public service digitization policies to "connect more types of healthcare providers to medical records" and implement measures "to ensure regionally comparable quality of service and completeness of healthcare data." 

  1. Economic development and companies  

 In terms of economic and technological development, the report praises the country's progress with PERTE Chip, whose objective is the development of semiconductors and cutting-edge technologies to strengthen their design and production in Spanish industry in order to "promote national and EU strategic sovereignty". In this field, it also highlights Spain's participation in the Important Project of Common European Interest (IPCEI) on Microelectronics and Communication Technologies with 11 direct participants active in various fields (material, open source design, equipment, packaging, connectivity, photonics) and that "could help to achieve the first European computer with quantum acceleration by 2025". 

As for SMEs, the level of digital development is at the EU level (68%) and highlights the efforts "to improve the digitization of companies" that can be seen in figures: 12.3% of Spanish companies already use Artificial Intelligence and 14.3% use Big Data for analysis and internal use. In addition, it joins the Kit Digital initiative for the digital acceleration of SMEs and the Agents of Change program to hire experts in digital transformation in the field of small and medium-sized enterprises, a type of company with a high number of companies in the country and a "significant participation in the Spanish economy", which makes them a target for reforms and investments that "will have an indirect multiplier impact". The report encourages the country to continue "implementing these policies" in the area of business digitalization, particularly in the case of SMEs. 

  1. Citizen digitization  

In addition, at the citizen level, the report analyzes the digitalization of the Spanish population. In this sense, the document points out that Spain obtains good results in terms of digital skills with 64% and 38% of the population benefiting from these skills respectively. In both cases, the Spanish average is above the EU.  

Within the efforts being made when it comes to boosting ICT profiles both in the educational area and in the labor market, the report highlights measures to increase the number of ICT specialists, in particular a law to modernize the vocational education and training (VET) system, approved in March 2022, and a new VET specialization course on AI and big data that allows training in the new digital environment. 

The report, which analyzes the state of play in 2022, notes the importance of that year, which stands out as having been the period "of a further acceleration of key trends affecting the digital transformation of the EU digital EU" with increasingly rapid technological advances, such as AI, climate change and associated social and economic concerns, a growing demand for high-speed connectivity or the multipolarization of the global scene in the face of an ever-increasing technological race.   

The report concludes that the potential impact of technological change is significant and will require the EU to be agile and swift in its transformation. 

calendar icon
Evento

The health situation we have been experiencing for a couple of years now had changed the way of celebrating major events to date, moving from a face-to-face to an online format.

This year, however, we are slowly beginning to see a return to the normality that existed before the pandemic. Several of the major technology events to be held in 2022 have already announced that they will be held in hybrid format, or even exclusively in person, if health conditions permit.

Here is a brief review of the events related to the world of technology and data, both public and private, that will be held over the coming months and that you should mark in red on your calendar.

Mobile World Congress (MWC)

February 28 to March 03, 2022 – Barcelona

We begin by talking about the most global technological event in our country, which is about to begin, and which aims to reconnect and reinvent the connectivity industry. It is one of the most influential events worldwide, as this congress represents the largest exhibition of telephony, Internet and mobile applications within the industry.

The MWC is attended by mobile operators, device manufacturers and service providers to create a place to facilitate networking and showcase the most innovative and current technologies.

Among the conferences that will take place at this event, we find some related to the field of data, such as ‘Strategies for energising the data economy’ or ‘The data opportunity: making mobility smart’. Also noteworthy is the conference ‘Digital policies to speed the post-Covid recovery’,   which will feature a presentation by Nadia Calviño, First Vice President of Spain and Minister of Economy and Digitalization.

Four Years From Now (4YFN)

February 28 to March 03, 2022 – Barcelona

As has become customary in recent years, the MWC will also host the internal event for start-ups 4YFN (4 Years From Now), which seeks to support contact between new companies and investors, thus facilitating access to an international network of contacts and different business opportunities. Red.es will collaborate in this event by making a selection of Spanish companies and startups to participate in the different representation spaces that are organized. Among the participating companies there are some focused on the world of data and its reuse.

In addition, 4YFN will have outstanding speakers such as Francisco Polo, High Commissioner for Spain Entrepreneurial Nation, or Carme Artigas, Secretary of State for Digitalization and Artificial Intelligence, who will participate on March 1 in a session that will analyze the existing challenges in developing a European ecosystem that is favorable to the creation, growth and investment of new companies and start-ups.

OpenExpo Europe 2022

June 30, 2022 – Madrid

OpenExpo Europe is a space for the dissemination of technological innovation, digital transformation and open source in Europe. Among its main objectives is to disseminate among professionals in the technology sector the latest trends, tools and services in innovation and technology, as well as helping them to increase their network of contacts.

The OpenExpo Virtual Experience initiative emerged in 2020, following the success of the dissemination of content in online format about fields such as: cybersecurity, blockchain, AI, Virtual Reality, IoT or big data, among other topics.

As of today, the agenda of this event is not yet available. However, through this link you can pre-register for the event that will take place on June 30 at La Nave (Madrid).

Advanced Factories

March 29th to 31st, 2022 – Barcelona

The city of Barcelona will once again host this annual summit, which brings together the most cutting-edge companies in Industry 4.0. This event, European leader in advanced and digital industry, will bring together more than 17,000 professionals, in addition to offering 100 hours of conferences on fields such as robotics, industry 4.0, automation or 3D printing, among others.

Some of the central themes of this world-class meeting will be: productivity improvement, connectivity, data analysis, industrial symbiosis or reduction of the digital gap between large manufacturing factories and SMEs.

South Summit

June 08 to 10, 2022 – Madrid

Next June will see the arrival of South Summit, a contest-like showcase designed to offer more visibility to disruptive projects seeking new clients, financing or strategic partnerships. As in previous editions, it will feature investors and leading innovation companies from Spain, Southern Europe and Latin America. You can register for the start-up competition through this link.

Unlike last year, in 2022 South Summit will once again be held in a fase-to-face format in Madrid, in addition to maintaining the virtual presentations of the projects through a 100% digital omnichannel model.

IoT Solutions World Congress

May 10-12, 2022 - Barcelona, Spain

After the cancellation of the last edition last October, due to the high incidence of the pandemic in our country, this event returns with more strength in 2022. Barcelona will bring together industry experts to analyze how IoT is transforming production, transportation, logistics and public services, as well as sectors such as healthcare and energy.

The central theme of this edition will be "The new normal", and will analyze the main innovation challenges faced by companies through five thematic axes such as security, connectivity, business optimization, intelligence and customer experience.

Among the topics to be presented are some related to the world of data such as: "Data are the fuel that drives the energy transition: exploring the smart energy network at Eneco" or "Konecranes: driving business and providing commercial advantages with industrial IoT, perimeter and data analytics".

Smart City Expo World Congress

November 15-17, 2022 - Barcelona, Spain

Smart City Expo World Congress (SCEWC) has established itself over the last few years as a benchmark event that blends technological innovation with the Smart Cities sector. This smart cities congress will bring together once again experts, companies and entrepreneurs with the aim of collectivizing urban innovation and promoting new projects around the world.

The last editions were developed in digital format, which has led to the creation of Tomorrow.City, a digital content platform aimed at public administrations and new talents, which seeks to disseminate knowledge, promote training and promote research on an uninterrupted basis. This partnership combines the event with a digital platform open all year round and dedicated to the growth of sustainable mobility.

EU Datathon

October 20, 2022 – Online

The EU Datathon is the competition that seeks to boost the creation of products based on open data, such as mobile or web applications, that offer an answer to different challenges related to EU priorities. The deadline for submissions is March 31, 2022, with the final on October 20.

This challenge is linked to the European Union Open Data Days, organized by the Publications Office of the European Union, which last year held its first edition and will presumably hold the second one during the last months of 2022, although this information is yet to be confirmed.

Other events of interest

There are a multitude of technological events related to the field of data that can also be very interesting. We can not collect them all in detail in a single article, so here are some of the most popular both nationally and internationally:

In Spain

Internationally

This has been just a selection of the main technological events that 2022 has in store for us. If you know of any other that you would like to highlight, do not hesitate to write us a comment or send us your proposal through our email contacto@datos.gob.es.

calendar icon
Evento

The current healthcare situation has changed the way in which major events are held, with most of them moving from being held in person to online. However, little by little, the face-to-face format is being taken up again, returning to the offline format and even combining both experiences.

In this article we are going to discover some events related to the world of technology and data, both private and public, that will be held in the coming weeks and that you should not miss. Join us to discover them!

OpenExpo Virtual Experience 2021

8 to 10 June 2021 – Online

OpenExpo Europe has positioned itself in recent years as one of the main windows for dissemination in technological innovation, digital transformation and open source in Europe. Its main objective is to disseminate the latest trends, tools and services in innovation and technology among professionals in the technology sector, as well as helping them to increase their network of contacts.

The OpenExpo Virtual Experience initiative was launched last year, following the success achieved with the dissemination of online content on cybersecurity, blockchain, AI, virtual reality, IoT and big data, among other topics.

At this event, attendees will be able to enjoy more than 50 activities led by professional experts in technology and innovation: presentations, case studies, interviews, debates, workshops, Q&A sessions, 1to1 meetings, etc.  Some of the topics to be addressed are Govtech and the public administration's commitment to innovation, free educational software and Gaia-X, one of the European Commission's major projects in the field of data.

Advanced Factories

8 to 10 June 2021 – Barcelona

Barcelona will host the annual Advanced Factories summit, which brings together the most cutting-edge companies in Industry 4.0. Some of the focal points of this world-class meeting will be: industrial automation, sensors, energy efficiency, artificial intelligence, blockchain, machine learning and big data.

For the fourth consecutive year, this summit will host the Industry 4.0 Congress under the slogan "We are the future of automation", which will begin with a presentation on the role of data in the transformation of this sector.

Mobile World Congress (MWC) 2021

28 June to 01 July 2021 – Barcelona

This great technological event was suspended in 2020, but in 2021 it will re-emerge as a new event with great guarantees of health safety. As a novelty, this year's MWC will feature several virtual activities that will complement the on-site edition of the event. "Connected Impact" is the chosen theme, which places the COVID-19 pandemic as the main element influencing this year's technological trends.

As usual, leading professionals from the sector and prominent speakers will be taking part. Among them is Carme Artigas, Secretary of State for Digitalisation and Artificial Intelligence, who will participate with a presentation on data in the age of intelligence.

As in previous years, the in-house event for startups 4YFN (4 Years From Now) will be held as part of the MWC. Its aim is to support contact between startups and investors, providing access to an international network of contacts and different business opportunities. Among the participating companies we can find many focused on the world of data and its reuse. Red.es selects Spanish companies and startups to participate in the different representation spaces that are organised.

South Summit

5 to 7 October 2021 – Online

The autumn will see the arrival of South Summit, a showcase in the form of a competition to give more visibility to disruptive projects seeking new customers, funding or strategic partnerships. It will feature investors and leading innovation companies from Spain, southern Europe and Latin America, regardless of the industry, country of origin or stage of development of the project.

This year the organisation has decided not to hold the event in person, so the project presentations will take place virtually.

IoT Solutions World Congress

5 to 7 October 2021 – Barcelona

This is undoubtedly one of the most high-profile IoT events in the world. Due to the growing demand from the sector, more than 8,000 visitors are expected to attend an event that will bring together industry experts to analyse how the Internet of Things is transforming production, transport, logistics, public services and sectors such as healthcare and energy.

Some of the papers to be presented include "Leveraging EdgeX Foundry as an Open, Trusted Data Framework for Smart Meter Monitoring", "Using Mobile, IoT and Data Analytics to Take a Localized Approach to the Global Waste Problem" and "Making Cities, Infrastructures & Construction Sites Smarter with Time Series Data".

Semantic Web for E-Government

24 October - online

This online event will focus on a review of the semantic web and its importance in achieving interoperability and integration between the different organisational levels of public administrations. Two current e-government and open data initiatives will be presented:

  • The European Data Portal, a platform for integrating and assessing Europe's Linked Open Government Data. It will address the multiple applications of semantic web standards in the European Data Portal, such as DCAT, SKOS, SHACL and DQV. Special attention will also be given to the measurement and publication of quality information.
  • Ciudades Abiertas: good practices for data harmonisation with local public administrations. It will be explained how a set of vocabularies is being developed to support a homogeneous provision of open data in the framework of Ciudades Abiertas, a collaborative project with four Spanish cities (Zaragoza, A Coruña, Madrid and Santiago de Compostela).

Smart City Expo World Congress

16 to 18 November 2021 – Barcelona

For several years now, Smart City Expo World Congress (SCEWC) has become a benchmark event that combines technological innovation with the field of Smart Cities. It brings together experts, companies and entrepreneurs with the aim of creating synergies and promoting new projects.

In 2021, the congress celebrates its tenth anniversary and its organisers will once again opt to hold the event in person, combined with a digital platform that will offer a multitude of opportunities to its attendees.

This event is usually the framework chosen by Open Data Barcelona to showcase the finalists of its World Data Viz Challenge, although the 2021 edition has not yet been announced.

EU Open Data Days

23 to 25 November 2021 - Online

This year we will also attend the first edition of the EU Open Data Days, organised by the Publications Office of the European Union in collaboration with the Aporta Initiative.  The event will be virtual and will be divided into two activities:

  • EU Dataviz 2021 (23-24 November). A programme of conferences focusing on open data and visualisations. They are currently defining the agenda which we will share with you soon.
  • EU Datathon 2021 (25 November). In the months leading up to this event, a competition will be held to encourage the creation of products based on open data, such as mobile or web applications, that offer a response to different challenges related to EU priorities. The deadline for submissions is 11 June. The final will be held on 25 November as part of the Open Days.

 

This is just a selection of some of the major technology events coming up - do you know of any more you would like to highlight? Then don't hesitate to write us a comment or send us your proposal by email to contacto@datos.gob.es.

 

calendar icon
Blog

Artificial intelligence is transforming companies, with supply chain processes being one of the areas that is obtaining the greatest benefit. Its management involves all resource management activities, including the acquisition of materials, manufacturing, storage and transportation from origin to final destination.

In recent years, business systems have been modernized and are now supported by increasingly ubiquitous computer networks. Within these networks, sensors, machines, systems, vehicles, smart devices and people are interconnected and continuously generating information. To this must be added the increase in computational capacity, which allows us to process these large amounts of data generated quickly and efficiently. All these advances have contributed to stimulating the application of Artificial Intelligence technologies that offer a sea of ​​possibilities.

In this article we are going to review some Artificial Intelligence applications at different points in the supply chain.

Technological implementations in the different phases of the supply chain

Planning

According Gartner, volatility in demand is one of the aspects that most concern entrepreneurs. The COVID-19 crisis has highlighted the weakness in planning capacity within the supply chain. In order to properly organize production, it is necessary to know the needs of the customers. This can be done through techniques of predictive analytics that allow us to predict demand, that is, estimate a probable future request for a product or service. This process also serves as the starting point for many other activities, such as warehousing, shipping, product pricing, purchasing raw materials, production planning, and other processes that aim to meet demand.

Access to real-time data allows the development of Artificial Intelligence models that take advantage of all the contextual information to obtain more precise results, reducing the error significantly compared to more traditional forecasting methods such as ARIMA or exponential smoothing.

Production planning is also a recurring problem where variables of various kinds play an important role. Artificial intelligence systems can handle information involving material resources; the availability of human resources (taking into account shifts, vacations, leave or assignments to other projects) and their skills; the available machines and their maintenance and information on the manufacturing process and its dependencies to optimize production planning in order to satisfactorily meet the objectives.

Production

Within of the stages of the production process, one of the stages more driven by the application of artificial intelligence is the quality control and, more specifically, the detection of defects. According to European Comission, 50% of the production can end up as scrap due to defects, while, in complex manufacturing lines, the percentage can rise to 90%. On the other hand, non-automated quality control is an expensive process, as people need to be trained to be able to perform the inspections properly and, furthermore, these manual inspections could cause bottlenecks in the production line, delaying delivery times. Coupled with this, inspectors do not increase in number as production increases.

In this scenario, the application of computer vision algorithms can solve all these problems. These systems learn from defect examples and can thus extract common patterns to be able to classify future production defects. The advantages of these systems is that they can achieve the precision of a human or even better, since they can process thousands of images in a very short time and are scalable.

On the other hand, it is very important to ensure the reliability of the machinery and reduce the chances of production stoppage due to breakdowns. In this sense, many companies are betting on predictive maintenance systems that are capable of analyzing monitoring data to assess the condition of the machinery and schedule maintenance if necessary.

Open data can help when training these algorithms. As an example, the Nasa offers a collection of data sets donated by various universities, agencies or companies useful for the development of prediction algorithms. These are mostly time series of data from a normal operating state to a failed state. This article shows how one of these specific data sets (Turbofan Engine Degradation Simulation Data Set, which includes sensor data from 100 engines of the same model) can be taken to perform a exploratory analysis and a model of linear regression reference.

Transport

Route optimization is one of the most critical elements in transportation planning and business logistics in general. Optimal planning ensures that the load arrives on time, reducing cost and energy to a minimum. There are many variables that intervene in the process, such as work peaks, traffic incidents, weather conditions, etc. And that's where artificial intelligence comes into play. A route optimizer based on artificial intelligence is able to combine all this information to offer the best possible route or modify it in real time depending on the incidents that occur during the journey.

Logistics organizations use transport data and official maps to optimize routes in all modes of transport, avoiding areas with high congestion, improving efficiency and safety. According to the study “Open Data impact Map”, The open data most demanded by these companies are those directly related to the means of transport (routes, public transport schedules, number of accidents…), but also geospatial data, which allow them to better plan their trips.

In addition, exist companies that share their data in B2B models. As stated in the Cotec Foundation report “Guide for opening and sharing data in the business environment”, The Spanish company Primafrio, shares data with its customers as an element of value in their operations for the location and positioning of the fleet and products (real-time data that can be useful to the customer, such as the truck license plate, position, driver , etc.) and for billing or accounting tasks. As a result, your customers have optimized order tracking and their ability to advance billing.

Closing the transport section, uOne of the objectives of companies in the logistics sector is to ensure that goods reach their destination in optimal conditions. This is especially critical when working with companies in the food industry. Therefore, it is necessary to monitor the state of the cargo during transport. Controlling variables such as temperature, location or detecting impacts is crucial to know how and when the load deteriorated and, thus, be able to take the necessary corrective actions to avoid future problems. Technologies such as IoT, Blockchain and Artificial Intelligence are already being applied to these types of solutions, sometimes including the use of open data.

Customer service

Offering good customer service is essential for any company. The implementation of conversational assistants allows to enrich the customer experience. These assistants allow users to interact with computer applications conversationally, through text, graphics or voice. By means of speech recognition techniques and natural language processing, these systems are capable of interpreting the intention of users and taking the necessary actions to respond to their requests. In this way, users could interact with the wizard to track their shipment, modify or place an order. In the training of these conversational assistants it is necessary to use quality data, to achieve an optimal result.

 

In this article we have seen only some of the applications of artificial intelligence to different phases of the supply chain, but its capacity is not only limited to these. There are other applications such as automated storage used by Amazon at its facilities, dynamic prices depending on the demand or the application of artificial intelligence in marketing, which only give an idea of ​​how artificial intelligence is revolutionizing consumption and society.


Content elaborated by Jose Antonio Sanchez, expert in Data Science and enthusiast of the Artificial Intelligence.

Contents and points of view expressed in this publication are the exclusive responsibility of its author.

calendar icon