Documentación

In order to extract the full value of data, it is necessary to classify, filter and cross-reference it through analytics processes that help us draw conclusions, turning data into information and knowledge. Traditionally, data analytics is divided into 3 categories:

  • Descriptive analytics, which helps us to understand the current situation, what has happened to get there and why it has happened.
  • Predictive analytics, which aims to anticipate relevant events. In other words, it tells us what is going to happen so that a human being can make a decision.
  • Prescriptive analytics, which provides information on the best decisions based on a series of future scenarios.  In other words, it tells us what to do.

The third report in the "Awareness, Inspire, Action" series focuses on the second stage, Predictive Analytics. It follows the same methodology as the two previous reports on Artificial Intelligence and Natural Language Processing.

Predictive analytics allows us to answer business questions such as: Will we suffer a stockout, will the price of a certain share fall, or will more tourists visit us in the future? Based on this information, companies can define their business strategy, and public bodies can develop policies that respond to the needs of citizens.

After a brief introduction that contextualises the subject matter and explains the methodology, the report, written by Alejandro Alija, is developed as follows:

  • Awareness. The Awareness section explains the key concepts, highlighting the three attributes of predictive analytics: the emphasis on prediction, the business relevance of the resulting knowledge and its trend towards democratisation to extend its use beyond specialist users and data scientists. This section also mentions the mathematical models it makes use of and details some of its most important milestones throughout history, such as the Kyoto protocol or its usefulness in detecting customer leakage.
  • Inspire. The Inspire section analyses some of the most relevant use cases of predictive analytics today in three very different sectors. It starts with the industrial sector, explaining how predictive maintenance and anomaly detection works. It continues with examples relating to price and demand prediction, in the distribution chain of a supermarket and in the energy sector. Finally, it ends with the health sector and augmented medical imaging diagnostics.
  • Action. In the Action section, a concrete use case is developed in a practical way, using real data and technological tools. In this case, the selected dataset is traffic accidents in the city of Madrid, published by the Madrid City Council. Through the methodology shown in the following figure, it is explained in a simple way how to use time series analysis techniques to model and predict the number of accidents in future months.

The report ends with the Last stop section, where courses, books and articles of interest are compiled for those users who want to continue advancing in the subject.

In this video, the author tells you more about the report and predictive analytics (only available in Spanish).

Below, you can download the full report in pdf and word (reusable version), as well as access the code used in the Action example at this link.

calendar icon
Documentación

The pandemic that originated last year has brought about a significant change in the way we see the world and how we relate to it. As far as the education sector is concerned, students and teachers at all levels have been forced to change the face-to-face teaching and learning methodology for a telematic system.

In this context, within the framework of the Aporta Initiative, the study "Data-based educational technology to improve learning in the classroom and at home", by José Luis Marín, has been developed. This report offers several keys to reflect on the new challenges posed by this situation, which can be turned into opportunities if we manage to introduce changes that promote the improvement of the teaching-learning process beyond simply replacing face-to-face classes with online training.

The importance of data to improve the education sector

Through innovative educational technology based on data and artificial intelligence, some of the challenges facing the education system can be addressed. For this report, 4 of these challenges have been selected:

·      Non-presential supervision of assessment tests: monitoring and surveillance of evaluative tests through telematic resources.

·      Identification of behavioral or attention problems: alerting teachers to activities and behaviors that indicate attention, motivation or behavioral problems.

·      Personalized and more attractive training programs: adaptation of learning routes and pace of students' learning.

·      Improved performance on standardized tests: use of online learning platforms to improve results on standardized tests, to reinforce mastery of a particular subject, and to achieve fairer and more equitable assessment.

To address each of these four challenges, a simple structure divided into three sections is proposed:

1.     Description of the problem, which allows us to put the challenge in context.

2.     Analysis of some of the approaches based on the use of data and artificial intelligence that are used to offer a technological solution to the challenge in question.

3.     Examples of relevant or highly innovative solutions or experiences.

The report also highlights the enormous complexity involved in this type of issues, so they should be approached with caution to avoid negative consequences on individuals, such as cybersecurity issues, invasion of privacy or risk of exclusion of some groups, among others. To this end, the document ends with a series of conclusions that converge in the idea that the best way to generate better results for all students, alleviating inequalities, is to combine excellent teachers and excellent technology that enhances their capabilities. In this process, open data can play an even more relevant role in improving the state of the art in educational technology and ensuring more widespread access to certain innovations that are largely based on machine learning or artificial intelligence technologies.

In this video, the author tells us more about the report:

calendar icon
Blog

Climate change, air pollution and sea pollution. These are the 3 main environmental problems for Spanish citizens according to the latest Eurobarometer "Attitudes of European citizens towards the Environment", published in March 2020. The survey also highlights that 90% of the Spanish population considers to protect the natural environment important. Citizens want more action to be taken and believe that responsibility should be shared between companies, national governments, the EU and citizens themselves.

As with other sectors, such as education or health, open data can also help us to overcome environmental challenges. On the one hand, it provides us with detailed information to understand the situation and to be able to look for solutions. On the other hand, they serve to implement solutions that help us raise awareness and reduce our environmental footprint.

In its report "The Economic Impact of Open Data Opportunities for value creation in Europe”, ", the European Data Portal highlights that data can "contribute to reducing total energy consumption in Europe, increasing the use of sustainable energy sources, protecting biodiversity and raising awareness of CO2 emissions", as well as generating an economic impact of around 1€ billion. Let's look at some examples:

  • Reducing energy consumption. Open data-based applications can serve as a tool to help households and institutions to understand and reduce their energy consumption. In this sense, the Regional Government of Castilla y León has launched an Energy Datahub that provides detailed and visual information on energy consumption in the more than 1,500 centres managed by the Regional Government: educational centres, health centres, administrative offices, hospitals, etc. The scorecard facilitates the disaggregated analysis by type of energy (electricity, gas, diesel) and by geographical location, allowing informed decisions to be made. According to the Board itself, thanks to the energy scorecard they have saved more than 12 million euros in electricity since 2015.
  • Increase the use of sustainable energy. The 2018 EU directive on the promotion of the use of energy from renewable sources sets a target for the share of renewable energy in gross final consumption to be 32% by 2030. To help achieve this target, publicly available data on urban planning, use of public services or weather conditions can be cross-checked to create solutions that help make housing more sustainable. One example is Solarmap, which allows the profitability of a solar panel installation to be calculated for a specific location.
  • Improving biodiversity: Changes in ecosystems can lead to mass extinctions of species and the limitation of available natural resources, which can affect not only our economy but also our own way of life. In this sense, open data serves as a basis for solutions that, for example, help us monitor various parameters of port waters (such as Environmental Control Panel), improve waste management (Punto limpio) or publicise noise pollution (MAdb).
  • Raise awareness of air pollution and the reduction of CO2 emissions. Although, as we saw at the beginning of this article, we Spaniards are very aware of climate change, however, if we look at the individual habits of citizens, we see that there is still much to be done. Only 38% try to reduce their energy consumption, 28% choose less polluting forms of transport, and 36% buy local products. It is therefore necessary to continue raising awareness, for example through indicators of air quality in cities such as GV Aire. There is also a need for applications that promote the use of more efficient means of transport, such as bicycles or public transport (Barcelona metro bus rodalies bici).

You can find more examples in the applications section of datos.gob.es and in our environment section.

calendar icon
Documentación

Public administrations have a fundamental role in the infomediary sector, that is, in the business of data reuse. On the one hand, on the supply side, thanks to their role as publishers of open data, an original source of free information that can be used to create valuable products and services; and on the other hand, on the demand side, since administrations can also reuse this information and even be customers of the infomediaries.

This is the starting point of the new report from the National Observatory of Telecommunications and the Information Society (ONTSI), entitled "Public Administrations and the Reuse of Public Information". This report analyzes the weight and the way in which the administration interacts with the infomediary sector, becoming one of its main clients.

Specifically, the report determines that almost 50% of the infomediary companies have the public administrations as clients. The services most demanded by public bodies are consultancy and specific studies, data processing and the generation of visualization tools.

In order to produce the report, representatives of various bodies at all levels of administration were interviewed in depth: all the Ministries, 6 Autonomous Communities and 10 City Councils. All those interviewed agreed on the need to promote collaboration between the public and private spheres to improve the publication of information and the creation of valuable services. From the point of view of reuse, these collaborations boost economic activity and have an impact on three areas:

  • Impact on users: They have a greater knowledge of public activity, improve their perception of administrations and can benefit from more personalized and higher quality services. In addition, they can take advantage of public information to create new businesses.
  • Internal impact on the administrations themselves: The benefits are multiple, from the improvement of management and administrative processes, to cost savings or boosting the skills of public workers.
  • Political impact: Greater knowledge of the environment allows for more informed decisions, in accordance with the reality and needs of citizens.

In this sense, the report contains a series of examples of good practices, which clearly show this impact, and which are included in the infographic below.

The report also includes an analysis of the challenges to be overcome by the administrations and a series of recommendations to obtain greater value and promote such public-private partnerships.

You can download the full report at the end of the article. Accompanying this report, the ONTSI has also published the study "From the Infomediary Sector to the Daya Economy. Characterization of the Infomediate Sector. Edition 2020", focused on the evolution of the market and business models based on data and information.

calendar icon
Documentación

In Spain there are a large number of local, regional and state data initiatives. In order to harmonise this ecosystem, datos.gob.es is a single access point, so that re-users can easily find the information they require in a single space. In addition, datos.gob.es links to the European Data Portal (EDP), which means that the Spanish initiatives included in the catalogue are more visible at an international level. 

A series of simple steps must be followed to appear in the datos.gob.es catalogue. The first one is to request a user account, through this form. User accounts are reserved only for open data publishing bodies or entities; they are not necessary to access or download datasets from the catalogue. The following video (in Spanish) explains what you have to do to get your user account:

Once registered, the publishing user can register his / her datasets in two different ways:

  1. Manual: This involves registering each dataset individually, completing a form for each dataset where its metadata are detailed manually.
  2. Automatic (federated): The data registration and update are done periodically from a RDF file that include the metadata available through a url on the publisher website. It is no longer necessary to work individually with each dataset, but the process can automatically cover several datasets.

datos.gob.es also performs a comprehensive analysis to ensure that the metadata meets quality standards. Among other actions, metadata is checked for validity, broken links are analysed and distributions are checked.

The following infographic shows the process in a simple way. Click on the lateral arrows to change the slide. You can download the complete infographic at the end of the article.

If you have any questions or comments, you can contact datos.gob.es support team.

 
calendar icon
Blog

Did you know that open digital heritage is fundamental to understanding the world around us, to boosting a more creative economy and to meeting agreed educational goals? It is estimated that around 90% of the world's cultural heritage has not yet been digitized. Of the remaining 10% that has been digitized, just 34% is available online while only 3% of that work is open

Most cultural institutions have a great deal of valuable information. This information is usually found mainly in the form of collections, but also in the files of authors, dates, technique, etc. Both the works and their information are susceptible to being converted into open data.

Some examples of museums that have digitized and opened their collection are The Rijksmuseum (Netherlands), the Statens Museum for Kunst (Denmark) or the Metropolitan Museum (USA).

What new products can be created by reusing cultural data?

New technologies offer us endless possibilities when it comes to creating new products through the reuse of cultural data. This is the case of new educational platforms such as BNEscolar, which allow for the learning of art from a different point of view. Open cultural data can even be used to generate scaperoom games or new art pieces. They can also be used as design tools or to complement films and documentaries.

In short, it can be established that cultural data are endowed with a great value that, through greater openness, could be of great benefit for the creation of new initiatives that provide benefits for the whole society.

calendar icon