Evento

Valencia City Council has launched a call to reward projects that promote the culture of open information and open data in the city. Specifically, it seeks to promote the culture of government transparency and good governance through the reuse of open data.

If you are thinking of participating, here are some of the keys you should take into account (although do not forget to read the complete rules of the call for more information).

What do the prizes consist of?

The awards consist of a single category that encompasses projects that demonstrate the potential of the reuse of public open data , and may also include private data. Specifically, applications, technological solutions, services, works, etc. may be presented. that use public data from the city of Valencia to benefit the community.

The requirements that must be met are the following:

  • To present an innovative character and highlight its impact on improving the lives of people and their environment.
  • Be current  and be implemented in general, in the territorial area of the municipality of Valencia. The final projects of bachelor's, master's or doctoral theses can have been carried out at any university, but it is mandatory that they refer to and base their research on areas of transparency in the city of Valencia.
  • Use inclusive and non-sexist language.
  • Be written in Spanish or Valencian.
  • Have a single author, which may be a legal entity or association.
  • Be written in accordance with the terms and conditions of the call, and articles previously published in journals may not participate.
  • Not have received a subsidy from the Valencia City Council for the same purpose.

Who can participate?

The contest is aimed at audiences from wide sectors: students, entrepreneurs, developers, design professionals, journalists or any citizen with an interest in open data.

Both natural and legal persons from the university field, the private sector, public entities and civil society can participate, provided that they have developed the project in the municipality of Valencia.

What is valued and what do the prizes consist of?

The projects received will be evaluated by a jury that will take into account the following aspects:

  • Originality and degree of innovation.
  • Public value and social and urban impact.
  • Viability and sustainability.
  • Collaborative nature.

The jury will choose three winning projects, which will receive a diploma and a financial prize consisting of:

  • First prize: 5,000 euros.
  • Second prize: 3,000 euros.
  • Third prize: 2,000 euros.

In addition, the City Council will disseminate and publicize the projects that have been recognized in this call, which will be a loudspeaker to gain visibility and recognition.

The awards will be presented at a public event in person or virtually in the city of Valencia, to which all participants will be invited. An opportunity to engage in conversation with other citizens and professionals interested in the subject.

How can I participate?

The deadline for submitting projects is 7 July 2025. The application can be made in two ways:

In both cases, in addition, an explanatory report of the project will have to be presented. This document will contain the description of the project, its objectives, the actions developed and the results obtained, detailed in a maximum of 20 pages. It is also necessary to review the additional documentation indicated in the rules, necessary according to the nature of the participant (natural person, legal entity, associations, etc.).

For those participants who have doubts, the email address sctransparencia@valencia.es has been enabled. You can also ask any questions on the 962081741 and 962085203 phones.

You can see the complete rules at this link.

calendar icon
Blog

General ethical frameworks

The absence of a common, unified, ethical framework for the use of artificial intelligence in the world is only apparent and, in a sense, a myth. There are a multitude of supranational charters, manuals and sets of standards that set out principles of ethical use, although some of them have had to be updated with the emergence of new tools and uses. The OECD guide on ethical standards for the use of artificial intelligence, published in 2019 but updated in 2024, includes value-based principles as well as recommendations for policymakers. The UNESCO Global Observatory on Ethics and Governance of AI published in 2021 a material called Recommendation on the Ethics of AI adopted in the same year by 193 countries, and based on four basic principles: human rights, social justice, diversity and inclusiveness, and respect for the environmental ecosystem. Also in 2021, the WHO specifically included a document on Ethics and Governance of AI for Health in which they indicated the need to establish responsibilities for organisations in the use of AI when it affects patients and healthcare workers. However, various entities and sectors at different levels have taken the initiative to establish their own ethical standards and guidelines, more appropriate to their context. For example, in February 2024, the Ministry of Culture in Spain developed a good practice guide to establish, among other guidelines, that works created exclusively with generative AI would not be eligible for awards.

Therefore, the challenge is not the absence of global ethical guidelines, but the excessive globality of these frameworks. With the legitimate aim of ensuring that they stand the test of time, are valid for the specific situation of any country in the world and remain operational in the face of new disruptions, these general standards end up resorting to familiar concepts, such as those we can read in this other ethical guide from the World Economic Forum: explainability, transparency, reliability, robustness, privacy, security. Concepts that are too high, predictable, and almost always look at AI from the point of view of the developer and not the user.

Media manifestos

Along these lines, the major media groups have invested their efforts in developing specific ethical principles for the use of AI in the creation and dissemination of content, which for now constitutes a significant gap in the major frameworks and even in the European Regulation itself. These efforts have sometimes materialised individually, in the form of a manifesto, but also at a higher level as a collective. Among the most relevant manifestos are the one by Le Figarowhich editorial staff states that it will not publish any articles or visual content generated with AI, or that of The Guardian which, updated in 2023, states that AI is a common tool in newsrooms, but only to assist in ensuring the quality of their work. For their part, the Spanish media have not issued their own manifestos, but they have supported different collective initiatives. The Prisa Group, for example, appears in the list of organisations that subscribe to the Manifesto for Responsible and Sustainable AI, published by Forética in 2024. Also interesting are the statements of the heads of innovation and digital strategy at El País, El Español, El Mundo and RTVE that we found in an interview published on Fleet Street in April 2023. When asked whether there are any specific red lines in their media on the use of AI, they all stated that they are open-minded in their exploration and have not limited their use too much. Only  RTVE, is not in the same position with a statement: "We understand that it is something complementary and to help us. Anything a journalist does, we don't want an AI to do. It has to be under our control.

Global principles of journalism

In the publishing context, therefore, we find a panorama of multiple regulations on three possible levels: manifestos specific to each medium, collective initiatives of the sector and adherence to general codes of ethics at national level. Against this backdrop, by the end of 2023 the News Media Alliance published the Global Principles for AI in Journalism, a document signed by international editorial groups that includes, in the form of a decalogue, 12 fundamental ethical principles divided into 8 blocks:

Visual 1: Global Principles on AI in Journalism. 1..ntellectual Property Developers, operators, and deployers of AI systems must respect intellectual property rights; publishers are entitled to negotiate for and receive adequate remuneration for use of their IP; copyright and ancillary rights protect content creators and owners from the unlicensed use of their content; existing markets for licensing creators’ and rightsholders’ content should be recognised.   2. Transparency: AI systems should provide granular transparency to creators, rightsholders, and users. 3. Accountability: providers and deployers of AI systems should cooperate to ensure accountability for system outputs. 4. Quality and Integrity: ensuring quality and integrity is fundamental to establishing trust in the application of AI tools and services.  5. Fairness: AI systems should not create, or risk creating, unfair market or competition outcomes. 6. Safety: AI systems should be trustworthy; AI systems should be safe and address privacy risks. 7. By Design: these principles should be incorporated by design into all AI systems, including general purpose AI systems, foundation models, and GenAI systems. 8. Sustainable Development: The multi-disciplinary nature of AI systems ideally positions them to address areas of global concern. Source: News Media Alliance

Figure 1. Global principles of AI in journalism, News Media Alliance.

When we review them in depth, we find in them some of the major conflicts that are shaping the development of modern artificial intelligence, connections with the European AI Regulation and claims that are constant on the part of content creators:

  • Block 1: Intellectual property. It is the first and most comprehensive block, specifically developed in four complementary ethical principles. Although it seems the most obvious principle, it is aimed at focusing on one of the main conflicts of modern AI: the indiscriminate use of content published on the internet (text, image, video, music) to train learning models without consulting or remunerating the authors. The first ethical principle expresses the duty of AI system developers to respect restrictions or limitations imposed by copyright holders on access to and use of content. The second expresses the ability of these authors and publishing groups to negotiate fair remuneration for the use of their intellectual property. Third, it legitimises copyright as a sufficient basis in law to protect an author's content. The fourth calls for recognising and respecting existing markets for licensing, i.e. creating efficient contracts, agreements and market models so that AI systems can be trained with quality, but legitimate, authorised and licensed content.
  • Block 2: Transparency. The second block is a logical continuation of the previous one, and advocates transparency in operation, a feature that brings value to both content authors and users of AI systems. This principle coincides with the central obligation that the European Regulation places on generative AI systems: they must be transparent from the outset and declare what content they have trained on, what procedures they have used to acquire it and to what extent they comply with the authors' intellectual property rights.  This transparency is essential for creators and publishing groups to be able to enforce their rights, and it is further established that this principle must be universally adhered to, regardless of the jurisdiction in which the training or testing takes place.
  • Block 3: Accountability. This word  refers to the ability to be accountable for an action. The principle states that developers and operators of AI systems should be held accountable for the outputs generated by their systems, for example if they attribute content to authors that is not real, or if they contribute to misinformation or undermine trust in science or democratic values.
  • Block 4: Quality and integrity. The basis of the principle is that AI-generated content must be accurate, correct and complete, and must not distort the original works. However, this superficial idea builds on a more ambitious one: that publishing and media groups should be guarantors of this quality and integrity, and thus official suppliers to AI system developers and providers. The fundamental argument is that the quality of the training content will define the quality of the outcomes of the system.
  • Block 5: Fairness. The word fairness can also be translated as equity or impartiality. The principle states in its headline that the use of AI should not create market unfairness, anti-competitive practices or unfair competition, meaning that it should not be allowed to be used to promote abuses of dominance or to exclude rivals from the market. This principle is not aimed at regulating competition between AI developers, but between AI developers and content providers: AI-generated text, music or images should never compete on equal terms with author-generated content.
  • Block 6: Safety. It is composed of two ethical principles. Building on the above, the first security principle states that generative AI systems must be reliable in terms of the information sources they use and promote, which must not alter or misrepresent the content, preserving its original integrity. The opposite could result in a weakening of the public's trust in original works, in authors and even in major media groups. This principle applies to a large extent to new AI-assisted search engines, such as the new Google Search (SGE), the new SearchGPT or Microsoft's own Copilot, which collect and recast information from different sources into a single generated paragraph. The second point unifies user data privacy issues into a single principle and, in just one sentence, refers to discriminatory bias. Developers must be able to explain how, when and for what purpose they use user data, and must ensure that systems do not produce, multiply or chronic biases that discriminate against individuals or groups.
  • Block 7: By design. This is an overarching meta-principle, which states that all principles should be incorporated by design in all AI systems, generative or otherwise. Historically, ethics has been considered at the end of the development process, as a secondary or minor issue, so the principle argues that ethics should be a significant and fundamental concern from the very process of system design. Nor can ethical auditing be relegated only to cases where users file a complaint.
  • Block 8: Sustainable development. It is apparently a global, far-reaching principle that AI systems should be aligned with human values and operate in accordance with global laws, in order to benefit all of humanity and future generations. However, in the last sentence we find the real orientation of the principle, a connection to publishing groups as data providers for AI systems: "Long-term funding and other incentives for providers of high-quality input data can help align systems with societal goals and extract the most relevant, up-to-date and actionable knowledge

The document is signed by 31 associations of publishing groups from countries such as Denmark, Korea, Canada, Colombia, Portugal, Brazil, Argentina, Japan or Sweden, by associations at European level, such as the European Publishers Council or News Media Europe, and associations at global level such as WAN-IFRA (World Association of News Publishers). The Spanish groups include the Asociación de Medios de Información (AMI) and the Asociación de Revistas (ARI).

Ethics as an instrument

The global principles of journalism promoted by the News Media Alliance are particularly precise in proposing grounded solutions to ethical dilemmas that are very representative of the current situation, such as the use of authored content for the commercial exploitation of AI systems. They are useful in trying to establish a solid and, above all, unified and global ethical framework that proposes consensual solutions. At the same time, other conflicts affecting the profession, which would also be included in this Decalogue, are conspicuously absent from the document. It is possible that the omnipresence of the constantly referenced data licensing conflict has overshadowed other concerns such as the new speed of disinformation, the ability of investigative journalism to verify authentic content, or the impact of fake news and deepfakes on democratic processes. The principles have focused on setting out the obligations that the big tech companies should have regarding the use of content, but perhaps an extension could be expected to address ethical responsibilities from the media's point of view, such as what ethical model the integration of AI into newsroom activity should be based on, and what the responsibility of journalists is in this new scenario. Finally, the document reveals a common duality: the channelling, through the ethical proposal, of the suggestion of concrete solutions that even point to possible trade and market agreements. It is a clear reflection of the potential capacity of ethics to be much more than a moral framework, and to become a multidimensional instrument to guide decision-making and influence the creation of public policy.


Content prepared by Carmen Torrijos, expert in AI applied to language and communication. The contents and points of view reflected in this publication are the sole responsibility of the author.

calendar icon
Evento

There is never an end to opportunities to discuss, learn and share experiences on open data and related technologies. In this post, we select some of the upcoming ones, and tell you everything you need to know: what it's about, when and where it takes place and how you can register.

Don't miss this selection of events on cutting-edge topics such as geospatial data, accessible data reuse strategies and even innovative trends in data journalism. the best thing? All are free of charge.

Let's talk about data in Alicante

The National Association of Big Data and Analytics (ANBAN) is organising an open and free event in Alicante to debate and exchange views on data and artificial intelligence. During the meeting, not only will use cases that relate data with AI be presented, but also a part of the event will be dedicated to encourage networking among the attendees.

Open Data Day in Barcelona: Re-using data to improve the city

Open Data Day is an international event that brings together open data activities around the world. Within this framework, the barcelona Open Data initiative initiative has organised an event to discuss projects and strategies for the publication and reuse of open data to make a clean, safe, friendly and accessible city possible.

  • What is it about? Through open data projects and data-driven strategies, the challenge of security, coexistence of uses and maintenance of shared spaces in municipalities will be addressed. The aim is to generate dialogue between organisations that publish and reuse data to add value and develop strategies together.
  • When and where? On 6 March from 5 to 7.30 p.m. at Ca l'Alier (C/ de Pere IV, 362).
  • How do I register? Through this link: https://www.eventbrite.es/e/entradas-open-data-day-2024-819879711287?aff=oddtdtcreator

Presentation of the "Good Practice Guide for Data Journalists"

The Valencian Observatory of Open Data and Transparency of the Universitat Politècnica de València has created a guide for journalists and data professionals with practical advice on how to turn data into attractive and relevant journalistic stories for society. The author of this reference material will talk to a data journalist about the challenges and opportunities that data offers in journalism.

Geodata Conference of the Madrid City Council Geoportal

Madrid hosts the sixth edition of this event which brings together heads of institutions and benchmark companies in cartography, geographic information systems, digital twin, BIM, Big Data and artificial intelligence. The event will also be used as an opportunity to award the prizes of the Geodata Stand.

  • What is it about? Followingin the footsteps of previous years, the Madrid Geodata Conference will present case studies and new developments in cartography, digital twinning, reuse of georeferenced data, as well as the best papers presented at the Geodata Stand.
  • When and where? The event starts on 12 March at 9am in the Auditorio de La Nave in Madrid and will last until 2pm. The following day, 13 March, the session will be virtual and will present the projects and new developments in geo-information production and distribution via the Madrid Geoportal.
  • How do I register? Through the event portal. Places are limited https://geojornadas.madrid.es/

3rd URJC Free Culture Conference

The Free Culture Conference of the Universidad Rey Juan Carlos is a meeting point, learning and exchange of experiences about free culture in the university. Topics such as open publishing of teaching and research materials, open science, open data, and free software will be addressed.

  • What is it about? The two-day event will feature presentations by experts, workshops on specific topics and an opportunity for the university community to present papers. In addition, there will be an exhibition space where tools and news related to culture and free software will be shared, as well as a poster exhibition area
  • When and where? 20 and 21 March at the Fuenlabrada Campus of the URJC
  • How do I register? Registration is free of charge via this link: https://eventos.urjc.es/109643/tickets/iii-jornadas-de-cultura-libre-de-la-urjc.html

These are some of the upcoming events. In any case, don't forget to follow us on social media so you don't miss any news about innovation and open data. We are on Twitter y LinkedIn you can also write to us at dinamizacion@datos.gob.es if you would like us to add another event to the list or if you need extra information.

calendar icon
Noticia

The coming days will see the beginning of a season of sun, heat, beach and, in many cases, more free time, which makes this season a perfect opportunity to expand our training on a wide variety of subjects, among which data, a cross-cutting subject in different sectors, cannot be missing.

There are an increasing number of courses related to Big Data, science, analytics and even data journalism that we find in the summer offers of training centres. There is a growing interest in further training in these subjects due to the high demand for professional profiles with these skills.

Whether you are a student or a working professional, here are some examples of summer courses that may be of great interest to broaden your knowledge during these weeks:

Data science

The University of Castilla-La Mancha is offering the course "Data science: impact on society" on 22 and 23 June at the Albacete campus, where the new ways of acquiring and using data resulting from advances in technology and artificial intelligence will be discussed.

This summer, the University of Deusto will offer online training on "Data analysis and applied machine learning". This course, which will begin on 27 June, will teach you to master the main technologies for analysing and processing large amounts of data, as well as some techniques to increase the value of the data analysed, promoting optimal decision-making.

The University of Alicante focuses on artificial intelligence with the course "Introduction to Deep Learning" from 11 to 15 July 2022, in classroom mode. The course will begin by explaining basic concepts and the use of basic and advanced packages such as NumPy, Pandas, scikit-learn or tf.Keras, and then continue to delve deeper into neural networks.

The University of Alcalá de Henares will talk about "Introduction to financial data science with R" in an on-site course from 20 to 24 June. The aim of the course is twofold: to familiarise students with the use of the statistical language and to show some of the techniques linked to advanced statistical calculation, as well as its practical applications.

Open data

The Complutense University of Madrid offers, once again this year, its course “Big & Open Data. Analysis and programming with R and Python” from 4 to 22 July 2022 (mornings from 9:00 to 14:00, Monday to Friday). The course will cover the data life cycle, examples of Big Data use cases and ethics applied to the management of massive data, among other topics.

Geographic Information Systems

If you are passionate about geographic data, the University of Santiago is offering the course "Introduction to geographic information systems and cartography with the R environment" from 5 to 8 July 2022. In classroom format and with 29 teaching hours, it aims to introduce students to spatial analysis, visualisation and working with raster and vector files. The main geostatistical interpolation methods will be covered during the course.

The University of Alcalá de Henares, for its part, will give the course "Applications of GIS to Hydrology", from 6 to 8 July, also in classroom format. This is a practical course that covers everything from the different sources of hydro-meteorological data to carrying out evapotranspiration and runoff analyses and obtaining results.

Data journalism

El Institut de Formació Contínua – IL3 de la Universitat de Barcelona organizará del 4 al 7 de julio de 2022 el curso online en castellano “Bulos y periodismo de datos”. Esta formación de 8 horas de duración te aportará los conocimientos necesarios para comprender, identificar y combatir el fenómeno de la desinformación. Además, conocerás las herramientas esenciales que se utilizan en el periodismo de datos, la verificación de datos (fact-checking) políticos y la investigación basada en peticiones de transparencia.

Data protection

The Menéndez Pelayo International University will hold a course on "Strategies for data protection in the face of the challenges of the digital environment" on 4, 5 and 6 July. The programme is aimed at students related to the business world, the provision of digital services, public administrations, researchers and those interested in the subject. “Smart-cities and personal data processing” or “the European Data Protection Committee and the European initiatives of the digital package” will be just some of the topics to be addressed in this course.

Another of the training courses related to data protection that will be given over the coming months will be "Are our data really ours? Risks and guarantees of personal data protection in digital societies". The International University of Andalusia will be responsible for giving this course, which will be held in person in Seville from 29 August, and which will address the current situation of personal data protection within the framework of the European Union. Through this training you will discover the benefits and risks involved in the processing of our personal data.

In addition to this specific summer training, those users who wish to do so can also go to the large MOOC course platforms, such as Coursera, EDX or Udacity, which offer interesting courses on a continuous basis so that any student can start learning whenever they need to.

These are just a few examples of courses that are currently open for enrolment this summer, although the offer is very wide and varied. Moreover, it should be noted that the summer has not yet begun and new training courses related to the field of data could emerge in the coming weeks. If you know of any other course that might be of interest, do not hesitate to leave us a comment below or write to us at contacto@datos.gob.es.

 

calendar icon
Blog

We recently reflected on the rise of data journalism in the media, a discipline that is becoming increasingly important. Through technologies and tools linked to data, this journalistic modality makes it possible to offer readers more documented, attractive and comprehensible information. To this end, data journalists perform actions such as interpreting municipal budgets, combining statistics and financial reports or exploring geospatial information to tell stories based on large amounts of data.

Here, we want to highlight the role that data journalism plays in serving as an impetus for the creation of new companies, associations and user groups that reuse data to boost the economy or drive accountability.

Next, we will analyse how the reuse of data, within the field of journalism, has favoured the emergence of new business models, non-profit organisations and transparency, research or political fact-checking activities in which data journalism is at the core of their business or daily activity.

Organisations involved in data journalism

Title: Some organisations linked to data journalism; includes the logos of EPDATA, MALDITO DATO, CIVIO and STORYDATA.

News agencies: EPDATA

EpData is the platform created by Europa Press whose main objective is to facilitate the use of public data by journalists to enrich the news through graphics, as well as to analyse the context and contrast the figures offered by the different sources.

This platform was created with the aim of transferring the spirit of a news agency to the world of data journalism. To achieve this, they have created an extensive data repository with all the statistics that are often reported in Europa Press, with more than 60 million data from public and private sources. Any user can consult them through this link.

EpData's activities linked to data journalism

This repository allows EpData to offer graphs, statistics and data visualisations related to current events on a daily basis. For example, over the last year they have been showing updated data on the current pandemic that other media and scientific publications have used for their news.

However, the range of topics they combine on their platform is very broad, as they offer statistics and graphs based on data on topics as diverse as employment, economic developments, society, politics, etc.

Fact-checking: Maldito Dato

According to its coordinators, "Maldito Dato is the project within the verification media Maldita.es in which data journalism, transparency, investigation and political fact-checking, i.e. the verification of political discourse, are carried out".

In other words, Maldito Dato is the section of Maldita.es that carries out data journalism, made up of specialists in this field. Among its main functions is the production of journalistic pieces based on data, political fact-checking and topics based on the use of the Transparency Law.

Maldito Dato's activities linked to data journalism

Open data is an important part of Maldito Dato's nature, both in terms of publishing and reusing it in its news. Some examples are:

In addition to using open data from public administrations, sometimes they publish open data themselves. For example, when they release databases obtained by requests for information through the Transparency Law. Some examples are the database of public information requests made by citizens on the Transparency Portal, the database of health inspections of swimming pools in Madrid or the database of Ministry of the Interior subsidies for 'catastrophic situations'.

Non-profit organisation: CIVIO

CIVIO is a non-profit organisation that promotes the transparency of institutions and that people can access public information through technology combined with journalism. To achieve this goal, it has three main lines of action:

  • Journalism
  • Public Advocacy
  • Transparency services of public authorities

CIVIO's activities linked to data journalism

This organisation has recently published research on the use of emergency procurement during 2020. They have used the emergency contracts awarded during the past year that have been published on the Public Sector Procurement Portal and the platforms linked to the different autonomous communities. A search engine for emergency contracts can be found at this link and the main conclusions of the research can be found here.

At the end of this article, CIVIO explains the methodology used to carry out the analysis of this data, as well as the difficulties encountered in the process. The data used in this research have been made available to all users in Datos Civio in formats that facilitate their reuse. Thanks to this, several media outlets have used the CIVIO database to produce their own information.

Data analysis, visualisation and communication: STORYDATA

Storydata is an organisation of women experts in data research, analysis, visualisation and communication. They are characterised by offering a global communication service aimed at public and private entities that want to transform a set of their own or external data into a communicative story. To do so, they use a variety of media and formats, from the preparation of reports to the creation of campaigns and the dissemination of content on social networks.

Storydata's activities linked to data journalism

Open data is part of Storydata's DNA. Some of its most relevant projects are:

Currently, Storydata collaborates with media such as Crític, for example, creating a data visualisation that shows inequality in times of confinement or this one on how the coronavirus affects the different municipalities of Catalonia.

User groups and list of data journalists

If you are a data journalist, or are interested in the subject, you may be interested to know that there are different platforms where you can find user groups related to the reuse of data in the field of journalism.

  • On LinkedIn, the “Periodismo de datos group stands out, created so that interested parties and professionals in data journalism can contribute and share knowledge related to this discipline.
  • On MeetUp, we find the user group Periodismo y visualización de datos. Organised by Adolfo Antón, it has more than 1,200 participants who interact in relation to the use of data for its reuse in journalism.
  • Also on MeetUp is the “Madrid Abriendo Datos” group.
  • Telegram or Slack groups: many users organise through Telegram (in groups such as postdata, periodismodatos and visualizar) or Slack (periodismodatos.slack.com).

On the web we can also find a list of communicators specialised in data in Spain and Latin America, made up of more than 300 journalists. This project brings together a series of professionals who see data as the true raw material of journalism. If you want to consult the complete list, you can do so through the following link.

This is just a sample of organisations, companies and user groups linked to data journalism. If you know of any others that may be of interest, do not hesitate to write us a comment or send us an email to dinamizacion@datos.gob.es.

 


Content prepared by the datos.gob.es team.

calendar icon
Entrevista

Data journalism is one of the disciplines that has grown the most in the media in recent years, both within and outside our borders. This form of journalism consists of collecting accurate data, analysing it and making it available to the public through articles and/or graphic and interactive resources, facilitating the understanding of complex issues. 

In datos.gob.es we have interviewed Adolfo Antón, designer, journalist, trainer and passionate about Free Software and open knowledge. Adolfo has been president of Open Knowledge Foundation Spain, coordinator of School of Data Spain, head of the Datalab at Medialab Prado, coordinator of the Data Journalism working group, of the Data Journalism Conference and curator of the Data Journalism (2014-2019) and Visualizar (2015-2018) workshops. He is currently coordinator of the Master's Degree in Journalism and Data Visualisation at the University of Alcalá and Professor of the Master's Degree in Digital and Data Journalism at the Nebrija University.

Full interview:

1. What does a data journalist do?

Thank you, first of all, for your interest in journalism and data visualisation and for what I can contribute to these fields. I appreciate and welcome the existence of datos.gob.es, an essential project that will soon be ten years old and which, paradoxically, I think is not sufficiently known, used and recognised in these fields. 

To answer your first question, I am going to focus on what you have defined as data journalism, which begins with the collection of accurate data. There are currently many projects in the field of news verification and I see this, in my opinion, as a reaction to an exaggerated proliferation of false, manipulated news, hoaxes, lies and other faunas and floras of bad practices, not only journalistic but also communicative. The data we work with must be verified, certified, accredited and/or curated, providing context, source and methodology. Otherwise, we will develop flawed analysis and distorted stories. 

There is a journalistic saying that goes "don't let a bad piece of information spoil a good headline", and in this temptation it is very important a strong journalistic ethics in journalists, editors and the media itself.

We need to verify, certify, accredit and/or curate the data we work with, providing context, source and methodology. Otherwise, we will develop flawed analyses and falsified stories.

Data journalism is essentially the use of computer applications to work with data, whether it is few, many or very many. Statistics, infographics and data visualisation are also important in data journalism. 

With these IT tools, what Paul Bradshaw called the inverted pyramid of data journalism is realised: 

  • Compile (gather, collect).
  • Cleaning (scrubbing, digging, investigating, interrogating)
  • Contextualise (data and story context, methodology), and
  • Combine (data, visualisations, infographics, maps, texts, interactives...).

It is therefore necessary to use IT tools and computer languages that cover one, several or all of the tasks in the work process. It is not mandatory, but it is advisable not to be cloistered in proprietary software, as this will determine the use we make of it. Third-party services can also be of great help, but it is preferable to use your own services that you have full control over

2. Why is data journalism important, and can you point us to any success stories or investigations where it has been key?

Data journalism is journalism that investigates with data and, therefore, is as important, necessary and primordial as journalism, if by this we mean critical and independent journalism, a fourth power in today's society. Not knowing how to work with data using IT tools limits us from doing good journalism, be it economic, political or sports journalism. Increasingly, data journalism is no longer the exceptional success story of generalist journalism, but the methodology of journalism in general. 

The first cases of success in Spain can be circumscribed around Civio, an organisation that carries out data journalism in projects such as España en llamas, among others. Then, the projects that emerged around the conjunction of three elements that made data journalism grow in Spain are relevant: 

  • The Data Journalism group at Medialab-Prado, which awakens public interest in this discipline and enables the creation of an incipient community; 
  • The Unidad Editorial/URJC Master's Degree in Investigative Journalism, Data and Visualisation, which trains a first generation of data journalists; 
  • The media that are committed to it to a greater or lesser extent, such as El Español, El Mundo, El Confidencial, RTVE, El Diario de Navarra, eldiario.es or Ara.cat.  

A high point in international data journalism, and also in Spain, was undoubtedly the investigation into the Panama Papers by the International Consortium of Investigative Journalists (ICIJ) in 2016, in which 109 media from 76 countries took part. In Spain, LaSexta and El Confidencial were the participating media and achieved a wide repercussion and the resignation of the Minister of Industry, Energy and Tourism.

Unfortunately, the Medialab-Prado Data Lab (the continuation of the data journalism group between 2016 and 2019) no longer exists, nor have all of these media maintained or strengthened their teams. But, in general, the discipline has spread in terms of community, universities and practices. This process has accelerated so much with the COVID crisis19, so that the current period is already considered the second golden age of data visualisation.

3. What challenges does data journalism face today?

It is a difficult question to answer because I believe that in addition to the traditional challenges of journalism, there are also those produced, as I said at the beginning, by the abundance of false, manipulated, biased news disseminated by social networks where issues such as ethics, privacy, authorship, anonymity or the automatic and mass-replicated production of content generates a deafening noise. In addition, intensive polarisation is used to collect data on people in order to create consumer profiles. This bombards the rational, reflective, discursive and greyscale process that good journalism can foster. 

If I focus on data journalism as a methodology, the main challenge I see is to train journalists in the use of computer applications to work with data and thereby, little by little, improve journalistic output so that good data journalism products are valued by the general public.

The main challenge I see is to train journalists in the use of computer applications to work with data and thereby gradually improve journalistic output so that the general public appreciates good journalistic data products.

4. Is there a firm commitment by the traditional media to data journalism?

The Panama Papers were a hopeful moment for data journalism and also for the fact that a generalist television station was committed to this discipline. This has not happened in general terms, but it is true that the coronavirus crisis has produced an increase in work where some analysis and visualisation of data is produced, which can be seen on the front pages of media websites, for example. Without an in-depth analysis, I would say that most of them are more showcases with easy products than complete data journalism works in the sense that not all the stages of the journalistic project are carried out, but fragments that cover the demand. 

It is worth highlighting the work in data analysis and visualisation being done by El País, RTVE.es and eldiario.es. At the same time, media specialised in news verification such as Newtral and Maldita are constantly producing news with innovative formats that also include data analysis and visualisation. 

On the other hand, there are people who do not work in the media but who have come together since the beginning of the pandemic to work on COVID19 data in a commendable effort that combines data collection, analysis and visualisation and leaves the work practically ready for the media to take it and finish it off, but that magical connection has not yet been made. 

From the experience of Medialab-Prado's data journalism workshops, I would say that working with data takes time, requires professionals, equipment, ideas, etc., but these are not investments that can be far from any newsroom, regardless of the size of the media outlet. The fact that such a firm commitment has not been made also leaves the field open for other proposals to position themselves better, as has happened with news verification.

5. Open data is essential for data journalists to have accurate information from official sources. What types of data are most in demand by data journalists for their research?

My impression is journalists do not normally take advantage of the open data that are available, either because it is very complex, because it requires extensive knowledge in data processing, because it requires a lot of work, because it is unknown or, finally, because it is not "attractive", it is not fashionable. 

In other words, having an open data portal and an open data publication policy does not ensure that the data will be used, which does not mean that this, the publication of quality open data, should not be the default policy of any self-respecting public administration and source of information. 

There are many different cases and to cite them all would take a more precise exercise of collecting them. Let us take two examples. INE data, in addition to their complexity, microdata, come in different formats. There are search engines with which to create your own set whose interfaces are very old and not very usable. Another case is the Zaragoza city council data portal. One of the best, but it requires registration to work with the API and the data can be extracted in JSON... I put an ellipsis because although it is one of the most used and manageable data formats, not everyone, as with microdata, knows how to use it. Ultimately, not all the problems in data journalism come from the absence of data but also from the formats and the skills to handle them

Open data is often not exploited, either because it is too complex, because it requires extensive data processing skills, because it requires a lot of work, because it is unknown or, finally, because it is not "attractive", it is not fashionable.

In this sense, that of skills management, I remember that lately I have seen more than one media outlet embedding visualisations made by third parties. It could be one more example of those niches that data journalism is producing so that there are news agencies specialised in data. This should not be a negative thing, but it seems to me that third party tools are being used in limited formats. At the other extreme is the BBC, which makes a style guide on how to make graphics with R and creates a library so that the style of their graphics is different. That's betting on data too. 

In the data journalism or visualisation workshops we always found that we lacked the magic dataset to work with, we had to create it. But we also encountered surprises and I think we certainly don't use most of the available data because we don't know it exists. So, in addition to demanding data, I would push in parallel for learning how to use existing data or create it.

6. How important are visualisations in data reporting? What technologies and tools do you use?

If I go by the usual story, visualisations are used in two main phases:

  • On the one hand, at the data analysis stage. Data is visualised more easily with all kinds of graphical tools or charts that help us to find outliers, patterns, averages, etc. 
  • On the other hand, in the final part of the project, the journalistic product. The visualisation(s) can be just another part or the main piece of the journalistic story.  

Lately I have been trying to explain what data visualisation is in computer terms. In paper format, visualisation is done with manual tools, manual or digital printing. But on the screen, on the Web, you can do any visualisation you want! With characters, text, images, video, audio, interaction, etc. If we understand the language and languages of this medium, we will be able to produce in a more integrated way data journalism works where any element has a place

For this we do not need hardware other than what we already have, desktop or laptop computers, but we do need a compendium of free or open source software tools

It goes without saying that there is a very wide spectrum of possibilities in the field of proprietary and/or proprietary software, but the use of free or open source software is essential to make a leap in the use of technologies in journalism and data visualisation.

If we understand the language and languages of this medium, we will be able to produce in a more integrated way data journalism works where any element has a place.

7. You are currently involved in 2 masters on data journalism, why should journalists have knowledge of data analysis and visualisation?

In addition to the Medialab-Prado experience, I have received or given courses in media and universities. I have been part of the Master in Data Journalism at Centro Universitario Villanueva in its three editions; I made the teaching guide for two modules of the Master in Data Journalism at UNIR and started teaching, although at that time I was not convinced by online training, possibly due to the use and abuse of proprietary software; I have taught a data module in the Master in Agency Journalism at Agencia EFE-UC3M. Now I am teaching in the Data Journalism and Visualisation module of the Master's in Digital and Data Journalism at Nebrija University where I try to transmit this basic knowledge about journalism and data visualisation. 

I never stop learning and practising every day. I have created this Master's Degree in Journalism and Data Visualisation at the University of Alcalá because I understand that there is no training programme that addresses these issues in a comprehensive way, from free or open source software, and because since I started to relate to this world I have seen that data analysis and visualisation are essential for data journalism, but they have not been addressed in this way in the different university programmes. 

It is true that, from the beginning, I have also heard or read that data journalism is collaboration and that there are many profiles in the newsroom and one person can't have them all, and the virtue is cooperation. That is true, but in order to cooperate you have to know how to cooperate on the one hand and know what you want to cooperate about on the other. In classical journalism, cooperation is commonplace - let's hope it is not lost - so all that is missing is the skills. The training that is done, almost always, tends to cater for different profiles, so that you also need to have an overview, to know what others do, what things might interest me, what strengths I should develop or compensate for. And then, having a good base and with practice, to use one or other skills in one or other roles. 

calendar icon
Blog

More and more media outlets have articles on their pages linked to so-called data journalism. This form of journalism uses the technologies and tools related to to data to provide readers with more documented, easier to understand and more engaging information.

In this article we explain what data journalism consists of and we show you some examples of media that already incorporate this modality within their informational processes.

What is data journalism?

Data journalism or data journalism is a journalistic discipline that incorporates different fields such as computer science, programming, engineering, statistics, design and journalism. to combine in the same space the analysis of data together with the narrative of the press. According to him Data Journalism Handbookdata can be the tool used to tell a story, the source a story is based on, or both.

Data journalism has its origin in Precision Journalism -evolution of investigative journalism-, where disciplines such as sociology and statistics are incorporated into traditional journalism, and Computer Assisted Journalism (Computer Assisted Reporting or CAR), which emerged in 1969 when journalists began to use computer systems to support them when dealing with the information they collected.

Data Journalism goes one step further and is the result of the digital transformation present today in the daily work of many newsrooms throughout the world. Using resources and tools related to data analytics, information is extracted from large databases. In this way, information of greater value and more complete is offered, adapted to the dynamism that digital reading requires.

What products does data journalism offer?

According to the digital magazine Journalists notebooks, there are at least four productions that can derive from this discipline, and that are generally complementary to each other:

  • Data-driven articles: These are short articles that are made from large databases. These types of articles are typical of investigative journalism, since their common denominator is based on surveys or statistics.
  • Applications news: The services that group information and send notifications to users about news of their interest from different media are becoming more and more common. For example, the main providers. For instance, Google Discover or Samsung Daily.
  • Open datasets (datasets): Some media offer open data, the result of their research, in order to democratize information through the accessibility of data and its availability on the Internet in reusable and free formats. As an example, the New York Times offers data on the coronavirus openly.
  • Interactive visualizations: such as infographics, graphics or applications that allow the information obtained from the databases to be viewed more clearly, facilitating the understanding of complex topics by readers. Visualizations can complement articles or be a product of their own when accompanied by short explanatory texts.

Data journalism in the media 

More and more media have a news production based on data. Here are some examples:

At the national level

Among others, The country,The world or The newspaper They have a specific section within their digital newspapers dedicated to data journalism. Within both, newspapers address current affairs from the perspective offered by the data and through which they generate visualizations.like the following. In it, you can see a map that shows the inequalities in mortality according to the area in which you reside.

El mapa de la mortalidad barrio a barrio en España revela enormes desigualdades incluso en la misma calle

For its part, EpData is the platform created by Europa Press to facilitate the use of public data by journalists, with the aim ofboth to enrich the news with graphs and context analysis and to contrast the figures offered by the various sources. EpData also offers tools for creating and editing charts. An example of the activity they carry out is this article, where you can check the status of the dependency waiting lists in Spain.

Gráfico, Gráfico de líneas, Esta es la situación de la lista de espera a la dependencia

Another example of data journalism is Newtral. It is an audiovisual content startup founded in 2018 in which data is the basis of all his work, especially in the fight against fake news. For instance, in this article different visualizations of data related to the oscillation of the price of light can be seen during different months.

Gráfico, Gráfico de líneas, índice de precios de consumo (IPC) de vivienda, agua electricidad, gas y otros combustibles

On an international level

Data journalism is part of some of the most important international newspapers. It is the case of The Guardian, which also has a specific section dedicated to the production of journalistic material through articles, graphics and data-based visualizations. For instance, on this interactive map You can check which areas of Australia suffered the greatest natural disasters in 2020.

MapaDescripción generada automáticamente

Another international media that also has a specific section for data-based journalism is the Argentine newspaper The nation, that through its section The Data Nation offers numerous informative productions in which it combines facts and news. For instance, in this article you can see an indicator of mobility of Argentine citizens.  

Imagen que contiene MapaDescripción

Masters and studies related to data journalism

The digital transformation has meant that data journalism is here to stay. For this reason, more and more universities and education centers offer studies related to data journalism. For example, him Own Master's Degree in Data and Visualization Journalism from the University of Alcalá; the Master in Digital and Data Journalism taught by Atresmedia and the University of Nebrija or the Data Journalism Course from the UAM-El País School of Journalism.

In short, we are facing a modality with a future, which needs media that continue to bet on this discipline and professional capecesto handle data analysis and treatment tools, but also to tell stories, capable of transmitting what is happening in our environment with the support of data in a truthful and close way.

Therefore, it is not surprising that this form of journalism continues to grow in the coming years and that, in addition to the examples included in this article, the mass media and studies related to data journalism will increase. If you know any more and want to share it, do not hesitate to write to us contacto@datos.gob.es or leave us a message in the comments.


Content prepared by the datos.gob.es team.

calendar icon
Evento

This Saturday, March 6, Open Data Day 2021 is celebrated worldwide. This event, which takes place annually on the first Saturday in March, seeks to promote the reuse of open data around the world.

In our country there are several organizations that have chosen the next few days to launch events related to the opening and reuse of information. Next, we collect several of them:

Datathon 2021: The role of open data in the face of challenges in a Post-Covid world

The “Datathon 2021: Challenges in a Post-Covid world” seeks to promote the interest and use of open data, mainly in the areas of tourism, agriculture or health. Its objective is to stimulate participants by offering data and tools of interest for the development of applications related to the use and reuse of open data, in addition to promoting citizen participation in new challenges that improve citizens' day-to-day lives.

The competition is promoted by different local and university institutions of the Valencian Community:

The process will be as follows:

  • The competition will begin on March 5 with an informative day at 5:00 p.m. in remote format. Different representatives of the organizing entities will participate and there will be a round table on open data in the field of tourism, health and agriculture with those responsible in these fields. The session will end with practical information on how to participate in the Datathon.
  • From then on, and for 3 months, six workshops will be held, together with experts in the field. With these workshops, participants will learn about the resources of the Valencian Government's open data portal, learn how to process the request for this information, and address the processes of collection, integration, visualization and geolocation of different types of data.
  • Throughout the process, a team of student volunteers from the UA MultiTEC association will tutor the participants and resolve their doubts.

The call is open to any person of legal age interested in data management. The dates to consider are the following:

  • The registration period ends on March 5. Registration is done using Eventbrite.
  • The configuration of equipment can be done until March 18 through the following form.
  • The completion and delivery of projects will end on May 14. A prize of 1,000 euros and three second prizes of 500 euros will be awarded for each of the challenges: tourism, health and agriculture.

The Datathon evaluation team will take into account the impact and importance of the challenge solved, the creativity and innovation of the application, the use of data, the suitability for the award category and the degree of completion achieved.

Journalism and open data visualization

The MPVD Agora on Journalism and Data Visualization organizes a session on Thursday, March 4 to celebrate the Open Data Day 2021 that will be broadcast on YouTube.

The session, which will begin at 3:00 p.m. and will last an hour and a half, will have 2 different parts:

The session will be recorded for those who cannot see it live. It can be accessed through the YouTube channel MPVD Master of Journalism and Data Visualization. In this channel you can find previous sessions, dedicated, for example, to how to work with data and electronic records available in different formats and sources or how to create simple, clear and attractive stories through the analysis and visualization of data allows.

Although the videos are developed within the framework of the Master in Journalism and Data Visualization at the University of Alcalá, both the videos of this conference and those of the previous ones are available open to any user interested in the subject.

Conference - The DATA in the Local Administrations 20/30. Experiences and challenges March 9, 2021. On-line.

The Alcobendas City Council and the Institute for the Transfer of Knowledge in Public Innovation - ITCIP, with the support of the FEMP, organize a conference to publicize projects that local administrations are developing around the world of data.

The session will take place on March 9 online. Among other issues, subjects related to architecture, life cycle, governance, security and data protection and analysis will be addressed. Some of the experiences and proposals that will be discussed are the Data Offices, the Data Delegate, efficient forms of storage, data management and conservation, its application to public services in smart city projects and its role in the openness, transparency and reuse.

The event will have the participation of representatives from the FEMP, the European Commission, the Community of Madrid and cities such as Madrid, Barcelona, ​​Zaragoza, Malaga, Gijón, Logroño, Alcobendas, Sant Feliu de Llobregat and Mataró. In addition, the conference has the support of Oracle, which will provide the vision of the most innovative trends related to data.

You can see the full agenda at this link.

The conference can be followed for free, through the ITCIP page. You can sign up at this link.

At datos.gob.es we invite you to join some of these proposals and celebrate Open Data Day.

calendar icon
Evento

The Data Journalism Workshop 2019 is already underway. It will be held from March 25 to 29 at Medialab Prado (Calle Alameda 15, Madrid). Under the motto "Follow the food", this year's edition will focus on the world of food, with reference to the Sustainable Development Goals (SDG).

The objective is that participants " analyse, narrate or redesign, in a creative and radical way, changes in the food models that can serve to reduce the likelihood of environmental collapse." For this purpose, they will have to search and analyze public data from different sources, as well as reuse data to create new visualizations or applications.

Among all the ideas presented in the call for projects, 7 have been selected. Collaborators who want to develop these projects are now sought. To this end, a call for collaborators has been launched, aimed at any citizen who wants to be involved in the development of a project based on data: from professionals in the field of journalism, programming, sociology or design, to students or pensioners. It should be noted that Medialab-Prado offers 20 free accommodation places for those who sign up for the call and do not live in Madrid.

Participants can sign up to the following projects:

  • School feeding in Spain. Project promoted by Del Campo al Cole entity, which seeks to promote healthy and sustainable nutrition in schools in Madrid. In collaboration with other organizations such as Carro de Combate and SEO / BirdLife, they have compiled a large amount of data about this subject, as the result of requests for public information to Autonomous Communities and the Ministry of Education. Now they are looking for help to interpret and visualize these data in a simple way, in order to facilitate their dissemination and create awareness about the reality of school feeding in Spain, taking into account the different aspects (public purchase, catering companies, environmental impact, social inequalities ...).
  • Food ultraprocessing: the hidden truth. This project seeks to disseminate the NOVA classification system, which classifies food according to its level of processing. Taking the example of Mexico, an interactive map will be developed to help understand the relationship between habits and dietary choices, as well as their repercussions on the environment by analyzing energy consumption and waste generation associated with each type of food.
  • Intesive livestock farming = extensive problems. This project seeks to create awareness of the problems arising from intensive livestock farming. With a holistic and global vision, the dimension of the problem will be investigated in the whole chain of production: from animal husbandry to final consumption. Some of the questions the participants will have to answer are: how many animals do we sacrifice? Is the way we feed them sustainable and healthy? How do exports and imports work? Do we eat everything we produce?
  • Undressing foods. Through the mapping and artistic visualization of the route that the food takes from the place of production to the final point of consumption, this project seeks to stimulate responsible consumption habits. The final result of the project will be a technological tool that allows the user to locate the origin of a product, using  information about the route taken, the environmental impact generated (CO2, especially) and the aesthetic impact that this route could generate in the product.
  • Food in km. The objective of this project is to create a tool that facilitates the search for information about products on the market, to promote responsible consumption. The tool will allow ordering the results depending on the distance between the origin of the food and the province where it is going to be consumed, as well as filtering products according to labels - bio or not - and ingredients - those ingredients involved in environmental conflicts or restrictive in certain diets.
  • The oil we eat. This project seeks to analyze how the import of agri-food products has evolved in recent years, and calculate how much oil has been consumed to bring a certain food to our table. The participants in this project will create a website and a recipe book whiout cooking time, calories or nutritional contribution. Instead, it will talk about oil costs. Later, maybe it is possible to carry out a transmedia work, involving several platforms, promoting citizen participation using gamification.
  • From the seed to your plate. This project is the continuation of the work started in the Visualizar'18 data visualization workshop. Migrations: Soy de temporada. Now their creators look to deepen the environmental and economic costs of consuming products exported from faraway places. To this end, the entire production line of maximum four products will be analysed, comparing their implications: two vegetables and two fruits, one fromlocal production and one from global trade,

In addition, during the workshop, participants can also participate in a series of training workshops, such as Blockchain's Introduction to journalism or PLN with Python NLTK.

If you are interested in telling stories based on data, sign up before March 16th.

calendar icon
Empresa reutilizadora

EpData is the platform created by Europa Press to facilitate the use of public data by journalists, with the aim of both enriching news with graphics and context analysis and verifying the figures offered by various sources. The database is maintained by a multidisciplinary team of computer scientists and journalists who use new technologies and data analysis to improve the efficiency of data consumption and find relevant and informative patterns in the data.

calendar icon