Documentation
Type
- Reports and studies (54) Apply Reports and studies filter
- Guides (40) Apply Guides filter
- Data science exercises (18) Apply Data science exercises filter
- Training materials (13) Apply Training materials filter
- Infographics (7) Apply Infographics filter
- Regulations and strategies (7) Apply Regulations and strategies filter
Audience
Authorship
Document date
Tag
-
Guidance for the deployment of data portals. Good practices and recommendations
Open data portals help municipalities to offer structured and transparent access to the data they generate in the exercise of their functions and in the provision of the services they are responsible for, while also fostering the creation of applications...
-
Practical guide for the publication of linked data
It is important to publish open data following a series of guidelines that facilitate its reuse, including the use of common schemas, such as standard formats, ontologies and vocabularies. In this way, datasets published by different organizations will...
-
Introduction to data anonymisation: Techniques and case studies
Data anonymization defines the methodology and set of best practices and techniques that reduce the risk of identifying individuals, the irreversibility of the anonymization process, and the auditing of the exploitation of anonymized data by monitoring...
-
Practical guide to publishing tabular data in CSV files
Nowadays we have more and more sources of data at our fingertips. According to the European Data Portal, the impact of the open data market could reach up to EUR 334 billion and generate around 2 million jobs by 2025 ('The Economic Impact of Open Data...
-
A practical guide to publishing Open Data using APIs
An application programming interface or API is a mechanism that allows communication and information exchange between systems. Open data platforms, such as datos.gob.es, have this type of tool to interact with the information system and consult the...
-
From theory to practice: creating a RAG-based conversational agent.
Introduction In previous content, we have explored in depth the exciting world of Large Language Models (LLM) and, in particular, the Retrieval Augmented Generation (RAG) techniques that are revolutionising the way we interact with conversational...
-
Los portales de datos abiertos son una fuente invaluable de información pública. Sin embargo, extraer insights significativos de estos datos puede resultar desafiante para usuarios sin conocimientos técnicos avanzados. En este ejercicio práctico,...
-
Practical guide for improving the quality of open data
When publishing open data, it is essential to ensure its quality. If data is well documented and of the required quality, it will be easier to reuse, as there will be less additional work for cleaning and processing. In addition, poor data quality can be...
-
DCAT-AP and its extensions: Context and evolution
One of the main challenges that arise when addressing an Open Data initiative is to define the information architecture and facilitate interoperability between data catalogs published by different portals on the Web. In order to solve this challenge, the...
-
Word Embeddings - Practical Exercise on Tag Processing
Open data portals play a fundamental role in accessing and reusing public information. A key aspect in these environments is the tagging of datasets, which facilitates their organization and retrieval. Word embeddings represent a transformative...