Blog

In this section, we present notes prepared by expert partners on different topics related to data.

21 posts found

Explainable artificial intelligence (XAI): how open data can help understand algorithms

The increasing adoption of artificial intelligence (AI) systems in critical areas such as public administration, financial services or healthcare has brought the need for algorithmic transparency to the forefront. The complexity of AI models used to make decisions such as granting credit or making a…

artificial intelligence, natural language processing, LLM +4

Stock photo

The role of open data in the evolution of SLM and LLM: efficiency vs. power

Language models are at the epicentre of the technological paradigm shift that has been taking place in generative artificial intelligence (AI) over the last two years. From the tools with which we interact in natural language to generate text, images or videos and which we use to create creativ…

artificial intelligence, open data, machine learning +5

Photo of a robot

Understanding Word Embeddings: how machines learn the meaning of words

Natural language processing (NLP) is a branch of artificial intelligence that allows machines to understand and manipulate human language. At the core of many modern applications, such as virtual assistants, machine translation and chatbots, are word embeddings. But what exactly are they and why are…

Science and technology, data science, natural language processing +2

Word cloud in which some terms such as security, computer or electronic are highlighted.

SLM, LLM, RAG and Fine-tuning: Pillars of Modern Generative AI

In the fast-paced world of Generative Artificial Intelligence (AI), there are several concepts that have become fundamental to understanding and harnessing the potential of this technology. Today we focus on four: Small Language Models (SLM), Large Language Models (LLM), Retrieval Augmented Generation…

artificial intelligence, natural language processing, NLP +5

Stock photo

Safe rooms in Spain: What kind of data can researchers access?

Some data are very valuable but, by their nature, cannot be opened to the general public. These are confidential data subject to third-party rights that prevent them from being made available through open platforms, but which may be essential for research that p…

safe rooms, INE, Banco de España +2

Computer screen with data

RAG techniques: how they work and examples of use cases

In recent months we have seen how the large language models (LLMs) that enable Generative Artificial Intelligence (GenAI) applications have been improving in terms of accuracy and reliability. RAG (Retrieval Augmented Generation) techniques have allowed us to use the full power of n…

artificial intelligence, natural language processing, RAG +3

Mobile phone with AI

The agreement to provide statistical data to researchers, in the context of the Data Governance Regulation

The European Union has devised a fundamental strategy to ensure accessible and reusable data for research, innovation and entrepreneurship. Strategic decisions have been made both in a regulatory and in a material sense to build spaces for data sharing and to foster the emergence of intermediar…

Legislation and justice, safe rooms, data spaces +3

Researcher in front of a screen with data

Linguistic corpora: the knowledge engine for AI

The transfer of human knowledge to machine learning models is the basis of all current artificial intelligence. If we want AI models to be able to solve tasks, we first have to encode and transmit solved tasks to them in a formal language that they can process. By a solved task we mean informa…

artificial intelligence, natural language processing, linguistic corpora +1

Photo of a computer

GraphQL: your best ally for the creation of data products

The era of digitalisation in which we find ourselves has filled our daily lives with data products or data-driven products. In this post we discover what they are and show you one of the key data technologies to design and build this kind of product: GraphQL. Introduction Let's start at the beginni…

artificial intelligence, natural language processing, RAG +3

Stock photography depicting data networks

UNE specifications as a complement to ISO standards for the governance, management and quality of Information Systems and Technologies

Standardisation is essential to improve efficiency and interoperability in governance and data management. The adoption of standards provides a common framework for organising, exchanging and interpreting data, facilitating collaboration and ensuring data consistency and quality. The ISO standards,…

artificial intelligence, natural language processing, NLP +1

Stock photography of a computer