How to generate value from data: formats, techniques and tools to analyse open data

Fecha del documento: 26-06-2019

Modelo_de_datos_RDF

In the digital world, data becomes a fundamental asset for companies. Thanks to them, they can better understand their environment, business and competition, and make convenient decisions at the right time.

In this context, it is not surprising that an increasing number of companies are looking for professional profiles with advanced digital capabilities. Workers who are able to search, find, process and communicate exciting stories based on data.

The report "How to generate value from data: formats, techniques and tools to analyse open data" aims to guide those professionals who wish to improve the digital skills highlighted above. It explores different techniques for the extraction and descriptive analysis of the data contained in the open data repositories.

The document is structured as follows:

  • Data formats. Explanation of the most common data formats that can be found in an open data repository, paying special attention to csv and json.
  • Mechanisms for data sharing through the Web. Collection of practical examples that illustrate how to extract data of interest from some of the most popular Internet repositories.
  • Main licenses. The factors to be considered when working with different types of licenses are explained, guiding the reader towards their identification and recognition.
  • Tools and technologies for data analysis. This section becomes slightly more technical. It shows different examples of extracting useful information from open data repositories, making use of some short code fragments in different programming languages.
  • Conclusions. A technological vision of the future is offered, with an eye on the youngest professionals, who will be the workforce of the future.

The report is aimed at a general non-specialist public, although those readers familiar with data treatment and sharing o in the web world will find a familiar and recognizable reading.

Next, you can then download the full text, as well as the executive summary and a presentation.

Note: The published code is intended as a guide for the reader, but may require external dependencies or specific settings for each user who wishes to run it.

Documentation

    • How to generate value from data
      docx
      1.61 MB
    • How to generate value from data
      pdf
      1.58 MB
    • Resumen ejecutivo: Cómo generar valor a través de los datos (only available in Spanish)
      docx
      57.68 KB
    • Presentación: Cómo generar valor a través de los datos (only available in Spanish)
      pptx
      1.72 MB
    • Source code example: Phyton (Call to Europeana)
      txt
      484 bytes
    • Source code example: R (DPLA call)
      rmd
      2.16 KB