Aarhus University Seal / Aarhus Universitets segl

Tools for data analysis

Here are some examples of free programs that are suitable for analysing different types of data.

Cultural Heritage Cluster – a high performance computer platform

  • The purpose of the Cultural Heritage Cluster  is to provide researchers, primarily within the humanities and social sciences, with the opportunity to work quantitatively with big data
  • The Cultural Heritage Cluster is a high performance computer platform operated by DeiC (Danish e-Infrastructure Cooperation) and the Royal Danish Library
  • Located at the Royal Danish Library in Aarhus, it gives you the opportunity to carry out analyses of the library's digital collections, which are normally difficult to access due to copyright.
  • In addition to computer processing power, you can also get development support for your project
  • The Royal Danish Library provides support for users of the Cultural Heritage Cluster.
  • Read more about the Cultural Heritage Cluster (in Danish).

OpenRefine – A tool for cleaning up data

  • OpenRefine is a tool for cleaning up data
  • The program makes it easy to form an overview of, and tidy up, cluttered datasets with inconsistent row and column names.
  • Using different filters, the program can transform textual data into a format that makes it more suitable for further analysis.
  • Through external APIs, OpenRefine can also be used to enrich datasets with metadata fields such as language and geography
  • AU Library offers help to get started with OpenRefine 
  • Read more about OpenRefine

Python – A programming language

  • Python is a programming language with many applications and uses
  • Python can, among other things, be used for collecting, cleaning up, analysing and visualising data
  • It can also be used to automate recurring tasks
  • Python is free open-source software, and there are expansion packs for many purposes
  • AU Library offers help to get started with Python
  • Read more about Python

Get started with Python

R and Rstudio – a programming language and an environment for statistical data processing

  • R is a programming language and environment for statistical data processing and graphics
  • R can be expanded with program packages for specialised tasks such as analysis and visualisation of different types of data
  • RStudio is an integrated development environment that makes it easier to work in R
  • R can be run on different UNIX platforms and similar systems, as well as Windows and MacOS operating systems
  • R is freely available within the Free Software Foundations’s GNU General Public License
  • AU Library offers help to get started with R and RStudio
  • Read more about R and RStudio

Get started with R

VOSviewer - a visualisation tool

  • VOSviewer is a visualisation tool that can highlight connections in bibliographic and textual data
  • Based on scientific articles, the programme can visualise networks of citations, co-citations and bibliographical couplings.
  • The programme can also be used to extract keywords and key concepts within specific fields of study.
  • AU Library offers help to get started with VOSviewer
  • Read more about VOSviewer

Get started with VOSviewer 

Data Management and Open Science support

  • AU Library supports good data management practice at Aarhus University
  • The library offers support to researchers and students on the handling of research data, supervision, planning and sharing of data throughout the research process  

Need help?

If you have any questions or concerns, feel free to contact the liason librarian associated with your field, who will be happy to assist you with your questions.

Alternatively, you are always welcome to contact your local library.