Interesting Resources#

The Turing Way - handbook to reproducible, ethical and collaborative data science#

An excellent reference if you want to learn more about any aspect of research data management. This guide both explains concepts such as versioning, computer environments and metadata, and offer plenty of hands-on advice. It is a community-driven project funded by the Alan Turing Institute, UK.[1]

RDMkit - The research data management toolkit for life sciences#

Best practices and guidelines to support your research data management needs. Particularly useful for more specialized advice, as you can easily navigate resources depending on your role, research field or European country to find tailored information. RDMkit is a community-built resource led by ELIXIR and funded by the European Union.[1]

Library of Congress - File Formats#

Rules of Thumb
  • Avoid proprietary formats (prefer standardized ones)

  • Avoid pseudo-standards, such as xlsx, docx, which you can recognize by the fact that there do not exist multiple implementations across different applications and platforms.

  • Anything “plain text” that is not pure ASCII should be UTF-8 encoded.

  • Simpler is better (as long as no info is lost).

  • If your original data comes in a “low-quality” format (e.g. MP3, GIF, …) just keep it like that.

It is difficult to give comprehensive advice on file formats suited for archiving and digital preservation. The Library of Congress (LoC) Recommended Formats Statement is a good starting point, as is the List of archivable File Formats of the Swiss Federal Archives.

Research Data Life-Cycle Management#

A large, national-level project that aims at providing guidelines, training activities, policy support and various tools to support researchers and their institutions in all aspects of data management.

Metadata Standards#

Maintained by the RDA Metadata Standards Directory Working Group, this directory lists a large number of (mostly) domain-specific metadata standards. Look here first if you are looking for a system to annotate your data that is established for your field of research.