Module 1: Introduction to Research Data Management
Learning Objectives
- Learning Objective 1 (LO1): Define research data with examples.
- Learning Objective 2 (LO2): Apply FAIR principles and share data effectively.
- Learning Objective 3 (LO3): Recognise the importance of FAIR data, "CARE-ful" data treatment, documentation, and organisation.
Total Module Duration
4 hours 15 minutes
Learning Objective 1
LO1: Define research data with examples.
Learning Activities
- Presentation (20 mins): Define research data and introduce the importance of data collection during the research process.
- Discussion (40 mins): Give the participants a case study (suggestions for where to source one provided in instructor notes) or ask them to present an example of a research project. Discuss in pairs what is the research data collected during different stages of the research process.
Materials to Prepare
- Presentation slides on defining research data.
- Example research project case study.
Instructor Notes
Presentation:
- Introduce the concept of research data and how research data is collected during the different phases of the research life cycle. To help with the slide presentation, definitions of research data and how research data is collected can be found in Resources 1 and 2.
Discussion:
- The instructor can initiate the discussion exercise by asking learners to discuss a research case (from their experience) or by playing one or two of the videos that are available within Resource 1 where researchers talk about their research.
- The instructor can also use one of the case studies provided under Scary Tales (Resource 3) to demonstrate how things can go badly with poor research data collection.
Resources
Input for Presentation and discussion:
- Martinez-Lavanchy, P. M., et al. TU Delft Research Data Management 101 Course. Zenodo, 1 Mar. 2024. DOI.org (Datacite). https://doi.org/10.5281/ZENODO.10732095.
- Course: Essentials 4 Data Support (English) - Public | DANS. https://danstraining.moodlecloud.com/course/view.php?id=11. Accessed 17 Mar. 2025.
- Lang, Kevin, et al. Research Data ScaryTales. Zenodo, 12 Mar. 2025. DOI.org (Datacite). https://doi.org/10.5281/ZENODO.4066679.
Learning Objective 2
LO2: Apply FAIR principles and share data effectively.
Learning Activities
- Presentation (45 mins): Explain each of the FAIR principles, the benefits of applying the FAIR principles, and how to implement each principle. Explain how each principle is implemented during a research project through the use of persistent identifiers, licences for datasets, (machine-readable) metadata and tools such as the use of repositories. Highlight the difference between 'open' data and FAIR data.
- Case study analysis (60 mins):
- Case 1: A research project where data was made FAIR (a good example).
- Case 2: A dataset where metadata is missing.
- Case 3: A dataset where the formats are no longer supported.
Materials to Prepare
- Presentation on FAIR principles and main tools to help apply FAIR principles.
- Case studies to facilitate a discussion activity on FAIR principles.
Instructor Notes
Key messages in presentation on FAIR principles:
- Introduce the learner to FAIR principles (Findable, Accessible, Interoperable, Reusable) and present the purposes of FAIR principles to help share reusable data. The aim is to give an overview of data sharing challenges in research and how FAIR principles address them. Use real-world examples of FAIR data implementation to explain the principles.
- Explain why the FAIR principles are being taught. The main objective of FAIR is to increase data reuse by researchers. The core concepts of the FAIR principles are based on good scientific practice and intuitively grounded. As data stewards, it will be the learners' job to support good research practice and the implementation of the FAIR principles.
- One principle can be explained at a time. For each principle show case how they can be applied. For example, how licences help Reusability and how metadata help Findability, Reusability and Interoperability. Explain how the principles are implemented using data management plans, persistent identifiers, licences for datasets, metadata and using tools such as repositories. For better illustration of the implementation of the FAIR principle, the instructor can highlighting where in the research project life cycle FAIR principles can be applied and where support may be provided. Consider including an interactive quiz to assess the learners understanding of FAIR/non-FAIR data practices with regard to the FAIR principles.
- The instructor can cover the following key takeaways:
- "Open" data and FAIR data: Ensure learners grasp the difference between "open" data and FAIR data.
- Be clear that FAIR principles are guidelines, not strict rules, and that flexibility is essential. FAIR principles are designed to make data reusable and more accessible.
- Sharing data using FAIR principles leads to more reproducible and impactful research.
- Accessibility is not just about Open Data but making data "as open as possible, as closed as necessary."
- Make sure learners understand that the metadata for a given dataset is often linked to the data repository where it has been published. For instance, in Zenodo, the metadata is very basic, whereas in disciplinary repositories, for instance the Cambridge Structural Database, the metadata is much more detailed. Accordingly, address discipline-specific considerations for implementing FAIR principles in different fields, for example differences in metadata standards or data formats.
Case Studies:
- Prepare three cases as below and share it with the learners: find a "good example" dataset
- 1) that has been made "FAIR",
- 2) where the metadata or usage licence is missing, and
- 3) where the format is outdated.
- For each dataset, ask learners to check if all the elements/practices to implement the FAIR principles discussed are present. Is there any principle that does not entirely apply to these datasets? What would be your recommendations to improve them.
- Prepare questions for the learners to work with such as:
- Which metadata would be appropriate to add to Case 2?
- How can we ensure that data is usable in years to come?
- Which formats could we recommend (Case 3)?
- Many university libraries provide guides to where you can find teachable datasets, such as those linked in Resources 1–3. These datasets may require you to edit the README file or remove some metadata to make interesting cases. Alternatively, you can search for datasets from different data repositories which are more or less (or not) FAIR. Exposing the learners to how the datasets look in different repositories is an add-on learning experience.
Resources
Input for presentation and case studies:
- Datasets for Teaching and Learning. https://www.lib.ncsu.edu/formats/teaching-and-learning-datasets. Accessed 24 Mar. 2025.
- Huck, Jennifer. LibGuides: Data and Statistics: Find Data for Teaching and Learning. https://guides.lib.virginia.edu/data/teachandlearn. Accessed 24 Mar. 2025.
- Data Education in School. Useful datasets for data education in schools. https://dataschools.education/resource/useful-datasets-for-data-education-in-schools/. Accessed 24. Mar. 2025
Background reading for slide preparation and discussion with learners. These resources are also suitable for sharing with learners:
- A FAIRy Tale -- A Fake Story in a Trustworthy Guide to the FAIR Principles for Research Data. https://forskningsdata.dk/fairytale/index.html. Accessed 19 Mar. 2025.
Inspiration for instructor - "FAIR-Aware: Assess Your Knowledge of FAIR". FAIRsFAIR, 3 Jul. 2020. https://www.fairsfair.eu/fair-aware.
- "FAIR Principles". GO FAIR. https://www.go-fair.org/fair-principles/. Accessed 19 Mar. 2025.
- Adams, Jenni, m.fl. "Supporting FAIR Data Management Planning Across Different Disciplines at the University of Sheffield". Data Science Journal, bd. 22, nr. 1, June 2023. datascience.codata.org. https://doi.org/10.5334/dsj-2023-017.
- FAIR for Beginners | Danish E-Infrastructure Consortium. https://www.deic.dk/en/data-management/instructions-and-guides/FAIR-for-Beginners. Accessed 24 Mar. 2025.
- Jasinska, Agnes, et al. Open Licences for Data. https://doi.org/10.5281/zenodo.14921877.
Please note the Open learning object which is an interactive checklist to help learners choose a licence before sharing data.
Learning Objective 3
LO3: Recognise the importance of FAIR data, "CARE-ful" data treatment, documentation, and organisation.
Learning Activities
- Lecture (45 mins): Lecture on FAIR and CARE principles in data curation (Resources 1, 2).
- Discussion (45 mins): Debate about data sovereignty, CARE, and data curation.
Materials to Prepare
- Slides for lecture, review materials from the other modules within RDM, particularly FAIR data.
Instructor Notes
General Remark:
- The instructor can view other modules from the section of Research Data Management such as modules on FAIR data, Data Documentation and Storage, Data Sharing and Publishing.
Lecture and Discussion:
- The instructor can reiterate some of the key points from the FAIR data module if required: FAIR is not an "is" or "is not" concept but rather a spectrum—flexibility is key, and should work on a case by case basis.
- Introduce the CARE principles through a case study (Resource 3).
- The instructor can facilitate discussion (Resources 1–3) around the following questions: how to ensure the CARE principles in data that is to be published or shared? How do these principles impact researchers in different countries? Different research domains? The idea here being researchers doing research on or in post-colonial countries will be impacted by these ethical considerations.
Resources
- Carroll, Stephanie Russo, et al. "Operationalizing the CARE and FAIR Principles for Indigenous Data Futures." Scientific Data, vol. 8, no. 1, Apr. 2021, p. 108. DOI.org (Crossref), https://doi.org/10.1038/s41597-021-00892-0.
- "CARE Principles for Indigenous Data Governance". Global Indigenous Data Alliance, 23. Jan. 2023. https://www.gida-global.org/care.
- Carroll, Stephanie Russo, et al. "The CARE Principles for Indigenous Data Governance." Data Science Journal, vol. 19, Nov. 2020, p. 43. DOI.org (Crossref), https://doi.org/10.5334/dsj-2020-043.