Skip to content

Data Steward MVS

Introduction: Open Science mission for this role

The Minimum Viable Skillset (MVS) sets out to describe competencies required for Open Science practitioner roles. Data Stewards put Open Science principles into practice and are a key role for the European Open Science Cloud. The MVS includes two variations of the role, one described as ‘Coordinator’ and the other ‘Embedded’.

These represent two ends of a spectrum, with ‘Coordinator’ describing a role providing support across an organisation’s research domains and units, and ‘Embedded’ describing a role close to a research team and to its domain-specific practices. We acknowledge that these roles can overlap, influenced by the availability of resources, and by disciplinary and organisational cultures. Data Stewardship expertise is typically distributed across a team, drawing on in-house capabilities and external services provided to the Research Performing Organisation, e.g. by Research Infrastructures, Service Providers (e.g scholarly communications), and Competence Centres.

In general terms, Data Stewards work with stakeholders to establish, govern and maintain processes to collect research data, make it usable for research objectives, facilitate its transformation into research output, assist in quality assurance, and support informed decision-making on its openness for reuse according to ethical, legal and social expectations.

In the EOSC specifically, in their roles as practitioners and champions of Open Science, trainers, and enablers of others in their organisation, Data Stewards are likely to be both consumers and providers of EOSC services or resources.

Coordinator Data Steward Embedded Data Steward
Coordinator Data Stewards act as a ‘centralised knowledge and communication hub’ for researchers. They advise and train on policy, guidelines, data management plans, institutional infrastructure and tools to implement FAIR and CARE principles across the organisation. Embedded Data Stewards serve research teams, faculties, departments, sections of organisations directly involved in producing research outputs, supporting them to plan and implement FAIR and CARE principles, meeting needs of researchers as they arise, and working with others to ensure outputs are preserved and reusable in the long term.
Associated function titles: Data Steward, Data Librarian, Research Data Management Specialist, Research Data Manager, Research Data Management Consultant, Reproducibility Librarian. Associated function titles: Data Steward, Data Manager, Data Curator, Research Data Manager.

Essential Skills and Competences

Technical

  • Cross-domain/domain-specific knowledge on Open Science practices, policies, and regulations, and translating these (when necessary) to the appropriate levels of the organisation.
  • Service provision to support cross-domain/domain-specific Open Science practices including: use of FAIR and CARE principles, Open Access, data optimization, data preservation, archiving, and responsible re-use.
  • Knowledge about Research Data Management, (personal) data governance and ethics, Open Science data publication and exchange (sharing) services, information security, and risk management.
  • Raising awareness of the value of good data management among data creators and users, researchers, organisational colleagues, and decision-makers.
  • Advise/provide support on the use of infrastructure and tools at the appropriate levels of the organisation, e.g., for data storage, data versioning, and documentation, FAIR software, and databases.
  • Training design and delivery to support Open Science practices, policies, and practices.
  • Monitor the research and funding ecosystem, including possible conflicting motivations, drivers, and incentives among different stakeholders.
  • Knowledge/awareness of programming, FAIR code, and FAIR software, and use of standards and ontologies.

Soft/Transversal

  • Communication
  • Conflict management/mediation (with a patient, empathic approach)
  • Critical and analytical thinking
  • Stakeholder engagement and networking to translate and bridge needs
  • Creativity, curiosity, and openness (willingness to learn)
  • Team management and project management (results-oriented planning and organising)

Background assumptions

Main activities – Coordinator role Main activities – Embedded role
• Contributes to Open Science policy development by engaging with (inter)national policy-making, bringing the cross-disciplinary expertise needed for local policy development, implementation, and monitoring. • Develops Data Management Plans templates tailored for research teams and supports researchers in writing a DMP according to the relevant template. Includes provision for post-project archiving and FAIR sharing (standards, metadata, licensing, repository selection).
• Understands research stakeholder needs and contributes to developing, implementing, and monitoring institutional RDM policy and Data Governance, along with tools and services to support these. Promotes and communicates the importance of Open Science and FAIR to all levels within the organization (e.g., policy makers, senior management, researchers, postgraduates, etc.). • Supports researchers in good practice on data and/or software/code when writing applications to funders, implements this good practice as a regular aspect of doing research, and liaises with (technical) RDM experts inside and outside the institute to adopt effective solutions to challenges.
• Analyses trends in data management infrastructure, tools, and methods that potentially improve the organisation’s implementation of FAIR and CARE principles to enhance support for decision-making on Open Science. Advises on (meta)data standards and contextual documentation for data archiving. • Advises and supports researchers on data-infrastructure and tools, and adoption of innovative techniques or tools, including those provided by relevant (inter)national data-infrastructure and tools.
• Monitors RDM skills of researchers and research support staff in the institute and refers researchers to RDM related facilities and services. • Identifies gaps and takes action if needed to ensure ethical conduct and awareness of the potential impacts of data reuse, management, and sharing on wider society.
• Develops and delivers training tailored to learners’ needs, aligned with wider institutional policies and plans. • Advises on the use of disciplinary standards and ontologies, and relevant community practices that are applied in producing FAIR research outputs.
• Maintains networks of RDM and research support related colleagues. • Supports researchers on legal and regulatory compliance aligning local practices with ethical conduct through connections with the institutional privacy officers, legal advisers, and research ethics bodies.
• Provides ongoing support and guidance to researchers in following best practices for data management, ensuring ethical considerations and compliance are met.
• Encourages collaboration across departments and institutions to share knowledge and improve research data management practices.

Contributes to which Open Science outcomes?

  • Digital research objects are as FAIR and open as possible and as closed as necessary.
  • Opportunities are identified for creating or connecting with professional Open Science networks at institutional, cross-institutional, regional, national, or international levels.
  • Relevant competence centres with a FAIR data and Open Science support role are utilised effectively according to local needs and policies.
  • Open Science skills and practices are facilitated and enhanced using, where appropriate, EOSC resources and services, including any relevant Open Educational Resources.
  • Research data and other digital objects are effectively managed to ensure their suitability for archiving and sharing, and advancement of research methods appropriate to the discipline(s).

Further information -

Contributors: Nida van Leersum, Päivi Rauste, Bojana Koteska, Laurence Horton, Karsten Kryger Hansen, Angus Whyte

Open Science Skills Terms

OS skills terms match the essential skills in this MVS to competence definitions from relevant taxonomies. The selected terms offer further information to help identify the learning objectives for skills development. Sources: European Skills, Competences and Occupations ontology (ESCO), ResearchComp, terms4FAIRskills, Center Scientific Collaboration and Community Engagement.

ESCO Research SkillsDemonstrate disciplinary expertise; Manage findable accessible interoperable and reusable data;  Apply research ethics and scientific integrity principles in research activities; Increase the impact of science on policy and society; Interact professionally in research and professional environments.

ESCO Transversal SkillsNegotiate compromisesThink critically; Think analytically; Advise others; Participate actively in civic life; Demonstrate curiosity; Approach challenges positively; Adapt to change; Lead others; Work in teams; Meet commitments; Organise information, objects and resources.

ResearchComp:  Manage research data; Develop networks; Teach in academic or vocational contexts; Promote open innovation; Promote the transfer of knowledge; Build mentor-mentee relationships.

Terms4FAIRskills: Data policy; knowledge to contextualise fair principles to domain; Service level management; Data curation; Preservation; Open access publishing; Data access risk assessment and mitigation; Information security; Securing sustainable funding; Software preservation; Metadata exposure;

CSCCEConsultation and listening; Advocacy; Landscape analysis.

Link to any other MVS that this MVS is based on (from those in Skills4EOSC D2.1)

References

1 2 3 4 5 6 7 8 9 10 11 12 13


  1. B. Ayres, L. Lehtsalu, G. Parton, Á. Száldobágyi, E. Warren, A. Whyte, and N. Zimmer. Rda professionalising data stewardship—current models of data stewardship: survey report. 2022. URL: https://doi.org/10.15497/RDA00075, doi:10.15497/RDA00075

  2. Y. Demchenko, L. Stoy, C. Engelhardt, and V. Gaillard. D7.3 fair competence framework for higher education (data stewardship professional competence framework). 2021. URL: https://doi.org/10.5281/zenodo.4562089, doi:10.5281/zenodo.4562089

  3. EMBL-EBI. Competency hub. 2023. URL: https://competency.ebi.ac.uk/framework/datasteward/1.0/competencies

  4. C. Engelhardt. How to be FAIR with your data: A teaching and training handbook for higher education institutions. Göttingen University Press, 2022. URL: https://doi.org/10.17875/gup2022-1915, doi:10.17875/gup2022-1915

  5. EOSC Executive Board; Skills, Training Working Group, European Commission. Directorate General for Research, and Innovation. Digital skills for fair and open science: report from the eosc executive board skills and training working group. 2021. URL: https://data.europa.eu/doi/10.2777/59065, doi:10.2777/59065

  6. K. U. Förstner, E. Seidlmayer, W. ZB MED-Informationszentrum Lebenswissenschaften, Deutschland. Bundesministerium Für Bildung, J. Dierkes, R. Depping, Universitäts- Und Stadtbibliothek Köln Technische Hochschule Köln, B. Lindstädt, Universität Zu Köln, and F. Hoffmann. Ergebnisse des projektes datastewforschung unterstützen—empfehlungen für data stewardship an akademischen forschungsinstitutionen. 2023. URL: https://doi.org/10.4126/FRL01-006441397, doi:10.4126/FRL01-006441397

  7. M. Jetten, M. Grootveld, A. Mordant, M. Jansen, M. Bloemers, M. Miedema, and C. W. G. V. Gelder. Professionalising data stewardship in the netherlands. competences, training and education. dutch roadmap towards national implementation of fair data stewardship. 2021. URL: https://doi.org/10.5281/ZENODO.4320504, doi:10.5281/ZENODO.4320504

  8. S. Scholtens, M. Jetten, J. Böhmer, C. Staiger, I. Slouwerhof, M. Van Der Geest, and C. W. G. Van Gelder. Final report: towards fair data steward as profession for the lifesciences. report of a zonmw funded collaborative approach built on existing expertise. 2022. URL: https://doi.org/10.5281/ZENODO.3471707, doi:10.5281/ZENODO.3471707

  9. Alexandre Ribas Semeler, Adilson Luiz Pinto, and Helen Beatriz Frota Rozados. Data science in data librarianship: core competencies of a data librarian. Journal of Librarianship and Information Science, 51(3):771–780, 2019. URL: https://doi.org/10.1177/0961000617742465, arXiv:https://doi.org/10.1177/0961000617742465, doi:10.1177/0961000617742465

  10. L. Wildgaard, E. Vlachos, L. Nondal, A. V. Larsen, and M. Svendsen. National coordination of data steward education in denmark: final report to the national forum for research data management (dm forum). 2020. URL: https://doi.org/10.5281/ZENODO.3609515, doi:10.5281/ZENODO.3609515

  11. I. Verheul, M. Imming, J. Ringerma, A. Mordant, J.-L. van der Ploeg, and M. Pronk. Data stewardship on the map: a study of tasks and roles in dutch research institutes. 2019. URL: https://doi.org/10.5281/ZENODO.2669150, doi:10.5281/ZENODO.2669150

  12. Christian Wendelborn, Michael Anger, and Christoph Schickhardt. What is data stewardship? towards a comprehensive understanding. Journal of Biomedical Informatics, 140:104337, 2023. URL: https://www.sciencedirect.com/science/article/pii/S1532046423000588, doi:https://doi.org/10.1016/j.jbi.2023.104337

  13. L. Wildgaard and J. Rantasaari. Gaps in data stewardship: what kind of needs for training do data stewards face in supporting research? 2022. URL: https://doi.org/10.15497/RDA00076, doi:10.15497/RDA00076