Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Learning objectives
Main take-home message
Introduce and describe reproducibility and open science
Activities to discuss and brainstorm (~40 min)
Resources
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Become aware of what reproducible and open practices are
Hear and learn about what the experiences and thoughts of your colleagues are
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Become aware of what reproducible and open practices are
Hear and learn about what the experiences and thoughts of your colleagues are
Learn about some simple ways to become more open and reproducible
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Become aware of what reproducible and open practices are
Hear and learn about what the experiences and thoughts of your colleagues are
Learn about some simple ways to become more open and reproducible
Identify ways to adopt some of these practices at NCRR (1)
(1): NCRR and any work on DST has unique challenges.
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Take-home message:
And I will add, share you code at any stage of your research. From beginning, to end, to middle.
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
They are part of multiple large trends like team science, computing, meta-research, higher quality/rigor
(Raise of hands) How many could describe the difference between reproducibility and replicability?
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Just as much about inspectability as it is about actual reproducibility.
Part of this workshop and a lot of my teaching and work is to get that us as a community to move closer to this end.
Reproducibility is about HOW EXACTLY a finding was found in a study.
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
For 4 min, to yourself, think what your workflow is exactly like for doing research. As you think of what you do, add them to the Mentimeter, until the time runs out.
For 2 min, as the whole group we'll briefly go over some of the workflow items.
This helps to get us thinking and mentally primed for later activities.
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
+ One folder per manuscript (with associated files)
+ Relative file paths used (data/project-data.csv
vs C:/User1/some-data-file.csv
)
+ Version controlled (like with Git)
+ Automated and explicit pipeline management (re-generate results with single command)
+ Reproducible document system
I wanted you to think about your workflow so you can start appreciating how even small things can be changed to improve reproducibility.
And if more reproducible, if things change, its easier to update later work with literally a push of a button (or single command)
Some things are easier to do than others, and some require more technical knowledge and skill than most researchers have the time or motivation to acquire. (More about encouraging these skills).
But, we're missing key component here...
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
If you were to give me your code and I had access to your data, would I know how you got any given result presented in your paper?
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Components of open science are in all stages of research.
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
+ Open access (like preprints)
+ Open protocol
+ Open data/data format
+ Open analysis plan/code
+ Open source (like software used) (1)
(1): Example, AU institutionally approves and supports a closed source software (Stata), reducing reproducibility.
Focus down to reproducibility side (data, code).
E.g. AU has institutionally approved a closed source statistical software (Stata), which by definition, makes work less open and reproducible (someone else needs a Stata license to run your software), unlike open source software like R where anyone can install and run it.
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
What might be some benefits to being more reproducible and open at NCRR at the organizational, group, and/or individual level (from the trainee to those more established)?
For 1 min, to yourself, think about the question.
For 2 min, discuss with your neighbour what you've thought about.
For 2 min, we'll have a group-wide sharing of some of the thoughts.
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
What might be some barriers to being more reproducible and open at NCRR at the organizational, group, and/or individual level (from knowledge and technical capacity, to what gets supported and what doesn't)?
For 1 min, to yourself, think about the question.
For 2 min, discuss with your neighbour what you've thought about.
For 2 min, we'll have a group-wide sharing of some of the thoughts.
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Do code reviews, pair programming/analysis
Reviewing (pre-analysis)
Discuss with DST to support reproducible/open systems (within law)
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Do code reviews, pair programming/analysis
Reviewing (pre-analysis)
Discuss with DST to support reproducible/open systems (within law)
Publicly link your code with your research output using GitHub and Zenodo
Decide on standard folder and file structure for each project/manuscript
Now that we're getting more into thinking about benefits and barriers, we can get into thinking about potential actions to take to be more reproducible and open. Before getting into brainstorming activities, I want to give you some ideas for actions to help prime your thinking.
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
What are some potential options that you, your group, and NCRR could implement relatively easily and within the short-term (<1 year) to be more open and reproducible?
For 2 min, think about ideas to this question (and write down if you want to)
For 3 min, brainstorm with your neighbour(s) on what you've all thought about
For 2 min, decide on one person to write down some of the ideas in the Mentimeter
For 3 min, we'll briefly go over and discuss some ideas from the Mentimeter
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
What are some potential options that you, your group, and NCRR could implement that are more difficult and within the longer-term (>1 years) to be more open and reproducible?
For 2 min, think about ideas to this question (and write down if you want to)
For 3 min, brainstorm with your neighbour(s) on what you've all thought about
For 2 min, decide on one person to write down some of the ideas in the Mentimeter
For 3 min, we'll briefly go over and discuss some ideas from the Mentimeter
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Introduction course to Reproducible Research in R: r-cubed.rostools.org
Intermediate course to Reproducible Research in R: r-cubed-intermediate.rostools.org
Slides: slides.lwjohnst.com/au/2022-04-25
Licensed under CC-BY
Learning objectives
Main take-home message
Introduce and describe reproducibility and open science
Activities to discuss and brainstorm (~40 min)
Resources
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |