“Data! data! data!” he cried impatiently. “I can’t make bricks without clay.”

Sherlock Holmes (PBS Series, 2010-2017)

Reproducibility and Replicability in Science (2019)

 The report provides recommendations to researchers, academic institutions, journals, and funders on steps they can take to improve reproducibility and replicability in science.

Conceptualizing a US Research Software Sustainability Institute 


 URSSI mission

To improve the quality, usefulness, and sustainability of research software by improving practices, and increasing diversity of practitioners

rOpenSci fosters a culture that values open and reproducible research using shared data and reusable software.

Open Source Software

280+ software packages to support data science. e.g. spatial data, biodiversity informatics & climate change, glue for workflows.

pip install <software>
pip install data?

How do we make research data more easily accessible alongside compute?

The extent to which software would actually build with reasonable effort was quite low

Collberg et al 2014​

< 20%

Code + Computing environment + Data

Minimally reproducible computational research

 Isolate your computing environment

It's important to isolate the computing environment so that changes in software dependencies don't break your analysis.



Binder is an open source project that is designed to make it really easy to share analyses that are in notebooks.




Thank you for creating this. I've been wanting to try Binder for awhile and holepunch makes it a lot more accessible to me.

Without ever having committed or pushed anything to GitHub before, I was able to get your binder-test example working here.