Tools for Reproducible Research
This presentation covers the topic "Tools for reproducible research". One of the key principles of proper scientific procedure is the act of repeating an experiment or analysis and being able to reach similar conclusions. Published research based on computational analysis, e.g. bioinformatics or computational biology, have often suffered from incomplete method descriptions (e.g. list of used software versions); unavailable raw data; and incomplete, undocumented and/or unavailable code. This essentially prevents any possibility of attempting to reproduce the results of such studies. The term “reproducible research” has been used to describe the idea that a scientific publication based on computational analysis should be distributed along with all the raw data and metadata used in the study, all the code and/or computational notebooks needed to produce results from the raw data, and the computational environment or a complete description thereof.
Reproducible research not only leads to proper scientific conduct but also provides other researchers the access to build upon previous work. Most importantly, the person setting up a reproducible research project will quickly realize the immediate personal benefits: an organized and structured way of working. The person that most often has to reproduce your own analysis is your future self!
This presentation was presented at the recurring session "Meet a Data Steward" on the 21st of Sep 2023. These sessions are organized by SciLifeLab and are open to all researchers affiliated with a Swedish university or institute. Each event starts with a 15 minutes mini-lecture, and 45 minutes of free Q & A. Here you can bring your own research question, discuss data sharing, data repositories, data management plans, compute and storage or other data-related issues with data stewards and other experts from SciLifeLab Data Centre and NBIS (National Bioinformatics Infrastructure Sweden).