Reproducible Research at Vanderbilt

 

In the Vanderbilt Department of Biostatistics, reproducible statistical analysis and reporting is of the utmost importance. The goals of reproducible analysis and reporting are

  1. To combine programming code with report text in a single self-documenting file
  2. To document the code and its results, including interpretation of the results
  3. To allow an analysis to be re-executed and the resulting report be re-typesetted by running a single command
  4. To avoid all transcription of calculations and tables into text

Besides improving accuracy of reports, reproducible analysis saves time and allows a single command to be run whenever the data or the analysis changes. It also solves the problem of updating analyses after a long pause in the project, in which case the analyst frequently forgets the needed sequence of steps.

Biomedical researchers have praised the comprehensive and beautiful PDF reports we provide them. Our PDF reports are generated with R's Sweave tools, using the R language for computation. The reports combine text, plots, and code. This provides maximum information for the researchers and greatly facilitates their manuscript writing process.

All new MS biostatisticians at Vanderbilt receive training in R's Sweave and other reproducible reporting tools. We routinely use the following tools of reproducible research.

  1. Sweave - an extremely flexible tool for producing the typical statistical report using R and LaTeX
  2. R tools for creating LaTeX markup code for advanced statistical tables optionally containing results of statistical tests
  3. rreport - a package of extremely high-level R functions for clinical trial reporting using R and LaTeX. The reports produced by rreport contained advanced tables and are very graphical. rreport is being improved to make greater use of micrographics inside tables. rreport has been used in production mode in three pharmaceutical industry-sponsored clinical trials. rreport is used for recurring reports where more control is needed by the analyst than that provided by Sweave.

Read more about reproducible research and reporting at Vanderbilt here.

 

Home