Conducting Open Science Using R and DataONE: A Hands-on Primer (Open Format)
Sunday, August 4, 2013: 12:00 PM-5:00 PM
101B, Minneapolis Convention Center
Matthew B. Jones
The inherent variability in ecological systems has placed a premium on well-designed studies that employ sophisticated analysis and modeling approaches. These analytical approaches are critically important to understanding and verifying the conclusions drawn from such studies. Although students learn how to acquire and analyze data, there has recently been increasing emphasis on doing so in an open manner that allows for full reproducibility of ecological science. In this workshop, we will examine the convergence of open data and open source tools and their ability to jointly facilitate open science, highlighting the use of open software tools for conducting open science in ecology. Through a series of hands-on activities (see http://help.nceas.ucsb.edu/r), participants will explore approaches to accessing data from the DataONE federation from within the R analytical system, and then conduct basic data assessment, summarization, and visualization using those data. We will also introduce participants to various R libraries available from the rOpenSci project for accessing data, and to the use of R for generating data documentation and archiving data in DataONE-compatible data repositories. We are targeting early-career scientists, including students, postdocs, and faculty who would benefit from new techniques for open science, as well as educators who want to incorporate open science concepts into curricula. Participants should have a basic understanding of data analysis and statistics, and preferably basic exposure to the R system. Participants must bring their own laptops to take part in the hands-on activities, and must be able to install new software on them.