PS 30-82 - Implementation of quality assurance and quality control measures in the National Phenology Database

Thursday, August 11, 2016
ESA Exhibit Hall, Ft Lauderdale Convention Center
Katharine L. Gerst1,2, Sara Schaffer1,2, Alyssa H. Rosemartin1,2, Lee Marsh1,2, Ellen G. Denny1,2 and LoriAnne Barnett1,2, (1)School of Natural Resources and the Environment, University of Arizona, Tucson, AZ, (2)National Coordinating Office, USA National Phenology Network, Tucson, AZ
Background/Question/Methods

The USA National Phenology Network (USA-NPN; www.usanpn.org) serves science and society by promoting a broad understanding of plant and animal phenology and the relationships among phenological patterns and environmental change. The National Phenology Database has over 6.6 million observation records for plants and animals for the period 1954-2016. These data have been used in a number of science, conservation and resource management applications, including national assessments of historical and potential future trends in phenology, regional assessments of spatio-temporal variation in organismal activity, and local monitoring for invasive species detection. Customizable data downloads are freely available, and data are accompanied by FGDC-compliant metadata, data-use and data-attribution policies, and vetted documented methodologies and protocols. The National Coordinating Office of USA-NPN is developing a suite of standard data products to facilitate use and application by a diverse set of data users. We have implemented a number of measures to ensure both quality assurance and quality control of these data products.  

Results/Conclusions

Here we describe activities that ensure that incoming data submitted by citizen and professional scientists are reliable; these include training materials, such as a botanical primer and species profiles. We also describe a number of automated quality control processes applied to incoming data streams to optimize data output quality. Existing and planned quality control measures for raw and derived data output include: (1) Validation of site locations, including latitude, longitude, and elevation; (2) Flagging of records that conflict for a given date for an individual plant; (3) Flagging where species occur outside known ranges; (4)  Flagging of records when phenophases occur outside of the plausible order for a species; (5) Flagging of records when intensity measures do not follow a plausible progression for a phenophase; (6) Flagging of records when a phenophase occurs outside of the plausible season, and (7) Quantification of precision and uncertainty for estimation of phenological metrics. Finally, we will describe preliminary work to develop methods for outlier detection that will inform plausibility checks.  Ultimately we aim to maximize data quality of USA-NPN data and data products to ensure that this database can continue to be reliably applied for science and decision-making for multiple scales and applications.