Enriching Agronomic Experiments with Data Provenance
Reproducibility is a major feature of Science. Even agronomic research of exemplary quality may have irreproducible empirical findings because of random or systematic error. The ability to reproduce agronomic experiments based on statistical data and legacy scripts are not easily achieved. We propose RFlow, a tool that aid researchers to manage, share, and enact the scientific experiments that encapsulate legacy R scripts. RFlow transparently captures provenance of scripts and endows experiments reproducibility. Unlike existing computational approaches, RFlow is non-intrusive, does not require users to change their working way, it wraps agronomic experiments in a scientific workflow system. Our computational experiments show that the tool can collect different types of provenance metadata of real experiments and enrich agronomic data with provenance metadata. This study shows the potential of RFlow to serve as the primary integration platform for legacy R scripts, with implications for other data- and compute-intensive agronomic projects.
Year of publication: |
2017
|
---|---|
Authors: | do Nascimento, Jose Antonio Pires ; Serra da Cruz, Sergio Manuel |
Published in: |
International Journal of Agricultural and Environmental Information Systems (IJAEIS). - IGI Global, ISSN 1947-3206, ZDB-ID 2695927-6. - Vol. 8.2017, 3 (01.07.), p. 21-38
|
Publisher: |
IGI Global |
Subject: | Agrobiology | Data Provenance | Lineage | Pedigree | R System | Scientific Workflows | Scripts | SisGExp | Statistics |
Saved in:
Online Resource
Saved in favorites
Similar items by subject
-
A comparison of classification models to identify the Fragile X Syndrome
Pino-Mejias, Rafael, (2008)
-
A community-oriented workflow reuse and recommendation technique
Zhang, Jia, (2015)
-
Estimating genotyping error rates from parent–offspring dyads
Haaland, Øystein A., (2013)
- More ...