State tagging application for environmental data quality assurance
This R application is an implementation of state tagging approach for improved quality assurance of environmental data. The application returns state-dependent prediction intervals on input data. The states are determined based on clustering of auxiliary inputs (such as meteorological data) made on the same day. The method provides contextual information to assess the quality of observational data and is applicable to any point-based, daily time series observational data.
To use this application, the user will need to input two separate csv files: one for state variables and the other for observations.
This work was supported by the Natural Environment Research Council award number NE/R016429/1 as part of the UK-SCAPE programme delivering National Capability. Full details about this application can be found at https://doi.org/10.5285/1de712d3-081e-4b44-b880-b6a1ebf9fcd8
Default
Identification info
- Metadata Language
- English (en)
- Character set
- utf8
- Dataset Reference Date ()
- 2020-04-22
- Dataset Reference Date ()
- 2020-04-20
- Identifier
- doi: / 10.5285/1de712d3-081e-4b44-b880-b6a1ebf9fcd8
- Other citation details
- Tso, C.-H.M. (2020). State tagging application for environmental data quality assurance. NERC Environmental Information Data Centre 10.5285/1de712d3-081e-4b44-b880-b6a1ebf9fcd8
- Keywords
-
- Modelling
- cluster analysis
- Environmental informatics
- Environmental Monitoring
- Data science
- Quality assurance
- Data analytics
- R shiny
- UK-SCAPE
- Limitations on Public Access
- otherRestrictions
- Other constraints
- no limitations
- Use constraints
- otherRestrictions
- Use constraints
- otherRestrictions
- Other constraints
- If you reuse this data, you should cite: Tso, C.-H.M. (2020). State tagging application for environmental data quality assurance. NERC Environmental Information Data Centre https://doi.org/10.5285/1de712d3-081e-4b44-b880-b6a1ebf9fcd8
- Topic category
-
- Environment
- Geoscientific information
Distribution Information
- Data format
-
-
R
()
-
R
()
- Resource Locator
-
Download the data
Download a copy of this data
- Resource Locator
-
Supporting information
Supporting information available to assist in re-use of this dataset
- Resource Locator
- Screenshot
- Resource Locator
-
Download the data
Download a copy of this data
- Resource Locator
-
Supporting information
Supporting information available to assist in re-use of this dataset
- Resource Locator
- Screenshot
- Quality Scope
- application
- Other
- application
Report
- Dataset Reference Date ()
- 2010-12-08
- Statement
- This application comprises the source code of a R shiny app (app.R) that implements the method described in https://doi.org/10.3389/fenvs.2020.00046. It can be run on any machine with R and the required packages installed. The clustering is based on the k means function in the {stats} package in R. It has been tested up to R version 3.5.3. More testing info can be found in session_info.txt.
Metadata
- File identifier
- 1de712d3-081e-4b44-b880-b6a1ebf9fcd8 XML
- Metadata Language
- English (en)
- Character set
- ISO/IEC 8859-1 (also known as Latin 1)
- Resource type
- application
- Hierarchy level name
- application
- Metadata Date
- 2023-02-09T11:09:43
- Metadata standard name
- UK GEMINI
- Metadata standard version
- 2.3