-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Develop a plan for GDAC QC #380
Comments
The Near-Term goal has been implemented in the GDAC: About implementing or repairing existing required and/or strongly recommended QARTOD tests: :: Five geophysical datasets are quality-controlled using the IOOS-QC QARTOD modules. :: Five test functions are used to perform quality control on the geophysical variables. :: The QARTOD location test is customized to align with the parameters available in the gliders' profile files. The two variables used in the location test are: About using global thresholds for all GDAC data. :: The thresholds for the spike test and the rate of change test are updated during the QC process to align with the datasets' ranges. About publishing aggregate QC flags (qc_agg). |
The main Mid-term goal has been implemented in the GDAC: MAIN :: The IOOS-QC module's functions were developed based on the QARTOD manuals and are used by GDAC to generate data quality flags. Links to the test functions used to generate the flags are listed here: Gross Range Test About using standards to understand existing tests. :: A [configuration file] (https://github.com/ioos/glider-dac/blob/main/data/qc_config.yml) is used to set thresholds as function arguments for the GDAC QC tests. Consult the list of links below for more information.
About identifying additional real-time tests. :: The GDAC has developed new functions or tests to:
:: The GDAC has implemented the dac_qc_comment
ADDITIONAL :: Add regional refinement of thresholds: This is an additional level of quality control that requires further information before implementation. For example, a table with thresholds by region needs to be created. :: Add dataset quality control dependency: This is another level of quality control that requires checking the flags of interdependent datasets and applying an attribute to reject or accept a dataset. This is mostly for the GTS application (see issue #391). |
About the long-term goal:
|
What can be improved? What is missing?
FY24 GDAC SOW:
Per IOOS Certification requirements, data served via the GDAC must be quality controlled, whether by routines applied by the provider or by the GDAC. The GDAC must apply QARTOD to variables that have existing QARTOD manuals (e.g. Manual for QC of Glider Temperature and Salinity Data). Flags must be published in the data files following IOOS Metadata standards (ioos.github.io).
The GDAC Team shall develop a plan for full implementation of QC in the GDAC using the following tiered approach:
Near-term: implement or repair existing Required and Strongly Recommended QARTOD tests, using global thresholds for all GDAC data, and publish aggregate QC flags (qc_agg).
Mid-term: Use QARTOD manuals and standards to understand and identify the potential for improvements to existing tests (e.g. regional refinement of thresholds, additional real-time tests, etc).
Long-term: Engage a scientific working group, possibly under UG2, to investigate the potential for delayed-mode quality control and/or to gain feedback on additional QC considerations.
Document an implementation plan that addresses the above 3 aspects of QC, which includes what we're doing now, gaps that remain, and what we intend to do, for both real-time and delayed mode datasets. Consider this plan to be a proposal that we'd run by the community.
Identify a forum to document this plan. I highly discourage Word documents, since those aren't interactive. Options are the GitHub wiki (I can activate this), some type of Slack feature (?), the GitHub Discussions option...something else?
The text was updated successfully, but these errors were encountered: