Skip to content
This repository was archived by the owner on Mar 17, 2025. It is now read-only.

Commit 17ee75f

Browse files
committed
Processing container details
1 parent bff8380 commit 17ee75f

File tree

1 file changed

+40
-1
lines changed

1 file changed

+40
-1
lines changed

README.md

Lines changed: 40 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,9 +15,48 @@ ArcGIS Online item url. Changes with addition/removal of features to dataset req
1515
**Example**: `https://services6.arcgis.com/ghjer345tert/arcgis/rest/services/PROD_PHU_Base_Aggregated/FeatureServer/0/query`
1616

1717
**2. `output`**
18-
The filename where the output will be written.
18+
The filename where the output in CSV format will be written.
1919
**Example**: `wsi.csv`
2020

21+
## Processing Container
22+
This container takes the CSV output from the retrieval container, and performs standardization and trend analysis on the data. There are disease target-specific outputs at both the sewershed and region-level. Sewershed weighting is required in order to perform region-level analyses. The container uses the following arguments:
23+
24+
**1. `input`**
25+
CSV file containing at minimum columns: sampleDate, siteName, mN1, mN2, mFluA, mFluB, and mBiomarker. Intention is to use the file that is output from the retrieval container for this.
26+
**Example**: `wsi.csv`
27+
28+
**2. `weights`**
29+
CSV file with columns: Site, and Weight. The site column corresponds to siteName values in the `input`. Weights represents factor used for combing site-specific trends into a single regional trend. Weights are decimal numbers and should sum to 1. The weights may be set to be equal, or correspond to population weighting, sampling frequency, or any other user-determined criteria
30+
**Example**: `weights.csv`
31+
32+
**3. `patch`**
33+
Optional CSV file with columns: Date, Site, and one or more of mN1, mN2, mFluA, mFluB, mBiomarker. Values in the patch file will add or overide any existing values in the primary input file. Useful, for adding historical data not present in WSI, or fixing erroneous data.
34+
**Example**: `patch.csv`
35+
36+
**4. `output_region_covid`**
37+
Optional output location for CSV file containing regional summary for SARS-CoV-2. No output will be generated if left blank.
38+
**Example**: `output_region_covid.csv`
39+
40+
**5. `output_region_flu_a`**
41+
Optional output location for CSV file containing regional summary for Influenza A. No output will be generated if left blank.
42+
**Example**: `output_region_flu_a.csv`
43+
44+
**6. `output_region_flu_b`**
45+
Optional output location for CSV file containing regional summary for Influenza B. No output will be generated if left blank.
46+
**Example**: `output_region_flu_b.csv`
47+
48+
**7. `output_covid`**
49+
Optional output location for CSV file containing site-specific SARS-CoV-2 data. No output will be generated if left blank.
50+
**Example**: `output_covid.csv`
51+
52+
**8. `output_flu_a`**
53+
Optional output location for CSV file containing site-specific Influenza A data. No output will be generated if left blank.
54+
**Example**: `output_flu_a.csv`
55+
56+
**9. `output_flu_b`**
57+
Optional output location for CSV file containing site-specific Influenza B data. No output will be generated if left blank.
58+
**Example**: `output_flu_b.csv`
59+
2160
## Pipeline Orchestration
2261
This data pipeline can be orchestrated by a variety of tools that support containerized components, but has been developed and tested with [Kubeflow Pipelines](https://www.kubeflow.org/), which is based on [Argo Workflows](https://argoproj.github.io/argo-workflows/).
2362

0 commit comments

Comments
 (0)