3 ENACT
3.1 summarizing.R
R script used to extract the positive cases and generate the summary statistics for the study population. The user need to manually enter the site name, select the input data folder and output folder created from data preparation step.
- Input:
- site (which should be your site name)
- data folder (where save the cases.csv and dems_cases.csv)
- output directory
- cohort (using the default cases)
- Output:
- cases_map_CCSR_site.csv
- cov_pats.RData
- cases_race_stat_site.csv
- cases_dems_stat_site.csv
- cases_eth_stat_site.csv
3.3 cases_incidences.R
This R script creates incident-level data from patient encounters for COVID infections. The rule is to cluster infections dates and recognize an infection if a cluster is 90 days or longer apart from another.
- Input:
- site (which should be your site name)
- cov_pats.RData (which is from the summarizing.R and should be saved under data folder)
- output directory
- Output:
- cov_pats.RData (which should be saved under output folder)
- site_cov_pats_summary.RData
3.4 Ref_DxX.R
This R script implements WHO definition of long COVID using the reference J thresholds from the MGB study.
- Input:
- site (which should be your site name)
- cov_pats.RData (which is from the cases_incidences.R and should be saved under output folder)
- cases_map_CCSR_site.csv (which is from summarizing.R and should be saved under data folder)
- output directory
- ref_corrs.RData (provided in the docker container)
- ref_J_thresholds.RData (provided in the docker container)
- ref_J.RData (provided in the docker container)
- ref_phenxlookup.RData (provided in the docker container)
- Output:
- longCOVID_patients_site_ref_thresholds0.05.csv (which is the final result)
- longCOVID_summary_site_ref_thresholds0.05.csv
- db_longhauler_chunk_x.RData
- patlookup.RData
- phenxlookup_site.RData