--- title: "Missingness" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{Missingness} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} options(rmarkdown.html_vignette.check_title = FALSE) knitr::opts_chunk$set( warning = FALSE, message = FALSE, collapse = TRUE, comment = "#>" ) ``` ## Run missingness check ```{r setup} library(DrugExposureDiagnostics) library(dplyr) library(DT) # acetaminophen concept id is 1125315 acetaminophen <- 1125315 cdm <- mockDrugExposure() acetaminophen_checks <- executeChecks( cdm = cdm, ingredients = acetaminophen, checks = "missing" ) ``` ## Overall missingness This shows the missingness of the drug records summarised on ingredient level. ```{r} datatable(acetaminophen_checks$missingValuesOverall, rownames = FALSE ) ``` | Column | Description | :------------- | :------------- | ingredient_concept_id | Concept ID of ingredient. | ingredient | Name of drug ingredient. | variable | the variable for which missingness was assessed. | n_records | Number of records for ingredient concept. If n_records is the same as n_sample this means that there are more records but the number was cut at the pre-specified sample number for efficiency reasons. | n_sample | The pre-specified maximum sample. If n_records is smaller than the sample it means that sampling was ignored because the total number of records was already too small. | n_records_not_missing_value | The number of records for which there is no missingness in the variable of interest. | n_records_missing_value | The number of records with missing values for the variable of interest. | proportion_records_missing_value | The proportion of records with missing values for the variable of interest. | result_obscured | TRUE if count has been suppressed due to being below the minimum cell count, otherwise FALSE. | ## Missingness by drug concept This shows the missingness on drug concept level. ```{r} datatable(acetaminophen_checks$missingValuesByConcept, rownames = FALSE ) ``` | Column | Description | :------------- | :------------- | drug_concept_id | ID of the drug concept. | drug | Name of the drug concept. | ingredient_concept_id | Concept ID of ingredient. | ingredient | Name of drug ingredient. | variable | the variable for which missingness was assessed. | n_records | Number of records for drug concept. If n_records is the same as n_sample this means that there are more records but the number was cut at the pre-specified sample number for efficiency reasons. | n_sample | The pre-specified maximum sample. If n_records is smaller than the sample it means that sampling was ignored because the total number of records was already too small. | n_records_not_missing_value | The number of records for which there is no missingness in the variable of interest. | n_records_missing_value | The number of records with missing values for the variable of interest. | proportion_records_missing_value | The proportion of records with missing values for the variable of interest. | result_obscured | TRUE if count has been suppressed due to being below the minimum cell count, otherwise FALSE. |