*Add `ds_*` use case

## Use Case  



The most commonly seen medications with a non-null `administration_dose` field 
are first derived from the Drug Exposures table. 

```sql
WITH ct AS (
SELECT de.drug_concept_id, COUNT(de.drug_concept_id) AS drug_concept_count
FROM omop_cdm_1.drug_exposure de 
WHERE de.administration_dose IS NOT NULL
GROUP BY de.drug_concept_id 
ORDER BY COUNT(de.drug_concept_id) DESC
)
SELECT ct.drug_concept_count, c.*
FROM ct 
LEFT JOIN omop_vocabulary.concept c
ON c.concept_id = ct.drug_concept_id
;
```












```{r,echo=FALSE,results='hide'}
drug_concept_count <-
pg13::query(
  conn = conn, 
  sql_statement = 
  "
   WITH ct AS (
  SELECT de.drug_concept_id, COUNT(de.drug_concept_id) AS drug_concept_count
  FROM omop_cdm_1.drug_exposure de 
  WHERE de.administration_dose IS NOT NULL
  GROUP BY de.drug_concept_id 
  ORDER BY COUNT(de.drug_concept_id) DESC
  )
  SELECT ct.drug_concept_count, c.*
  FROM ct 
  LEFT JOIN omop_vocabulary.concept c
  ON c.concept_id = ct.drug_concept_id
  ;
  "
)
```

```{r}
drug_concept_count
```
For testing, the top 10 most frequently seen drugs in the Drug Exposure table 
are filtered. 

```{r}
drug_concept <-
  drug_concept_count %>%
  slice(1:10)
drug_concept
```
The top 10 drugs are joined back with the Drug Exposures table to retrieve the 
`administration_dose`, `administration_unit`, and `frequency_concept_id` fields. 

```sql
SELECT DISTINCT 
  a.*, b.administration_dose,b.administration_unit,b.frequency_concept_id 
FROM @drug_concept a 
LEFT JOIN omop_cdm_1.drug_exposure b 
ON a.concept_id = b.drug_concept_id
```

```{r,echo=FALSE,cache=TRUE,results='hide'}
drug_record <-
pg13::join1(
  conn = conn, 
  write_schema = "patelm9",
  data = drug_concept, 
  column = "concept_id",
  select_join_on_fields = c("administration_dose", 
                            "administration_unit"),
  join_on_schema = "omop_cdm_1",
  join_on_table = "drug_exposure",
  join_on_column = "drug_concept_id",
  distinct = TRUE
)
```

```{r}
drug_record
```

For easier visualization, the formatting of the concept attributes are merged 
into a single `drug` string, with the `concept_id` field now called the `drug_id`.  

```{r}
drug_record2 <-
  drug_record %>%
  chariot::merge_strip(into = "drug")
drug_record2
```

This dataset is then joined to the `Drug Strength Staged` table to get the staged 
`value` and `unit` fields for each drug. 

```sql
SELECT a.*, b.ingredient_concept_id,b.value,b.unit 
FROM @drug_record2 a 
LEFT JOIN patelm9.drug_strength_staged b 
ON a.drug_id = b.drug_concept_id;
```

```{r,echo=FALSE,results='hide',cache=TRUE}
drug_strength_record <-
  pg13::join1(
    conn = conn, 
    write_schema = "patelm9",
    data = drug_record2,
    column = "drug_id",
    select_join_on_fields = c("ingredient_concept_id",
                              "value",
                              "unit"),
    join_on_schema = "patelm9",
    join_on_table = "drug_strength_staged",
    join_on_column = "drug_concept_id"
  ) 
```

The resulting table tells the story of the drug exposure for a given record. The 
dose of the drug at each administration, the units of administration, the corresponding 
`ingredient_concept_id` from the Drug Strength table, and the staged `value` and `unit` 
corresponding to the amount of the ingredient in 1 unit of the drug. 

```{r}
drug_strength_record %>%
  select(drug_id, administration_dose, administration_unit, ingredient_concept_id, value, unit)
```

The `value` field requires evaluation as a numeric value, which would require looping
over almost 40,000 rows. Instead, each unique `value` is isolated, resulting in 9 
rows. These 9 values are mapped to their corresponding numeric value. 

```{r}
values <-
  drug_strength_record %>%
  select(value) %>%
  distinct()
values
```

```{r}
values$numeric_value <- sapply(values$value, function(x) eval(rlang::parse_expr(x)))
values
```

The resulting dataset is joined back with the original data.  

```{r}
drug_strength_record2 <- 
drug_strength_record %>%
  left_join(values, by = "value")
drug_strength_record2
```



```{r,include=FALSE,cache=FALSE}
fantasia::dcOMOP(conn = conn)
```



## Themes    

Themes can be viewed at: https://bootswatch.com/3/.  


## Syntax Highlighting  

Syntax Highlighting Styles can be viewed at https://www.garrickadenbuie.com/blog/pandoc-syntax-highlighting-examples/.  


## Dataframe  

Dataframe printing options include `default`, `kable`, `tibble`, or `paged`.  

For paged dataframes, the chunk options include:  

* max.print: the number of rows to print  
* rows.print: the number of rows to display    
* cols.print: the number of columns to print  
* cols.min.print: the minimum number of columns to display  
* pages.print: the number of pages to display under page navigation   
* paged.print: when set to `FALSE` turns off paged display for the chunk    
* rownames.print: when set to `FALSE` turns off row names for the chunk  

## Figure Captions

```{r,fig.align="center", fig.width=6, fig.height=6, fig.cap="Figure: Here is a really important caption."}
library(tidyverse)
mpg %>%
  ggplot( aes(x=reorder(class, hwy), y=hwy, fill=class)) + 
    geom_boxplot() +
    xlab("class") +
    theme(legend.position="none")
```


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `ds_` use case #25

Use Case

Themes

Syntax Highlighting

Dataframe

Figure Captions

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

*Add ds_* use case #25

Description

Use Case

Themes

Syntax Highlighting

Dataframe

Figure Captions

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions

Add `ds_` use case #25