--- title: "linktree" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{linktree} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, echo = TRUE, warning = FALSE, message = FALSE, dpi=400, comment = "#>", #fig.path = "man/figures/README-", out.width = "100%", fig.align = "center" ) ``` # linktree The goal of `linktree` is to estimate the *group transmission assortativity coefficient*. Details about the method are published [here](https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0313037). ## Installation ```{r, eval = FALSE} # install.packages("pak") pak::pak("CyGei/linktree") library(linktree) ``` ```{r, eval = TRUE, echo= FALSE} library(linktree) ``` ## Transmission assortativity **Group transmission assortativity quantifies the tendency of individuals to transmit within their own group relative to other groups**. It is defined as the relative increase, or decrease, in the probability of within-group transmission compared to what would be expected under random transmission. Random transmission expectations are calculated based on the group's relative size, defined as proportion of the total susceptible population that belongs to the group. The metric characterises transmission patterns, reflecting both contact patterns and group-specific differences in susceptibility or infectiousness. ### Gamma $\gamma$ ranges from $0$ to $\infty$, where: - $\gamma_A = 1$ represents **homogeneous** (random) transmission:\ Individuals in group *A* are equally likely to transmit in *A* as in other groups. - $\gamma_A$ \>1 indicates **assortative** transmission:\ Individuals in group *A* are more likely to transmit in *A* than in other groups. - $\gamma_A < 1$ indicates disassortative transmission:\ Individuals in group *A* are less likely to transmit in *A* than in other groups. For example, if $\gamma_A = 2$, individuals in group *A* are **twice** more likely to transmit within group *A* compared to other groups. Conversely, if $\gamma_A = 1/2$, individuals in group *A* are **half** as likely to transmit within group *A* compared to other groups. ![](images/gamma.png) ### Delta To simplify interpretation, we introduce a **rescaled** parameter $\delta$, ranging between $-1$ (fully disassortative) and $1$ (fully assortative), with 0 indicating homogeneous patterns. ![](images/delta.png) We can visualise the relationship between $\gamma$ and $\delta$: ```{r, fig.width=8, fig.height=6} delta <- seq(-1, 1, 0.1) gamma <- delta2gamma(delta) plot(gamma, delta, type = "l") points(1, 0, col = "red", pch = 16) # Homogeneous ``` ### Estimation `get_gamma` and `get_delta` estimate $\gamma$ and $\delta$ from a transmission tree. It requires the following inputs: - `from`: a vector of infector's group (*e.g.* age group, sex, vaccination status, etc.). - `to`: a vector of infectee's group. - `f`: a named vector of the groups' sizes or relative sizes (*i.e.* proportion of the susceptible population in each group). ## Example ### Data We can simulate an outbreak with multiple groups using the [`o2groups`](https://github.com/CyGei/o2groups) package and then use `linktree` to estimate the assortativity coefficients from the transmission tree. Below is a pre-loaded simulated transmission tree, see `?sim_tree` for more details. ```{r} #?sim_tree true_gammas <- c(HCW = 2, patient = 1/1.25) sizes <- c(HCW = 100, patient = 350) # data(sim_tree) sim_tree <- read.csv("https://raw.githubusercontent.com/CyGei/linktree/refs/heads/main/vignettes/sim_tree.csv") head(sim_tree) ``` ### Visualise the transmission tree ```{r, message=FALSE, warning=FALSE} library(epicontacts) x <- epicontacts::make_epicontacts( linelist = sim_tree[, c("group", "id", "date_infection", "date_onset")], contacts = sim_tree[, c("source", "id", "source_group", "group")], id = "id", from = "source", to = "id", directed = TRUE ) plot(x, method = "visNetwork", label = FALSE, node_color = "group", node_shape = "group", shapes = c(patient = "bed", HCW = "user-md"), edge_color = "source_group", edge_width = 1, node_size = 20, col_pal = epicontacts::spectral, edge_col_pal = epicontacts::spectral ) ``` ### Estimate the transmission assortativity coefficient Our published [paper](https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0313037) provides guidelines as to when and under which circumstances we can reliably infer group transmission assortativity. Notably, we should analyse transmission chains up to the group's epidemic peak to avoid the saturation effect, and we should have at least \~30 cases in the group. ```{r} est_delta <- get_delta( from = sim_tree$source_group, to = sim_tree$group, f = sizes, ) true_deltas <- gamma2delta(true_gammas) plot(est_delta, main = "Estimated vs True assortativity coefficients") abline(h = true_deltas, col = c("#d53e4f", "#3288bd"), lty = 2) legend("topright", legend = c("HCW (truth)", "patient (truth)"), fill = c("#d53e4f", "#3288bd"), bty = "n") ``` In this example, we estimate that patients are 1.25 times less likely to transmit to other patients, while HCWs are 1.34 times more likely to transmit to other HCWs (although none are statistically significant).