-
Notifications
You must be signed in to change notification settings - Fork 2
/
publication--04--16s_database_comparision.Rmd
78 lines (57 loc) · 2.44 KB
/
publication--04--16s_database_comparision.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
---
title: "Comparing 16S databases"
bibliography: bibliography.bibtex
---
```{r init, echo = FALSE, message = FALSE}
library(knitr)
if (! is.null(current_input())) { # if knitr is being used
knitr::read_chunk("settings.R")
} else {
source("settings.R")
}
```
```{r rendering_settings, include=FALSE}
```
## Requirements
**NOTE:** This analysis requires at least 10Gb of RAM to run.
It uses large files not included in the repository and many steps can take a few minutes to run.
## Analysis input/output
```{r io_settings}
```
## Run individual analyses
This analysis just compiles the results from the three seperate analyses corresponding to the three reference databases being compared: RDP [@maidak2001rdp], SILVA [@quast2012silva], and Greengenes [@desantis2006greengenes].
The code below will run those three analyses, but it is not needed if they have been already run independently and it is not run during the rendering of this page.
```{r run_all, eval = FALSE}
library(rmarkdown)
render(input = "publication--01--silva.Rmd")
render(input = "publication--02--rdp.Rmd")
render(input = "publication--03--greengenes.Rmd")
```
## Load plots
After the three analyses have been completed, I loaded their results.
```{r load, cache = TRUE}
load(file.path(output_folder, "silva_data.RData"))
load(file.path(output_folder, "rdp_data.RData"))
load(file.path(output_folder, "greengenes_data.RData"))
```
## Combine plots
I then combine plots from the three analyses into a single graph and save the result.
```{r combine, fig.width = 7.5, fig.height = 10, cache = TRUE}
library(gridExtra)
library(grid)
library(metacoder)
combo_plot <- grid.arrange(ncol = 2, nrow = 3,
top = "Whole database Not amplified ",
left = "Greengenes RDP SILVA",
silva_plot_all, silva_plot_pcr_fail,
rdp_plot_all, rdp_plot_pcr_fail,
greengenes_plot_all, greengenes_plot_pcr_fail)
output_path <- file.path(output_folder, "figure_2--16s_database_comparison.pdf")
ggplot2::ggsave(output_path, combo_plot, width = 7.5, height = 10)
file.copy(output_path, "publication/revision_1/figure_4.pdf")
```
## Software and packages used
```{r, cache = TRUE}
sessionInfo()
```
## References